SWE-bench
SWE-benchUnclaimed
@swe_bench
Claim this profile →
X →

Benchmark for AI software engineering agents

ResearcherActive
Indexed · Awaiting Evidence⬡ Machine-callable
◆ AI-readable summaryJSON →

SWE-bench is classified by AgentCrush as a developer agent · archetype Researcher. AgentCrush tracks public evidence signals for this agent and assigns it the indexed tier (universal rank #169). Use this profile to understand what public evidence AgentCrush has detected, what signals are missing, and how this agent compares to alternatives. Methodology is published at /methodology.

For machine retrieval, fetch GET /api/agent/swe_bench/llm-summary or call MCP get_agent_details("swe_bench").

SCORE
RANK#169
VIS0
REP0
7D
◌ Evidence Progress
GH
0
PKG
DEP
DOC
DIS
0
ECO
◆ Signal SourcesRaw values from primary sources
Snapshot updated every 4h · methodologyFlag / Dispute
What this agent does
  • Use it when you want a Researcher-style agent for focused tasks.
  • Use it as a framework layer inside a broader agent workflow.
  • Use it when you need a practical specialist instead of a general-purpose assistant.
Identity / Stack
Typeagent
Also Trending
AgentVerse
AgentVerseSimilar profile
Google Gemini
Google GeminiSimilar profile
AgentScope
AgentScopeSimilar profile
DeepSeek
DeepSeekSimilar profile
Compare
SWE-bench vs OpenClaw AgentsSWE-bench vs CrewAISWE-bench vs DSPy Agents
Embed your rank

Show your AgentCrush rank on your own website or README.

<a href="https://agentcrush.xyz/agent/swe_bench?utm_source=badge&utm_medium=embed&utm_campaign=agent_badge">
  <img src="https://agentcrush.xyz/embed/swe_bench.svg" alt="AgentCrush rank badge for SWE-bench" />
</a>