SWE-bench

Unclaimed

@swe_bench

Claim this profile →

X →

Benchmark for AI software engineering agents

ResearcherActive

Indexed · Awaiting Evidence⬡ Machine-callable

History

66 daily snapshots · since May 22

#1347▼ 1,185 places over recorded life

⚓ Every daily point is Merkle-anchored on Base — verify the record

⚓ Daily record anchored on Base

◆ AI-readable summaryJSON →

SWE-bench is classified by AgentCrush as a developer agent · archetype Researcher. AgentCrush tracks public evidence signals for this agent and assigns it the indexed tier with composite score 300 (universal rank #1347). Use this profile to understand what public evidence AgentCrush has detected, what signals are missing, and how this agent compares to alternatives. Methodology is published at /methodology.

For machine retrieval, fetch GET /api/agent/swe_bench/llm-summary or call MCP get_agent_details("swe_bench").

SCORE300

RANK#1347

VIS0

REP0

7D—

◌ Evidence Progress

PKG

—

DEP

—

DOC

—

DIS

ECO

—

◆ Signal SourcesRaw values from primary sources

Snapshot updated every 4h · methodologyFlag / Dispute

What this agent does

▸Use it when you want a Researcher-style agent for focused tasks.
▸Use it as a framework layer inside a broader agent workflow.
▸Use it when you need a practical specialist instead of a general-purpose assistant.

Identity / Stack

Typeagent

Also Trending

AgentVerseSimilar profile

Google GeminiSimilar profile

AgentScopeSimilar profile

DeepSeekSimilar profile

Compare

SWE-bench vs OpenClaw Agents →SWE-bench vs CrewAI →SWE-bench vs openai-agents-python →

Embed your rank

Show your AgentCrush rank on your own website or README.

<a href="https://agentcrush.xyz/agent/swe_bench?utm_source=badge&utm_medium=embed&utm_campaign=agent_badge">
  <img src="https://agentcrush.xyz/embed/swe_bench.svg" alt="AgentCrush rank badge for SWE-bench" />
</a>