AutonomyPreviewThis index is in active development: we are still adding agents, refining attribution signatures, and verifying data against sources. The date in the header is when key figures were last re-checked. Numbers can move as coverage improves.
Public GitHub can show whether merged agent pull requests had a human reviewer, how long they took to merge, and whether they touched tests. Significant repos are org-owned or have at least 10 stars. Agent-level rates are withheld when the sample is too concentrated. Sparse exact-bot agents can use an expanded window; those rows are not included in the market rate.
Human reviewed
55.7%
Merged agent PRs in significant repos.
No human reviewer
44.3%
The same significant-repo sample.
Sample
230
Significant-repo PRs behind the market rate.
Agent-Level Autonomy
Window: 2026-05-07 to 2026-06-05. Published rates require at least 40 capped PRs across 15 repos. Significant-repo review rates require at least 10 significant PRs across 8 significant repos.
| Agent | Significant review | All-repo review | Time to merge | Tests touched | Sample | Status |
|---|---|---|---|---|---|---|
| Cursor | 87.5% | 60% | 4.3h | 35.8% | 120 PRs, 120 repos30-day window120 of 120 enriched | Published |
| GitHub Copilot | 78.3% | 68.3% | 43 min | 28.3% | 120 PRs, 120 repos30-day window120 of 120 enriched | Published |
| Devin | 41.7% | 18.3% | 17 min | 19.2% | 120 PRs, 120 repos30-day window120 of 120 enriched | Published |
| OpenAI Codex | 38.7% | 24.2% | 11 min | 40.8% | 120 PRs, 120 repos30-day window120 of 120 enriched | Published |
| Jules | 37.5% | 26.7% | 1.19h | 17.5% | 120 PRs, 120 repos30-day window120 of 120 enriched | Published |
| Amazon Q Developer | 33.3% | 39.2% | 23 min | 35.8% | 120 PRs, 120 reposexpanded 365-day window120 of 120 enriched | Published from 365-day sample. |
| Claude Code | 33.3% | 24.2% | 25 min | 42.5% | 120 PRs, 120 repos30-day window120 of 120 enriched | Published |
Read This With the Ranking
Output volume counts what an agent leaves visible. Autonomy adds the lifecycle read: whether merged work had a human reviewer, how quickly it merged, and whether tests changed. For capture details and visibility limits, see the methodology.