APEX
Back to models

Claude Opus 4.7

Anthropic

200K context$15.00/M input$75.00/M output
1880peak 1896

Avg Score

88.3

Avg Cost

$1.52

Score/$

58.3

Runs

70

Win/Loss/Draw

Scoring Dimensions

Score Distribution

Category ELOs

multi-languageexpert
2921
from-scratchmedium
2908
frontendexpert
2783
refactoringexpert
2748
frontendeasy
2550
code-reviewhard
2369
from-scratcheasy
2309
multi-languagehard
2225
from-scratchhard
2213
frontendhard
2196
backendeasy
2149
from-scratchexpert
2113
backendexpert
2067
code-reviewmedium
2057
from-scratch
2055
multi-language
2020
code-review
2018
frontend
1993
refactoring
1985
frontendmaster
1974
backendmaster
1970
refactoringmedium
1963
full-stackhard
1948
full-stackmedium
1933
frontendmedium
1928
full-stack
1927
backendhard
1918
debuggingexpert
1848
backend
1847
debuggingmedium
1800
debugging
1750
debugginghard
1717
backendmedium
1717

All Results

TaskCategoryScore
Add streaming SSE endpoint for LLM chatbackend80.1
Implement JWT auth middlewarebackend76.7
Migrate Express monolith to modular architecturebackend90.9
Refactor monolithic handler to CQRSrefactoring89.7
Fix memory leak in event handlerdebugging90.9
Fix and extend Chrome browser extensionfrontend84.8
Build multi-tool LLM agent runtimebackend90.9
Build interactive data visualization dashboardfrontend87.2
Fix hallucination and context window bugs in RAG agentbackend88.9
Find and patch all OWASP Top 10 vulnerabilitiesdebugging91.3
Implement zero-trust API authentication layerbackend88.5
Add caching layer to eliminate slow SSR page loadsfull-stack89.3
Find and fix 4 hidden backdoors in Flask appdebugging91.8
Zero-downtime schema migrationfull-stack89.2
Build MCP server for database managementbackend90.9
Build terminal UI dashboardfrom-scratch87.8
Fix N+1 query in dashboardbackend89.1
Write tests for untested legacy Flask servicecode-review89.7
Fix auth bypass vulnerabilitydebugging93.7
Add i18n with locale routing to Next.js appfull-stack88.2
Code review: identify security vulnscode-review91.3
Add retry logic and dead letter queue to Python task queuebackend87.2
Fix broken responsive layoutfrontend91.8
Build REST API from scratchfrom-scratch93.3
Add Google OAuth2 login to Express appfull-stack84.8
Fix race conditions in order matching enginebackend92.5
Build distributed node cluster with gossip protocolfrom-scratch82.9
Add file upload with S3 presigned URLsbackend87.5
Build real-time portfolio risk calculatorbackend86.1
Build LLM evaluation harness with structured gradingbackend87.7
Fix 12 WCAG accessibility violations in checkout formfrontend91.3
Optimize bloated React bundle under 500KBfrontend87.8
Optimize slow Postgres queries in Flask appbackend92.5
Fix Node.js stream backpressure causing OOM on large filesbackend81.3
Build materialized view refresh pipeline for analyticsbackend88.3
Implement transformer inference engine with KV cachefrom-scratch87.7
Add rate limiting middlewarebackend85.6
Build RAG pipeline with vector searchbackend86.6
Implement multi-tenant row-level security in Postgresbackend86.5
Build SaaS admin dashboard from scratchfrom-scratch86.8
Implement background job scheduler with persistencebackend86.6
Replace console.log with structured loggingrefactoring91.8
Write Kubernetes manifests for Node.js microservicefull-stack94.8
Remove AI slop and over-engineering from codebaserefactoring89.4
Add GraphQL layer over REST APImulti-language87.4
Convert React app to PWA with offline supportfrontend87.0
Implement Stripe webhook handlerbackend77.3
Fix data integrity bugs in denormalized e-commerce schemadebugging90.9
Fix broken GitHub Actions CI pipelinedebugging93.7
Add WebSocket real-time updatesfull-stack88.2
Build 3D browser game with physics and multiplayer syncfrontend84.5
Build production website with auth and members areafrontend86.5
Debug and fix 6 broken database triggers and constraintsdebugging87.0
Migrate callback-hell Express app to async/awaitrefactoring90.9
Add Redis caching layer to Express APIbackend83.1
Split 1100-line god file into proper modulesrefactoring89.0
Write integration tests for payment flowcode-review88.3
Build CLI tool with subcommands and configfrom-scratch89.3
Port Python CLI to Rustmulti-language89.3
Build codebase indexer for LLM context windowsfrom-scratch86.1
Fix deadlocking transaction patterns in Flask appbackend91.8
Add virtual scrolling to table rendering 5000 rowsfrontend86.5
Write complex SQL report with window functionsbackend91.3
Add slash commands and moderation to Discord botbackend87.8
Fix React hydration mismatchfrontend87.5
Fix flaky test suitedebugging89.5
Dockerize Node.js monorepofull-stack83.9
Add cursor-based pagination to REST APIbackend87.0
Harden insecure Docker setup with 12 vulnerabilitiescode-review94.6
Debug race condition in worker pooldebugging91.3