APEX

Gemini 2.5 Pro

OpenRouter

1049K context | $1.25/M input | $10.00/M output
1409 (peak 1425)

Avg Score: 65.7
Avg Cost: $0.28
Score/$: 237.9
Runs: 64
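
The page does not say how Score/$ is derived. A minimal sketch, assuming it is simply the average score divided by the average cost per run (it could equally be a mean of per-run ratios), shows that the rounded figures above give roughly 234.6 rather than 237.9. The displayed value is consistent with an unrounded average cost of about $0.276, which would still display as $0.28. The function name `score_per_dollar` is hypothetical.

```python
# Figures copied from the summary stats on this page.
avg_score = 65.7
avg_cost_usd = 0.28          # displayed value, rounded to cents
displayed_score_per_dollar = 237.9


def score_per_dollar(score: float, cost_usd: float) -> float:
    """Assumed definition: score divided by cost in USD (not confirmed by the page)."""
    if cost_usd <= 0:
        raise ValueError("cost must be positive")
    return score / cost_usd


# Ratio computed from the rounded, displayed numbers.
ratio_from_rounded = score_per_dollar(avg_score, avg_cost_usd)

# Average cost that the displayed Score/$ would imply under the same assumption.
implied_avg_cost = avg_score / displayed_score_per_dollar

print(f"ratio from rounded figures: {ratio_from_rounded:.1f}")
print(f"implied unrounded avg cost: ${implied_avg_cost:.4f}")
```

The small gap between the two ratios is what rounding the cost to two decimals would produce, so it does not by itself indicate an error in the page.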

[Charts not captured: Win/Loss/Draw, Scoring Dimensions, Score Distribution]

Category ELOs

Category | Difficulty | ELO
frontend | hard | 1942
from-scratch | expert | 1840
debugging | medium | 1808
frontend | easy | 1572
full-stack | medium | 1554
debugging | - | 1516
backend | easy | 1480
backend | medium | 1476
debugging | expert | 1474
frontend | - | 1473
debugging | hard | 1457
from-scratch | hard | 1440
from-scratch | - | 1428
backend | - | 1417
full-stack | - | 1416
frontend | expert | 1403
frontend | medium | 1402
backend | expert | 1387
multi-language | expert | 1385
backend | hard | 1358
refactoring | medium | 1322
refactoring | - | 1271
full-stack | hard | 1249
multi-language | - | 1207
from-scratch | medium | 1148
code-review | medium | 1143
code-review | - | 1121
from-scratch | easy | 1040
code-review | hard | 129
refactoring | expert | 0
multi-language | hard | 0

All Results

Task | Category | Score
Build SaaS admin dashboard from scratch | from-scratch | 50.1
Split 1100-line god file into proper modules | refactoring | 62.7
Implement JWT auth middleware | backend | 52.0
Convert React app to PWA with offline support | frontend | 52.1
Add file upload with S3 presigned URLs | backend | 74.2
Implement multi-tenant row-level security in Postgres | backend | 66.7
Code review: identify security vulns | code-review | 73.0
Build terminal UI dashboard | from-scratch | 59.3
Build production website with auth and members area | frontend | 65.9
Add retry logic and dead letter queue to Python task queue | backend | 70.5
Add slash commands and moderation to Discord bot | backend | 78.9
Dockerize Node.js monorepo | full-stack | 75.5
Fix 12 WCAG accessibility violations in checkout form | frontend | 86.5
Add virtual scrolling to table rendering 5000 rows | frontend | 85.6
Implement zero-trust API authentication layer | backend | 68.8
Write integration tests for payment flow | code-review | 41.4
Fix React hydration mismatch | frontend | 38.5
Add WebSocket real-time updates | full-stack | 58.7
Harden insecure Docker setup with 12 vulnerabilities | code-review | 65.2
Build distributed node cluster with gossip protocol | from-scratch | 58.0
Implement background job scheduler with persistence | backend | 40.2
Add caching layer to eliminate slow SSR page loads | full-stack | 85.2
Write complex SQL report with window functions | backend | 40.0
Fix hallucination and context window bugs in RAG agent | backend | 63.8
Fix N+1 query in dashboard | backend | 69.7
Zero-downtime schema migration | full-stack | 48.1
Build real-time portfolio risk calculator | backend | 58.6
Refactor monolithic handler to CQRS | refactoring | 28.3
Optimize bloated React bundle under 500KB | frontend | 79.7
Build CLI tool with subcommands and config | from-scratch | 73.3
Implement transformer inference engine with KV cache | from-scratch | 82.6
Fix broken GitHub Actions CI pipeline | debugging | 88.8
Add GraphQL layer over REST API | multi-language | 34.6
Implement Stripe webhook handler | backend | 66.5
Find and patch all OWASP Top 10 vulnerabilities | debugging | 31.8
Replace console.log with structured logging | refactoring | 60.8
Add streaming SSE endpoint for LLM chat | backend | 82.4
Fix race conditions in order matching engine | backend | 79.1
Build materialized view refresh pipeline for analytics | backend | 72.8
Fix Node.js stream backpressure causing OOM on large files | backend | 43.4
Build MCP server for database management | backend | 82.4
Build codebase indexer for LLM context windows | from-scratch | 52.9
Fix flaky test suite | debugging | 93.0
Find and fix 4 hidden backdoors in Flask app | debugging | 90.9
Add i18n with locale routing to Next.js app | full-stack | 75.7
Add rate limiting middleware | backend | 73.5
Debug and fix 6 broken database triggers and constraints | debugging | 88.8
Write tests for untested legacy Flask service | code-review | 33.4
Optimize slow Postgres queries in Flask app | backend | 86.3
Fix auth bypass vulnerability | debugging | 78.5
Debug race condition in worker pool | debugging | 88.0
Fix broken responsive layout | frontend | 75.0
Build RAG pipeline with vector search | backend | 37.8
Fix memory leak in event handler | debugging | 34.3
Migrate callback-hell Express app to async/await | refactoring | 64.4
Build LLM evaluation harness with structured grading | backend | 68.3
Add Redis caching layer to Express API | backend | 50.5
Remove AI slop and over-engineering from codebase | refactoring | 75.3
Port Python CLI to Rust | multi-language | 52.2
Build REST API from scratch | from-scratch | 70.2
Write Kubernetes manifests for Node.js microservice | full-stack | 82.3
Fix deadlocking transaction patterns in Flask app | backend | 65.5
Fix data integrity bugs in denormalized e-commerce schema | debugging | 82.2
Add cursor-based pagination to REST API | backend | 90.0
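
As a sanity check, the headline Avg Score can be reproduced from the per-task scores listed under All Results. This is a minimal sketch that assumes Avg Score is an unweighted mean over the 64 runs; the page does not state the aggregation method, and the variable names are hypothetical. The scores are copied from the list above in order.

```python
# Per-task scores copied from the All Results list, top to bottom.
scores = [
    50.1, 62.7, 52.0, 52.1, 74.2, 66.7, 73.0, 59.3,
    65.9, 70.5, 78.9, 75.5, 86.5, 85.6, 68.8, 41.4,
    38.5, 58.7, 65.2, 58.0, 40.2, 85.2, 40.0, 63.8,
    69.7, 48.1, 58.6, 28.3, 79.7, 73.3, 82.6, 88.8,
    34.6, 66.5, 31.8, 60.8, 82.4, 79.1, 72.8, 43.4,
    82.4, 52.9, 93.0, 90.9, 75.7, 73.5, 88.8, 33.4,
    86.3, 78.5, 88.0, 75.0, 37.8, 34.3, 64.4, 68.3,
    50.5, 75.3, 52.2, 70.2, 82.3, 65.5, 82.2, 90.0,
]

run_count = len(scores)                 # should match the "Runs" stat
mean_score = sum(scores) / run_count    # assumed: simple unweighted mean

print(f"runs: {run_count}")
print(f"mean score: {mean_score:.1f}")
print(f"lowest: {min(scores)}, highest: {max(scores)}")
```

Under that assumption the mean rounds to the 65.7 shown in the summary, and the row count matches the 64 reported runs.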