APEX

GPT 5.2

OpenRouter

400K context, $1.75/M input, $14.00/M output
ELO: 1644 (peak 1662)

Avg Score: 78.5
Avg Cost: $0.17
Score/$: 461.4
Runs: 65
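
The page does not say how Score/$ is derived. A minimal sketch, assuming it is simply the average score divided by the average cost per run in dollars (the function name `score_per_dollar` is illustrative, not part of the site):

```python
def score_per_dollar(avg_score: float, avg_cost_usd: float) -> float:
    """Average score per dollar spent (assumed definition, not confirmed by the page)."""
    if avg_cost_usd <= 0:
        raise ValueError("average cost must be positive")
    return avg_score / avg_cost_usd


# Rounded values as displayed on this page.
ratio = score_per_dollar(78.5, 0.17)
print(f"{ratio:.1f}")  # prints 461.8
```

With the rounded figures this gives about 461.8 rather than the listed 461.4. The gap is consistent with the page computing from an unrounded average cost (roughly $0.1701), though that is an inference.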

Category ELOs

from-scratch (expert): 2259
code-review (hard): 2219
multi-language (hard): 2213
frontend (hard): 2151
from-scratch (medium): 2020
frontend (expert): 1888
frontend (medium): 1845
frontend: 1777
debugging (medium): 1773
backend (medium): 1770
full-stack (hard): 1752
backend (easy): 1745
backend (expert): 1733
code-review: 1716
from-scratch (hard): 1698
code-review (medium): 1695
full-stack: 1692
backend: 1668
from-scratch: 1664
frontend (easy): 1635
full-stack (medium): 1627
debugging: 1596
backend (hard): 1586
debugging (hard): 1585
debugging (expert): 1567
multi-language: 1545
refactoring (medium): 1534
refactoring: 1493
from-scratch (easy): 1272
refactoring (expert): 856
multi-language (expert): 765
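
To give a sense of what gaps between these ratings would mean, here is a sketch of the expected-score formula from the standard Elo system. It assumes the usual 400-point logistic scale. The page does not document its rating method, so this may not match how APEX actually computes ELOs.

```python
def expected_score(rating_a: float, rating_b: float) -> float:
    """Expected score of A against B under standard Elo (400-point scale assumed)."""
    return 1.0 / (1.0 + 10 ** ((rating_b - rating_a) / 400.0))


# Strongest and weakest category ratings listed above, compared with each other.
best, worst = 2259, 765
print(f"{expected_score(best, worst):.4f}")
print(f"{expected_score(1644, 1644):.2f}")  # equal ratings -> 0.50
```

Under this assumption, equal ratings give an expected score of 0.5, and each 400-point lead multiplies the odds of winning by ten.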

All Results

Task | Category | Score
Debug race condition in worker pool | debugging | 92.1
Add caching layer to eliminate slow SSR page loads | full-stack | 79.7
Write complex SQL report with window functions | backend | 77.0
Add slash commands and moderation to Discord bot | backend | 86.3
Remove AI slop and over-engineering from codebase | refactoring | 90.5
Write Kubernetes manifests for Node.js microservice | full-stack | 92.2
Build SaaS admin dashboard from scratch | from-scratch | 63.6
Fix hallucination and context window bugs in RAG agent | backend | 63.0
Build codebase indexer for LLM context windows | from-scratch | 59.3
Fix deadlocking transaction patterns in Flask app | backend | 85.0
Optimize bloated React bundle under 500KB | frontend | 82.1
Fix memory leak in event handler | debugging | 91.8
Debug and fix 6 broken database triggers and constraints | debugging | 93.3
Build real-time portfolio risk calculator | backend | 86.1
Replace console.log with structured logging | refactoring | 69.0
Add i18n with locale routing to Next.js app | full-stack | 76.5
Build LLM evaluation harness with structured grading | backend | 82.3
Add Redis caching layer to Express API | backend | 90.1
Build materialized view refresh pipeline for analytics | backend | 76.5
Refactor monolithic handler to CQRS | refactoring | 51.1
Build MCP server for database management | backend | 86.9
Fix 12 WCAG accessibility violations in checkout form | frontend | 90.2
Implement background job scheduler with persistence | backend | 72.2
Fix Node.js stream backpressure causing OOM on large files | backend | 91.2
Optimize slow Postgres queries in Flask app | backend | 83.0
Add retry logic and dead letter queue to Python task queue | backend | 78.6
Find and patch all OWASP Top 10 vulnerabilities | debugging | 75.1
Write tests for untested legacy Flask service | code-review | 70.4
Build distributed node cluster with gossip protocol | from-scratch | 83.7
Add file upload with S3 presigned URLs | backend | 84.4
Fix N+1 query in dashboard | backend | 74.7
Add virtual scrolling to table rendering 5000 rows | frontend | 60.5
Add cursor-based pagination to REST API | backend | 88.0
Build CLI tool with subcommands and config | from-scratch | 72.3
Fix race conditions in order matching engine | backend | 84.8
Build RAG pipeline with vector search | backend | 48.3
Fix flaky test suite | debugging | 92.8
Port Python CLI to Rust | multi-language | 40.5
Migrate callback-hell Express app to async/await | refactoring | 68.0
Fix broken GitHub Actions CI pipeline | debugging | 87.6
Fix broken responsive layout | frontend | 76.0
Add Google OAuth2 login to Express app | full-stack | 88.1
Dockerize Node.js monorepo | full-stack | 76.7
Find and fix 4 hidden backdoors in Flask app | debugging | 81.7
Build production website with auth and members area | frontend | 72.5
Zero-downtime schema migration | full-stack | 88.3
Fix auth bypass vulnerability | debugging | 88.3
Implement transformer inference engine with KV cache | from-scratch | 91.5
Harden insecure Docker setup with 12 vulnerabilities | code-review | 94.2
Add rate limiting middleware | backend | 77.9
Split 1100-line god file into proper modules | refactoring | 64.4
Implement multi-tenant row-level security in Postgres | backend | 64.7
Add GraphQL layer over REST API | multi-language | 87.0
Fix data integrity bugs in denormalized e-commerce schema | debugging | 65.9
Add streaming SSE endpoint for LLM chat | backend | 88.5
Implement zero-trust API authentication layer | backend | 78.7
Implement JWT auth middleware | backend | 50.9
Fix React hydration mismatch | frontend | 85.8
Implement Stripe webhook handler | backend | 88.0
Code review: identify security vulns | code-review | 78.5
Add WebSocket real-time updates | full-stack | 78.5
Write integration tests for payment flow | code-review | 84.3
Build REST API from scratch | from-scratch | 76.5
Build terminal UI dashboard | from-scratch | 72.9
Convert React app to PWA with offline support | frontend | 84.7
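
The results above can be summarized per category. As a small sketch, the snippet below averages the scores for two categories using rows copied from this list. The helper `mean_by_category` and the tuple layout are illustrative choices, not something the site provides.

```python
from collections import defaultdict
from statistics import mean

# (task, category, score) rows copied from the results list above.
rows = [
    ("Remove AI slop and over-engineering from codebase", "refactoring", 90.5),
    ("Replace console.log with structured logging", "refactoring", 69.0),
    ("Refactor monolithic handler to CQRS", "refactoring", 51.1),
    ("Migrate callback-hell Express app to async/await", "refactoring", 68.0),
    ("Split 1100-line god file into proper modules", "refactoring", 64.4),
    ("Port Python CLI to Rust", "multi-language", 40.5),
    ("Add GraphQL layer over REST API", "multi-language", 87.0),
]


def mean_by_category(rows):
    """Group scores by category and return each category's mean, rounded to 2 places."""
    buckets = defaultdict(list)
    for _task, category, score in rows:
        buckets[category].append(score)
    return {category: round(mean(scores), 2) for category, scores in buckets.items()}


averages = mean_by_category(rows)
for category, avg in sorted(averages.items()):
    print(f"{category}: {avg}")
```

For these rows the refactoring mean comes out to 68.6 and the multi-language mean to 63.75, both below the overall average score of 78.5.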