APEX
Back to models

Deepseek R1 0528

OpenRouter

164K context$0.40/M input$1.75/M output
1445peak 1446

Avg Score

64.2

Avg Cost

$0.05

Score/$

1295.9

Runs

103

Win/Loss/Draw

Scoring Dimensions

Score Distribution

Category ELOs

refactoringexpert
1959
from-scratchexpert
1956
from-scratchmedium
1922
backendeasy
1882
frontendexpert
1800
frontendeasy
1722
multi-languageexpert
1635
backendexpert
1560
full-stackhard
1559
debuggingexpert
1519
debugginghard
1493
full-stack
1475
debugging
1472
backend
1470
backendmedium
1446
backendhard
1443
multi-languagehard
1440
frontend
1439
frontendmedium
1416
multi-language
1404
full-stackmedium
1390
debuggingmedium
1384
from-scratch
1376
frontendhard
1375
refactoring
1372
from-scratchhard
1316
code-reviewmedium
1314
from-scratcheasy
1304
code-review
1244
refactoringmedium
1186
code-reviewhard
363

All Results

TaskCategoryScore
Fix 12 WCAG accessibility violations in checkout formfrontend68.8
Fix deadlocking transaction patterns in Flask appbackend67.6
Build REST API from scratchfrom-scratch62.8
Build codebase indexer for LLM context windowsfrom-scratch47.1
Write tests for untested legacy Flask servicecode-review51.2
Zero-downtime schema migrationfull-stack81.7
Debug race condition in worker pooldebugging91.0
Build distributed node cluster with gossip protocolfrom-scratch39.3
Fix React hydration mismatchfrontend82.4
Fix memory leak in event handlerdebugging83.3
Build production website with auth and members areafrontend65.0
Add virtual scrolling to table rendering 5000 rowsfrontend51.5
Implement multi-tenant row-level security in Postgresbackend66.2
Write complex SQL report with window functionsbackend73.5
Fix broken GitHub Actions CI pipelinedebugging75.6
Debug and fix 6 broken database triggers and constraintsdebugging79.3
Add GraphQL layer over REST APImulti-language72.9
Fix Node.js stream backpressure causing OOM on large filesbackend85.7
Replace console.log with structured loggingrefactoring55.9
Add caching layer to eliminate slow SSR page loadsfull-stack87.0
Fix N+1 query in dashboardbackend66.1
Add WebSocket real-time updatesfull-stack74.0
Add i18n with locale routing to Next.js appfull-stack66.1
Refactor monolithic handler to CQRSrefactoring50.6
Fix auth bypass vulnerabilitydebugging94.1
Find and patch all OWASP Top 10 vulnerabilitiesdebugging68.5
Add streaming SSE endpoint for LLM chatbackend81.7
Build real-time portfolio risk calculatorbackend57.6
Implement zero-trust API authentication layerbackend70.7
Implement transformer inference engine with KV cachefrom-scratch85.8
Migrate callback-hell Express app to async/awaitrefactoring52.0
Add file upload with S3 presigned URLsbackend24.6
Fix broken responsive layoutfrontend73.0
Port Python CLI to Rustmulti-language39.5
Build RAG pipeline with vector searchbackend60.3
Split 1100-line god file into proper modulesrefactoring62.2
Optimize slow Postgres queries in Flask appbackend85.3
Optimize bloated React bundle under 500KBfrontend76.8
Code review: identify security vulnscode-review45.7
Build MCP server for database managementbackend71.7
Harden insecure Docker setup with 12 vulnerabilitiescode-review81.0
Build materialized view refresh pipeline for analyticsbackend59.5
Fix race conditions in order matching enginebackend90.7
Build SaaS admin dashboard from scratchfrom-scratch38.5
Build LLM evaluation harness with structured gradingbackend80.5
Add Redis caching layer to Express APIbackend30.5
Fix hallucination and context window bugs in RAG agentbackend63.5
Implement JWT auth middlewarebackend51.8
Build terminal UI dashboardfrom-scratch59.4
Write Kubernetes manifests for Node.js microservicefull-stack75.6
Convert React app to PWA with offline supportfrontend10.8
Remove AI slop and over-engineering from codebaserefactoring33.3
Build codebase indexer for LLM context windowsfrom-scratch24.5
Find and patch all OWASP Top 10 vulnerabilitiesdebugging72.0
Dockerize Node.js monorepofull-stack64.0
Add i18n with locale routing to Next.js appfull-stack62.6
Optimize bloated React bundle under 500KBfrontend57.0
Replace console.log with structured loggingrefactoring48.4
Implement JWT auth middlewarebackend80.3
Split 1100-line god file into proper modulesrefactoring63.7
Fix broken responsive layoutfrontend58.5
Harden insecure Docker setup with 12 vulnerabilitiescode-review71.5
Add caching layer to eliminate slow SSR page loadsfull-stack68.3
Implement zero-trust API authentication layerbackend52.5
Implement multi-tenant row-level security in Postgresbackend61.1
Write Kubernetes manifests for Node.js microservicefull-stack42.3
Implement background job scheduler with persistencebackend60.0
Implement transformer inference engine with KV cachefrom-scratch53.8
Build MCP server for database managementbackend67.5
Build SaaS admin dashboard from scratchfrom-scratch50.9
Build production website with auth and members areafrontend54.0
Build CLI tool with subcommands and configfrom-scratch38.3
Build real-time portfolio risk calculatorbackend36.5
Build LLM evaluation harness with structured gradingbackend45.0
Fix hallucination and context window bugs in RAG agentbackend65.5
Fix race conditions in order matching enginebackend80.9
Write complex SQL report with window functionsbackend73.3
Fix data integrity bugs in denormalized e-commerce schemadebugging72.7
Debug and fix 6 broken database triggers and constraintsdebugging81.8
Fix deadlocking transaction patterns in Flask appbackend64.5
Find and fix 4 hidden backdoors in Flask appdebugging73.7
Write tests for untested legacy Flask servicecode-review48.1
Add slash commands and moderation to Discord botbackend60.9
Add retry logic and dead letter queue to Python task queuebackend76.2
Fix 12 WCAG accessibility violations in checkout formfrontend77.0
Optimize slow Postgres queries in Flask appbackend80.6
Fix Node.js stream backpressure causing OOM on large filesbackend82.4
Add virtual scrolling to table rendering 5000 rowsfrontend70.1
Build distributed node cluster with gossip protocolfrom-scratch61.9
Write integration tests for payment flowcode-review41.5
Add cursor-based pagination to REST APIbackend34.2
Add rate limiting middlewarebackend80.6
Fix auth bypass vulnerabilitydebugging88.0
Implement Stripe webhook handlerbackend54.3
Zero-downtime schema migrationfull-stack62.4
Fix flaky test suitedebugging74.3
Fix N+1 query in dashboardbackend83.8
Refactor monolithic handler to CQRSrefactoring72.3
Fix React hydration mismatchfrontend73.3
Build terminal UI dashboardfrom-scratch48.0
Fix memory leak in event handlerdebugging56.6
Debug race condition in worker pooldebugging90.3
Build REST API from scratchfrom-scratch78.8