APEX
Back to models

Deepseek V4 Pro

OpenRouter

1049K context$0.50/M input$2.00/M output
1656peak 1672

Avg Score

80.2

Avg Cost

$0.37

Score/$

214.5

Runs

70

Win/Loss/Draw

Scoring Dimensions

Score Distribution

Category ELOs

code-reviewhard
2307
from-scratcheasy
2253
frontendeasy
2191
multi-languagehard
2165
frontendexpert
2136
backendeasy
2109
from-scratchmedium
2020
multi-languageexpert
1896
from-scratchhard
1827
backendhard
1803
from-scratchexpert
1802
debuggingmedium
1800
code-review
1782
frontendmedium
1765
frontendmaster
1760
code-reviewmedium
1758
from-scratch
1752
frontend
1748
multi-language
1747
backendexpert
1732
debuggingexpert
1704
refactoringexpert
1691
refactoringmedium
1661
backend
1646
refactoring
1639
full-stackmedium
1630
full-stack
1605
debugging
1594
full-stackhard
1592
frontendhard
1548
backendmedium
1499
debugginghard
1482
backendmaster
1450

All Results

TaskCategoryScore
Fix and extend Chrome browser extensionfrontend76.9
Build 3D browser game with physics and multiplayer syncfrontend83.2
Build real-time portfolio risk calculatorbackend85.2
Debug and fix 6 broken database triggers and constraintsdebugging86.6
Build interactive data visualization dashboardfrontend78.6
Split 1100-line god file into proper modulesrefactoring79.5
Add caching layer to eliminate slow SSR page loadsfull-stack84.8
Build MCP server for database managementbackend83.0
Optimize slow Postgres queries in Flask appbackend92.5
Build CLI tool with subcommands and configfrom-scratch73.4
Fix auth bypass vulnerabilitydebugging89.7
Implement transformer inference engine with KV cachefrom-scratch82.1
Fix hallucination and context window bugs in RAG agentbackend85.7
Build multi-tool LLM agent runtimebackend84.4
Implement Stripe webhook handlerbackend87.8
Zero-downtime schema migrationfull-stack82.8
Build LLM evaluation harness with structured gradingbackend76.7
Add cursor-based pagination to REST APIbackend78.0
Write tests for untested legacy Flask servicecode-review84.2
Migrate Express monolith to modular architecturebackend63.1
Remove AI slop and over-engineering from codebaserefactoring82.7
Implement zero-trust API authentication layerbackend75.6
Find and fix 4 hidden backdoors in Flask appdebugging89.7
Write complex SQL report with window functionsbackend86.3
Add i18n with locale routing to Next.js appfull-stack79.5
Find and patch all OWASP Top 10 vulnerabilitiesdebugging81.8
Fix broken GitHub Actions CI pipelinedebugging91.8
Write Kubernetes manifests for Node.js microservicefull-stack87.3
Build codebase indexer for LLM context windowsfrom-scratch81.0
Build materialized view refresh pipeline for analyticsbackend76.8
Add retry logic and dead letter queue to Python task queuebackend80.7
Add Redis caching layer to Express APIbackend80.3
Fix race conditions in order matching enginebackend87.3
Implement background job scheduler with persistencebackend78.3
Fix N+1 query in dashboardbackend81.2
Replace console.log with structured loggingrefactoring72.0
Build distributed node cluster with gossip protocolfrom-scratch78.0
Fix Node.js stream backpressure causing OOM on large filesbackend90.7
Add GraphQL layer over REST APImulti-language85.7
Fix data integrity bugs in denormalized e-commerce schemadebugging85.6
Add virtual scrolling to table rendering 5000 rowsfrontend85.6
Fix 12 WCAG accessibility violations in checkout formfrontend79.7
Fix broken responsive layoutfrontend83.3
Implement multi-tenant row-level security in Postgresbackend76.4
Add Google OAuth2 login to Express appfull-stack64.0
Add streaming SSE endpoint for LLM chatbackend71.6
Fix memory leak in event handlerdebugging55.6
Fix deadlocking transaction patterns in Flask appbackend82.0
Optimize bloated React bundle under 500KBfrontend79.9
Code review: identify security vulnscode-review78.9
Harden insecure Docker setup with 12 vulnerabilitiescode-review91.2
Convert React app to PWA with offline supportfrontend81.7
Implement JWT auth middlewarebackend61.0
Port Python CLI to Rustmulti-language65.5
Add slash commands and moderation to Discord botbackend69.8
Build production website with auth and members areafrontend75.8
Add file upload with S3 presigned URLsbackend60.3
Migrate callback-hell Express app to async/awaitrefactoring87.0
Build terminal UI dashboardfrom-scratch73.1
Build SaaS admin dashboard from scratchfrom-scratch70.3
Build RAG pipeline with vector searchbackend82.4
Add rate limiting middlewarebackend83.5
Build REST API from scratchfrom-scratch92.1
Refactor monolithic handler to CQRSrefactoring70.5
Fix flaky test suitedebugging90.4
Fix React hydration mismatchfrontend85.3
Write integration tests for payment flowcode-review87.0
Add WebSocket real-time updatesfull-stack81.2
Dockerize Node.js monorepofull-stack76.5
Debug race condition in worker pooldebugging83.1