APEX
Back to models

Deepseek V3.2

OpenRouter

164K context$0.25/M input$0.38/M output
1414peak 1416

Avg Score

63.8

Avg Cost

$0.14

Score/$

446.1

Runs

80

Win/Loss/Draw

Scoring Dimensions

Score Distribution

Category ELOs

multi-languageexpert
2731
code-reviewhard
1675
debuggingmedium
1574
code-review
1519
multi-language
1509
code-reviewmedium
1509
frontendmedium
1498
backendexpert
1488
debugginghard
1488
debugging
1467
frontend
1467
refactoringmedium
1462
frontendexpert
1446
debuggingexpert
1442
backendhard
1417
frontendhard
1415
frontendeasy
1414
from-scratchmedium
1408
refactoring
1396
full-stackhard
1396
from-scratchhard
1391
backend
1384
from-scratch
1378
full-stack
1333
backendmedium
1317
from-scratcheasy
1275
full-stackmedium
1268
from-scratchexpert
1024
refactoringexpert
357
backendeasy
115
multi-languagehard
61

All Results

TaskCategoryScore
Build distributed node cluster with gossip protocolfrom-scratch30.7
Write complex SQL report with window functionsbackend70.6
Find and fix 4 hidden backdoors in Flask appdebugging72.5
Convert React app to PWA with offline supportfrontend63.6
Debug and fix 6 broken database triggers and constraintsdebugging72.5
Add retry logic and dead letter queue to Python task queuebackend38.0
Implement zero-trust API authentication layerbackend76.8
Implement multi-tenant row-level security in Postgresbackend60.0
Implement background job scheduler with persistencebackend62.0
Add file upload with S3 presigned URLsbackend40.0
Optimize slow Postgres queries in Flask appbackend63.1
Add i18n with locale routing to Next.js appfull-stack65.8
Find and patch all OWASP Top 10 vulnerabilitiesdebugging67.9
Build real-time portfolio risk calculatorbackend52.9
Fix memory leak in event handlerdebugging39.3
Remove AI slop and over-engineering from codebaserefactoring79.1
Fix broken GitHub Actions CI pipelinedebugging90.8
Add Redis caching layer to Express APIbackend66.2
Add Google OAuth2 login to Express appfull-stack8.6
Add GraphQL layer over REST APImulti-language33.3
Add streaming SSE endpoint for LLM chatbackend84.0
Fix auth bypass vulnerabilitydebugging95.0
Migrate callback-hell Express app to async/awaitrefactoring58.7
Port Python CLI to Rustmulti-language54.3
Build materialized view refresh pipeline for analyticsbackend62.9
Build RAG pipeline with vector searchbackend51.3
Code review: identify security vulnscode-review83.8
Optimize slow Postgres queries in Flask appbackend84.8
Add WebSocket real-time updatesfull-stack75.3
Implement zero-trust API authentication layerbackend68.3
Optimize bloated React bundle under 500KBfrontend68.5
Fix broken responsive layoutfrontend72.0
Find and patch all OWASP Top 10 vulnerabilitiesdebugging67.5
Convert React app to PWA with offline supportfrontend65.0
Implement JWT auth middlewarebackend69.8
Split 1100-line god file into proper modulesrefactoring75.9
Dockerize Node.js monorepofull-stack67.3
Write Kubernetes manifests for Node.js microservicefull-stack71.5
Add caching layer to eliminate slow SSR page loadsfull-stack78.5
Remove AI slop and over-engineering from codebaserefactoring85.7
Build codebase indexer for LLM context windowsfrom-scratch27.0
Implement multi-tenant row-level security in Postgresbackend72.5
Harden insecure Docker setup with 12 vulnerabilitiescode-review79.5
Replace console.log with structured loggingrefactoring40.8
Add i18n with locale routing to Next.js appfull-stack55.5
Add rate limiting middlewarebackend43.1
Build production website with auth and members areafrontend61.7
Build SaaS admin dashboard from scratchfrom-scratch65.0
Fix hallucination and context window bugs in RAG agentbackend53.6
Build LLM evaluation harness with structured gradingbackend64.2
Implement background job scheduler with persistencebackend49.1
Build MCP server for database managementbackend81.8
Build CLI tool with subcommands and configfrom-scratch52.5
Implement transformer inference engine with KV cachefrom-scratch68.9
Build real-time portfolio risk calculatorbackend46.1
Fix race conditions in order matching enginebackend81.5
Fix data integrity bugs in denormalized e-commerce schemadebugging76.9
Write tests for untested legacy Flask servicecode-review56.9
Fix deadlocking transaction patterns in Flask appbackend47.0
Write complex SQL report with window functionsbackend73.0
Debug and fix 6 broken database triggers and constraintsdebugging57.8
Find and fix 4 hidden backdoors in Flask appdebugging71.0
Write integration tests for payment flowcode-review68.1
Fix 12 WCAG accessibility violations in checkout formfrontend77.3
Add retry logic and dead letter queue to Python task queuebackend63.5
Add slash commands and moderation to Discord botbackend63.5
Fix Node.js stream backpressure causing OOM on large filesbackend80.8
Add virtual scrolling to table rendering 5000 rowsfrontend72.6
Build distributed node cluster with gossip protocolfrom-scratch41.3
Add cursor-based pagination to REST APIbackend85.9
Build terminal UI dashboardfrom-scratch58.5
Zero-downtime schema migrationfull-stack63.1
Refactor monolithic handler to CQRSrefactoring40.0
Implement Stripe webhook handlerbackend53.5
Fix flaky test suitedebugging58.0
Build REST API from scratchfrom-scratch76.3
Fix React hydration mismatchfrontend80.3
Fix N+1 query in dashboardbackend53.9
Fix memory leak in event handlerdebugging54.5
Debug race condition in worker pooldebugging90.5