APEX
Back to models

Deepseek V3.2

OpenRouter

164K context$0.25/M input$0.38/M output
1360peak 1377

Avg Score

64.3

Avg Cost

$0.04

Score/$

1765.2

Runs

65

Win/Loss/Draw

Scoring Dimensions

Score Distribution

Category ELOs

multi-languageexpert
1521
code-reviewmedium
1519
debuggingmedium
1473
debugginghard
1464
code-review
1456
frontendhard
1435
frontend
1406
debugging
1403
frontendexpert
1403
frontendeasy
1398
frontendmedium
1394
backendhard
1386
backendexpert
1369
backend
1368
backendmedium
1361
debuggingexpert
1325
refactoringmedium
1296
refactoring
1278
from-scratch
1273
from-scratcheasy
1272
full-stack
1255
full-stackmedium
1241
multi-language
1235
full-stackhard
1235
from-scratchhard
1198
code-reviewhard
1181
from-scratchmedium
1164
from-scratchexpert
959
backendeasy
582
refactoringexpert
328
multi-languagehard
0

All Results

TaskCategoryScore
Build distributed node cluster with gossip protocolfrom-scratch38.7
Write complex SQL report with window functionsbackend70.6
Find and fix 4 hidden backdoors in Flask appdebugging72.5
Convert React app to PWA with offline supportfrontend63.6
Debug and fix 6 broken database triggers and constraintsdebugging72.5
Add retry logic and dead letter queue to Python task queuebackend38.0
Implement zero-trust API authentication layerbackend76.8
Implement multi-tenant row-level security in Postgresbackend60.0
Implement background job scheduler with persistencebackend62.0
Add file upload with S3 presigned URLsbackend40.0
Optimize slow Postgres queries in Flask appbackend84.8
Add i18n with locale routing to Next.js appfull-stack65.8
Find and patch all OWASP Top 10 vulnerabilitiesdebugging67.9
Build real-time portfolio risk calculatorbackend52.9
Fix memory leak in event handlerdebugging39.3
Remove AI slop and over-engineering from codebaserefactoring79.1
Fix broken GitHub Actions CI pipelinedebugging90.8
Add Redis caching layer to Express APIbackend66.2
Add Google OAuth2 login to Express appfull-stack8.6
Add GraphQL layer over REST APImulti-language33.3
Add streaming SSE endpoint for LLM chatbackend84.0
Fix auth bypass vulnerabilitydebugging95.0
Migrate callback-hell Express app to async/awaitrefactoring58.7
Port Python CLI to Rustmulti-language54.3
Build materialized view refresh pipeline for analyticsbackend62.9
Build RAG pipeline with vector searchbackend51.3
Code review: identify security vulnscode-review83.8
Add WebSocket real-time updatesfull-stack75.3
Optimize bloated React bundle under 500KBfrontend68.5
Fix broken responsive layoutfrontend72.0
Implement JWT auth middlewarebackend69.8
Split 1100-line god file into proper modulesrefactoring75.9
Dockerize Node.js monorepofull-stack67.3
Write Kubernetes manifests for Node.js microservicefull-stack71.5
Add caching layer to eliminate slow SSR page loadsfull-stack78.5
Build codebase indexer for LLM context windowsfrom-scratch27.0
Harden insecure Docker setup with 12 vulnerabilitiescode-review79.5
Replace console.log with structured loggingrefactoring40.8
Add rate limiting middlewarebackend43.1
Build production website with auth and members areafrontend65.7
Build SaaS admin dashboard from scratchfrom-scratch65.0
Fix hallucination and context window bugs in RAG agentbackend53.6
Build LLM evaluation harness with structured gradingbackend64.2
Build MCP server for database managementbackend81.8
Build CLI tool with subcommands and configfrom-scratch53.8
Implement transformer inference engine with KV cachefrom-scratch68.9
Fix race conditions in order matching enginebackend81.5
Fix data integrity bugs in denormalized e-commerce schemadebugging76.9
Write tests for untested legacy Flask servicecode-review56.9
Fix deadlocking transaction patterns in Flask appbackend47.0
Write integration tests for payment flowcode-review68.1
Fix 12 WCAG accessibility violations in checkout formfrontend77.3
Add slash commands and moderation to Discord botbackend63.5
Fix Node.js stream backpressure causing OOM on large filesbackend80.8
Add virtual scrolling to table rendering 5000 rowsfrontend72.6
Add cursor-based pagination to REST APIbackend85.9
Build terminal UI dashboardfrom-scratch59.7
Zero-downtime schema migrationfull-stack63.1
Refactor monolithic handler to CQRSrefactoring40.0
Implement Stripe webhook handlerbackend53.5
Fix flaky test suitedebugging58.0
Build REST API from scratchfrom-scratch76.3
Fix React hydration mismatchfrontend80.3
Fix N+1 query in dashboardbackend53.9
Debug race condition in worker pooldebugging90.5