APEX
Back to models

Qwen3 Coder

OpenRouter

262K context$0.22/M input$1.00/M output
1417peak 1427

Avg Score

60.6

Avg Cost

$0.11

Score/$

560.0

Runs

119

Win/Loss/Draw

Scoring Dimensions

Score Distribution

Category ELOs

frontendexpert
1838
multi-languagehard
1829
refactoringexpert
1711
debuggingmedium
1703
debugginghard
1531
full-stackmedium
1516
debugging
1495
frontendhard
1488
multi-languageexpert
1487
multi-language
1468
backendmedium
1465
backendhard
1436
from-scratchexpert
1414
backend
1413
from-scratchhard
1412
frontend
1410
full-stack
1405
refactoring
1405
debuggingexpert
1385
frontendmedium
1382
from-scratch
1367
refactoringmedium
1359
code-reviewmedium
1354
full-stackhard
1342
frontendeasy
1315
backendexpert
1291
from-scratcheasy
1290
code-review
1248
from-scratchmedium
1229
backendeasy
923
code-reviewhard
669

All Results

TaskCategoryScore
Write tests for untested legacy Flask servicecode-review49.0
Add file upload with S3 presigned URLsbackend80.1
Build real-time portfolio risk calculatorbackend49.8
Debug and fix 6 broken database triggers and constraintsdebugging49.6
Debug race condition in worker pooldebugging89.2
Fix 12 WCAG accessibility violations in checkout formfrontend75.3
Migrate callback-hell Express app to async/awaitrefactoring64.1
Add cursor-based pagination to REST APIbackend71.2
Fix Node.js stream backpressure causing OOM on large filesbackend88.3
Add streaming SSE endpoint for LLM chatbackend81.8
Add i18n with locale routing to Next.js appfull-stack62.5
Fix broken GitHub Actions CI pipelinedebugging84.8
Fix deadlocking transaction patterns in Flask appbackend72.3
Fix hallucination and context window bugs in RAG agentbackend44.8
Add rate limiting middlewarebackend40.0
Port Python CLI to Rustmulti-language39.0
Build SaaS admin dashboard from scratchfrom-scratch57.8
Add slash commands and moderation to Discord botbackend79.5
Build production website with auth and members areafrontend66.1
Replace console.log with structured loggingrefactoring45.0
Add virtual scrolling to table rendering 5000 rowsfrontend71.5
Build materialized view refresh pipeline for analyticsbackend73.3
Implement multi-tenant row-level security in Postgresbackend59.1
Refactor monolithic handler to CQRSrefactoring79.3
Build RAG pipeline with vector searchbackend47.4
Implement Stripe webhook handlerbackend82.8
Build terminal UI dashboardfrom-scratch37.1
Fix React hydration mismatchfrontend85.3
Fix data integrity bugs in denormalized e-commerce schemadebugging81.0
Implement background job scheduler with persistencebackend37.3
Build CLI tool with subcommands and configfrom-scratch61.4
Split 1100-line god file into proper modulesrefactoring67.1
Optimize slow Postgres queries in Flask appbackend62.4
Find and fix 4 hidden backdoors in Flask appdebugging91.3
Fix flaky test suitedebugging86.0
Fix auth bypass vulnerabilitydebugging82.8
Add retry logic and dead letter queue to Python task queuebackend79.4
Add GraphQL layer over REST APImulti-language78.1
Implement zero-trust API authentication layerbackend61.2
Fix broken responsive layoutfrontend64.0
Fix N+1 query in dashboardbackend54.4
Build LLM evaluation harness with structured gradingbackend45.4
Convert React app to PWA with offline supportfrontend72.4
Harden insecure Docker setup with 12 vulnerabilitiescode-review47.1
Find and patch all OWASP Top 10 vulnerabilitiesdebugging66.2
Add Redis caching layer to Express APIbackend81.3
Write integration tests for payment flowcode-review64.2
Add Google OAuth2 login to Express appfull-stack82.0
Fix memory leak in event handlerdebugging70.9
Implement transformer inference engine with KV cachefrom-scratch74.2
Implement JWT auth middlewarebackend44.3
Write Kubernetes manifests for Node.js microservicefull-stack85.0
Code review: identify security vulnscode-review75.0
Dockerize Node.js monorepofull-stack70.8
Write complex SQL report with window functionsbackend62.4
Remove AI slop and over-engineering from codebaserefactoring80.0
Build MCP server for database managementbackend74.5
Add WebSocket real-time updatesfull-stack59.5
Build codebase indexer for LLM context windowsfrom-scratch42.4
Zero-downtime schema migrationfull-stack53.9
Build REST API from scratchfrom-scratch72.5
Add caching layer to eliminate slow SSR page loadsfull-stack83.5
Build distributed node cluster with gossip protocolfrom-scratch27.4
Optimize bloated React bundle under 500KBfrontend33.8
Fix race conditions in order matching enginebackend59.9
Add file upload with S3 presigned URLsbackend9.8
Convert React app to PWA with offline supportfrontend31.6
Add i18n with locale routing to Next.js appfull-stack34.3
Remove AI slop and over-engineering from codebaserefactoring31.9
Implement zero-trust API authentication layerbackend33.9
Implement JWT auth middlewarebackend42.0
Replace console.log with structured loggingrefactoring34.0
Find and patch all OWASP Top 10 vulnerabilitiesdebugging29.0
Split 1100-line god file into proper modulesrefactoring33.5
Implement multi-tenant row-level security in Postgresbackend52.0
Write Kubernetes manifests for Node.js microservicefull-stack79.3
Build codebase indexer for LLM context windowsfrom-scratch2.1
Add caching layer to eliminate slow SSR page loadsfull-stack31.5
Harden insecure Docker setup with 12 vulnerabilitiescode-review15.0
Optimize bloated React bundle under 500KBfrontend36.8
Dockerize Node.js monorepofull-stack60.9
Fix broken responsive layoutfrontend7.0
Build production website with auth and members areafrontend56.0
Build SaaS admin dashboard from scratchfrom-scratch66.3
Implement background job scheduler with persistencebackend67.5
Build CLI tool with subcommands and configfrom-scratch45.5
Build MCP server for database managementbackend59.9
Implement transformer inference engine with KV cachefrom-scratch53.3
Build LLM evaluation harness with structured gradingbackend55.9
Fix hallucination and context window bugs in RAG agentbackend58.8
Build real-time portfolio risk calculatorbackend53.0
Fix race conditions in order matching enginebackend74.8
Fix data integrity bugs in denormalized e-commerce schemadebugging73.2
Write complex SQL report with window functionsbackend72.5
Fix deadlocking transaction patterns in Flask appbackend60.5
Debug and fix 6 broken database triggers and constraintsdebugging71.0
Find and fix 4 hidden backdoors in Flask appdebugging83.3
Write tests for untested legacy Flask servicecode-review45.3
Add Google OAuth2 login to Express appfull-stack70.0
Optimize slow Postgres queries in Flask appbackend58.0
Fix 12 WCAG accessibility violations in checkout formfrontend78.7
Add slash commands and moderation to Discord botbackend57.1
Add retry logic and dead letter queue to Python task queuebackend62.2
Fix Node.js stream backpressure causing OOM on large filesbackend80.3
Add virtual scrolling to table rendering 5000 rowsfrontend54.4
Build distributed node cluster with gossip protocolfrom-scratch41.9
Write integration tests for payment flowcode-review45.8
Fix auth bypass vulnerabilitydebugging92.4
Zero-downtime schema migrationfull-stack48.7
Add rate limiting middlewarebackend64.3
Add cursor-based pagination to REST APIbackend62.4
Fix flaky test suitedebugging80.2
Refactor monolithic handler to CQRSrefactoring63.1
Fix N+1 query in dashboardbackend53.6
Fix React hydration mismatchfrontend77.7
Fix memory leak in event handlerdebugging68.3
Build terminal UI dashboardfrom-scratch55.0
Build REST API from scratchfrom-scratch77.2
Debug race condition in worker pooldebugging93.3