APEX
Back to models

Qwen3 Coder Plus

OpenRouter

1000K context$1.00/M input$5.00/M output
1420peak 1428

Avg Score

62.8

Avg Cost

$0.12

Score/$

504.2

Runs

117

Win/Loss/Draw

Scoring Dimensions

Score Distribution

Category ELOs

from-scratchexpert
1733
from-scratchmedium
1644
debugginghard
1608
debuggingexpert
1572
from-scratcheasy
1534
debugging
1495
full-stackhard
1460
backendmedium
1450
full-stack
1438
frontendmedium
1430
frontendexpert
1420
full-stackmedium
1417
backendhard
1415
backend
1411
from-scratch
1410
refactoringmedium
1406
frontend
1400
refactoring
1399
backendexpert
1373
code-reviewhard
1309
code-review
1297
from-scratchhard
1288
code-reviewmedium
1285
multi-languagehard
1284
multi-language
1271
frontendeasy
1264
debuggingmedium
1220
refactoringexpert
1160
multi-languageexpert
1147
frontendhard
939
backendeasy
337

All Results

TaskCategoryScore
Build production website with auth and members areafrontend54.8
Build terminal UI dashboardfrom-scratch55.1
Build CLI tool with subcommands and configfrom-scratch44.4
Zero-downtime schema migrationfull-stack72.2
Split 1100-line god file into proper modulesrefactoring61.6
Build REST API from scratchfrom-scratch74.5
Fix data integrity bugs in denormalized e-commerce schemadebugging87.0
Optimize slow Postgres queries in Flask appbackend82.6
Add Redis caching layer to Express APIbackend84.9
Remove AI slop and over-engineering from codebaserefactoring66.3
Fix React hydration mismatchfrontend74.8
Add virtual scrolling to table rendering 5000 rowsfrontend76.6
Implement background job scheduler with persistencebackend49.9
Replace console.log with structured loggingrefactoring64.0
Optimize bloated React bundle under 500KBfrontend66.1
Fix 12 WCAG accessibility violations in checkout formfrontend69.3
Write tests for untested legacy Flask servicecode-review47.3
Harden insecure Docker setup with 12 vulnerabilitiescode-review47.3
Fix broken GitHub Actions CI pipelinedebugging77.0
Build distributed node cluster with gossip protocolfrom-scratch35.5
Write integration tests for payment flowcode-review71.8
Implement JWT auth middlewarebackend49.4
Build codebase indexer for LLM context windowsfrom-scratch43.0
Code review: identify security vulnscode-review63.0
Write complex SQL report with window functionsbackend62.6
Fix broken responsive layoutfrontend44.2
Add slash commands and moderation to Discord botbackend70.5
Fix Node.js stream backpressure causing OOM on large filesbackend74.0
Build RAG pipeline with vector searchbackend37.3
Implement transformer inference engine with KV cachefrom-scratch79.5
Add rate limiting middlewarebackend45.0
Build materialized view refresh pipeline for analyticsbackend72.3
Refactor monolithic handler to CQRSrefactoring38.5
Implement zero-trust API authentication layerbackend67.1
Add file upload with S3 presigned URLsbackend75.7
Add Google OAuth2 login to Express appfull-stack80.0
Add retry logic and dead letter queue to Python task queuebackend77.0
Debug race condition in worker pooldebugging86.4
Build MCP server for database managementbackend80.2
Find and fix 4 hidden backdoors in Flask appdebugging87.7
Implement Stripe webhook handlerbackend74.9
Fix auth bypass vulnerabilitydebugging73.2
Dockerize Node.js monorepofull-stack71.0
Add GraphQL layer over REST APImulti-language70.1
Fix hallucination and context window bugs in RAG agentbackend27.4
Write Kubernetes manifests for Node.js microservicefull-stack75.5
Convert React app to PWA with offline supportfrontend69.3
Build LLM evaluation harness with structured gradingbackend70.9
Add caching layer to eliminate slow SSR page loadsfull-stack80.8
Build real-time portfolio risk calculatorbackend37.2
Port Python CLI to Rustmulti-language32.8
Add i18n with locale routing to Next.js appfull-stack61.9
Migrate callback-hell Express app to async/awaitrefactoring60.7
Fix deadlocking transaction patterns in Flask appbackend68.2
Find and patch all OWASP Top 10 vulnerabilitiesdebugging63.9
Fix flaky test suitedebugging22.8
Build SaaS admin dashboard from scratchfrom-scratch51.8
Fix N+1 query in dashboardbackend37.0
Implement multi-tenant row-level security in Postgresbackend61.6
Add streaming SSE endpoint for LLM chatbackend71.7
Debug and fix 6 broken database triggers and constraintsdebugging83.8
Add cursor-based pagination to REST APIbackend76.9
Fix race conditions in order matching enginebackend79.3
Fix memory leak in event handlerdebugging78.8
Add WebSocket real-time updatesfull-stack70.5
Optimize bloated React bundle under 500KBfrontend67.4
Add caching layer to eliminate slow SSR page loadsfull-stack74.2
Add i18n with locale routing to Next.js appfull-stack57.9
Implement JWT auth middlewarebackend80.9
Find and patch all OWASP Top 10 vulnerabilitiesdebugging71.0
Convert React app to PWA with offline supportfrontend62.4
Dockerize Node.js monorepofull-stack69.8
Replace console.log with structured loggingrefactoring42.6
Fix broken responsive layoutfrontend70.0
Implement zero-trust API authentication layerbackend69.5
Harden insecure Docker setup with 12 vulnerabilitiescode-review73.5
Remove AI slop and over-engineering from codebaserefactoring78.8
Split 1100-line god file into proper modulesrefactoring66.8
Build codebase indexer for LLM context windowsfrom-scratch41.5
Write Kubernetes manifests for Node.js microservicefull-stack81.5
Implement multi-tenant row-level security in Postgresbackend65.9
Build production website with auth and members areafrontend61.1
Build SaaS admin dashboard from scratchfrom-scratch59.4
Build MCP server for database managementbackend65.3
Implement transformer inference engine with KV cachefrom-scratch69.5
Build CLI tool with subcommands and configfrom-scratch38.0
Build real-time portfolio risk calculatorbackend53.7
Build materialized view refresh pipeline for analyticsbackend50.0
Implement background job scheduler with persistencebackend26.5
Fix hallucination and context window bugs in RAG agentbackend63.0
Fix race conditions in order matching enginebackend77.3
Build LLM evaluation harness with structured gradingbackend32.3
Fix data integrity bugs in denormalized e-commerce schemadebugging38.0
Write complex SQL report with window functionsbackend72.7
Fix deadlocking transaction patterns in Flask appbackend57.9
Debug and fix 6 broken database triggers and constraintsdebugging53.8
Find and fix 4 hidden backdoors in Flask appdebugging39.4
Fix 12 WCAG accessibility violations in checkout formfrontend64.9
Optimize slow Postgres queries in Flask appbackend61.9
Write tests for untested legacy Flask servicecode-review24.5
Add retry logic and dead letter queue to Python task queuebackend60.6
Build distributed node cluster with gossip protocolfrom-scratch27.5
Add virtual scrolling to table rendering 5000 rowsfrontend63.9
Fix Node.js stream backpressure causing OOM on large filesbackend87.9
Write integration tests for payment flowcode-review63.9
Add GraphQL layer over REST APImulti-language53.6
Fix auth bypass vulnerabilitydebugging94.7
Add rate limiting middlewarebackend49.8
Zero-downtime schema migrationfull-stack61.0
Fix flaky test suitedebugging61.8
Refactor monolithic handler to CQRSrefactoring57.8
Fix N+1 query in dashboardbackend58.8
Fix memory leak in event handlerdebugging56.5
Fix React hydration mismatchfrontend74.9
Build terminal UI dashboardfrom-scratch62.4
Debug race condition in worker pooldebugging92.5
Build REST API from scratchfrom-scratch73.3