APEX
Back to models

Qwen3 Coder Flash

OpenRouter

1000K context$0.30/M input$1.50/M output
1344peak 1367

Avg Score

58.8

Avg Cost

$0.09

Score/$

662.3

Runs

120

Win/Loss/Draw

Scoring Dimensions

Score Distribution

Category ELOs

code-reviewhard
1808
from-scratchexpert
1578
frontendeasy
1501
full-stackhard
1496
full-stack
1459
backendmedium
1433
code-review
1411
frontendmedium
1397
full-stackmedium
1381
backend
1368
backendexpert
1362
refactoringmedium
1356
frontend
1346
refactoring
1333
code-reviewmedium
1325
backendhard
1310
multi-language
1262
from-scratchhard
1260
multi-languagehard
1226
debuggingexpert
1223
from-scratch
1190
debugginghard
1185
debugging
1181
from-scratchmedium
1151
multi-languageexpert
1147
frontendhard
1143
refactoringexpert
1028
frontendexpert
922
debuggingmedium
714
from-scratcheasy
668
backendeasy
337

All Results

TaskCategoryScore
Fix 12 WCAG accessibility violations in checkout formfrontend52.3
Build SaaS admin dashboard from scratchfrom-scratch51.5
Build materialized view refresh pipeline for analyticsbackend60.4
Fix React hydration mismatchfrontend85.9
Port Python CLI to Rustmulti-language33.0
Implement JWT auth middlewarebackend38.8
Migrate callback-hell Express app to async/awaitrefactoring62.7
Implement zero-trust API authentication layerbackend65.0
Fix N+1 query in dashboardbackend44.4
Add GraphQL layer over REST APImulti-language69.2
Find and fix 4 hidden backdoors in Flask appdebugging42.0
Add cursor-based pagination to REST APIbackend80.2
Implement background job scheduler with persistencebackend39.2
Convert React app to PWA with offline supportfrontend67.7
Write Kubernetes manifests for Node.js microservicefull-stack82.4
Fix memory leak in event handlerdebugging72.0
Write complex SQL report with window functionsbackend70.2
Add virtual scrolling to table rendering 5000 rowsfrontend34.0
Write integration tests for payment flowcode-review80.5
Implement multi-tenant row-level security in Postgresbackend51.5
Add file upload with S3 presigned URLsbackend53.0
Implement transformer inference engine with KV cachefrom-scratch76.2
Add rate limiting middlewarebackend40.2
Find and patch all OWASP Top 10 vulnerabilitiesdebugging64.3
Fix Node.js stream backpressure causing OOM on large filesbackend85.9
Fix hallucination and context window bugs in RAG agentbackend30.3
Code review: identify security vulnscode-review75.0
Refactor monolithic handler to CQRSrefactoring33.5
Add slash commands and moderation to Discord botbackend70.8
Replace console.log with structured loggingrefactoring38.5
Build codebase indexer for LLM context windowsfrom-scratch40.2
Build CLI tool with subcommands and configfrom-scratch29.8
Fix race conditions in order matching enginebackend81.5
Split 1100-line god file into proper modulesrefactoring70.8
Add Google OAuth2 login to Express appfull-stack55.6
Fix flaky test suitedebugging69.5
Fix broken GitHub Actions CI pipelinedebugging67.2
Build distributed node cluster with gossip protocolfrom-scratch28.9
Add streaming SSE endpoint for LLM chatbackend67.0
Add i18n with locale routing to Next.js appfull-stack66.8
Build production website with auth and members areafrontend48.8
Build MCP server for database managementbackend53.5
Remove AI slop and over-engineering from codebaserefactoring72.9
Build REST API from scratchfrom-scratch68.5
Add retry logic and dead letter queue to Python task queuebackend64.1
Optimize bloated React bundle under 500KBfrontend62.6
Zero-downtime schema migrationfull-stack80.5
Fix broken responsive layoutfrontend68.5
Build LLM evaluation harness with structured gradingbackend50.0
Optimize slow Postgres queries in Flask appbackend53.0
Build RAG pipeline with vector searchbackend46.2
Debug race condition in worker pooldebugging80.2
Add WebSocket real-time updatesfull-stack56.6
Debug and fix 6 broken database triggers and constraintsdebugging58.7
Build real-time portfolio risk calculatorbackend29.9
Dockerize Node.js monorepofull-stack65.8
Build terminal UI dashboardfrom-scratch50.3
Add Redis caching layer to Express APIbackend74.3
Add caching layer to eliminate slow SSR page loadsfull-stack81.7
Harden insecure Docker setup with 12 vulnerabilitiescode-review70.6
Fix data integrity bugs in denormalized e-commerce schemadebugging77.7
Implement Stripe webhook handlerbackend59.3
Write tests for untested legacy Flask servicecode-review32.1
Fix auth bypass vulnerabilitydebugging45.8
Fix deadlocking transaction patterns in Flask appbackend63.0
Fix broken responsive layoutfrontend55.5
Remove AI slop and over-engineering from codebaserefactoring77.5
Split 1100-line god file into proper modulesrefactoring65.2
Write Kubernetes manifests for Node.js microservicefull-stack3.5
Implement zero-trust API authentication layerbackend26.1
Implement JWT auth middlewarebackend79.5
Add i18n with locale routing to Next.js appfull-stack57.4
Convert React app to PWA with offline supportfrontend47.0
Add caching layer to eliminate slow SSR page loadsfull-stack71.1
Dockerize Node.js monorepofull-stack53.8
Find and patch all OWASP Top 10 vulnerabilitiesdebugging53.5
Optimize bloated React bundle under 500KBfrontend64.0
Replace console.log with structured loggingrefactoring47.8
Implement multi-tenant row-level security in Postgresbackend71.7
Build codebase indexer for LLM context windowsfrom-scratch47.6
Harden insecure Docker setup with 12 vulnerabilitiescode-review62.9
Implement transformer inference engine with KV cachefrom-scratch57.8
Implement background job scheduler with persistencebackend51.3
Add retry logic and dead letter queue to Python task queuebackend74.3
Build MCP server for database managementbackend65.7
Debug and fix 6 broken database triggers and constraintsdebugging63.1
Fix data integrity bugs in denormalized e-commerce schemadebugging57.5
Fix flaky test suitedebugging65.8
Build LLM evaluation harness with structured gradingbackend55.8
Fix race conditions in order matching enginebackend84.4
Add slash commands and moderation to Discord botbackend75.8
Build real-time portfolio risk calculatorbackend39.0
Zero-downtime schema migrationfull-stack85.6
Write complex SQL report with window functionsbackend78.9
Build production website with auth and members areafrontend46.8
Build SaaS admin dashboard from scratchfrom-scratch50.5
Build CLI tool with subcommands and configfrom-scratch36.8
Fix hallucination and context window bugs in RAG agentbackend52.0
Build materialized view refresh pipeline for analyticsbackend30.0
Fix deadlocking transaction patterns in Flask appbackend33.8
Find and fix 4 hidden backdoors in Flask appdebugging80.8
Write tests for untested legacy Flask servicecode-review47.2
Add Google OAuth2 login to Express appfull-stack63.9
Fix 12 WCAG accessibility violations in checkout formfrontend73.0
Optimize slow Postgres queries in Flask appbackend66.1
Add virtual scrolling to table rendering 5000 rowsfrontend66.0
Fix Node.js stream backpressure causing OOM on large filesbackend89.5
Build distributed node cluster with gossip protocolfrom-scratch28.4
Write integration tests for payment flowcode-review67.5
Fix auth bypass vulnerabilitydebugging78.4
Add GraphQL layer over REST APImulti-language57.0
Add rate limiting middlewarebackend50.5
Implement Stripe webhook handlerbackend37.5
Fix N+1 query in dashboardbackend65.4
Refactor monolithic handler to CQRSrefactoring53.9
Fix memory leak in event handlerdebugging63.4
Fix React hydration mismatchfrontend68.3
Build terminal UI dashboardfrom-scratch41.2
Debug race condition in worker pooldebugging58.0
Build REST API from scratchfrom-scratch70.2