APEX
Qwen3 Coder Flash

OpenRouter

1000K context, $0.30/M input, $1.50/M output
1267 (peak 1284)

Avg Score: 59.5
Avg Cost: $0.01
Score/$: 4866.3
Runs: 65
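The summary figures can be cross-checked with a little arithmetic. The sketch below assumes Score/$ is roughly the average score divided by the average cost per run; the page does not state the formula, so treat this as a guess (it could equally be a mean of per-run ratios). Under that assumption, dividing the two displayed averages does not reproduce 4866.3, which is consistent with the displayed $0.01 being rounded to the nearest cent.

```python
# Figures copied from the summary on this page.
avg_score = 59.5
avg_cost_displayed = 0.01   # USD, as displayed (apparently rounded to cents)
score_per_dollar = 4866.3

# Naive ratio of the two displayed averages.
naive_ratio = avg_score / avg_cost_displayed

# Average cost implied IF Score/$ = avg_score / avg_cost (an assumption,
# not something the page confirms).
implied_avg_cost = avg_score / score_per_dollar

print(f"naive ratio: {naive_ratio:.1f}")                  # 5950.0
print(f"implied average cost: ${implied_avg_cost:.4f}")   # $0.0122
```

An implied cost of about $0.0122 per run rounds to the $0.01 shown, so the three figures are mutually consistent under this reading.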

[Charts not reproduced: Win/Loss/Draw, Scoring Dimensions, Score Distribution]

Category ELOs

Category (difficulty) | ELO
code-review (hard) | 1983
from-scratch (expert) | 1505
full-stack (medium) | 1383
backend (medium) | 1375
full-stack | 1351
frontend (medium) | 1346
code-review | 1343
full-stack (hard) | 1308
backend | 1265
frontend | 1264
debugging (expert) | 1247
refactoring (medium) | 1236
backend (expert) | 1222
debugging | 1220
multi-language (hard) | 1213
refactoring | 1210
code-review (medium) | 1203
from-scratch | 1202
frontend (easy) | 1191
debugging (hard) | 1166
backend (hard) | 1149
multi-language | 1140
debugging (medium) | 1106
from-scratch (hard) | 1095
from-scratch (easy) | 917
from-scratch (medium) | 794
frontend (expert) | 592
backend (easy) | 416
multi-language (expert) | 285
frontend (hard) | 213
refactoring (expert) | 16

All Results

Task | Category | Score
Fix 12 WCAG accessibility violations in checkout form | frontend | 52.3
Build SaaS admin dashboard from scratch | from-scratch | 51.5
Build materialized view refresh pipeline for analytics | backend | 60.4
Fix React hydration mismatch | frontend | 85.9
Port Python CLI to Rust | multi-language | 33.0
Implement JWT auth middleware | backend | 38.8
Migrate callback-hell Express app to async/await | refactoring | 62.7
Implement zero-trust API authentication layer | backend | 65.0
Fix N+1 query in dashboard | backend | 44.4
Add GraphQL layer over REST API | multi-language | 69.2
Find and fix 4 hidden backdoors in Flask app | debugging | 42.0
Add cursor-based pagination to REST API | backend | 80.2
Implement background job scheduler with persistence | backend | 39.2
Convert React app to PWA with offline support | frontend | 67.7
Write Kubernetes manifests for Node.js microservice | full-stack | 82.4
Fix memory leak in event handler | debugging | 72.0
Write complex SQL report with window functions | backend | 70.2
Add virtual scrolling to table rendering 5000 rows | frontend | 34.0
Write integration tests for payment flow | code-review | 80.5
Implement multi-tenant row-level security in Postgres | backend | 51.5
Add file upload with S3 presigned URLs | backend | 53.0
Implement transformer inference engine with KV cache | from-scratch | 76.2
Add rate limiting middleware | backend | 40.2
Find and patch all OWASP Top 10 vulnerabilities | debugging | 64.3
Fix Node.js stream backpressure causing OOM on large files | backend | 85.9
Fix hallucination and context window bugs in RAG agent | backend | 30.3
Code review: identify security vulns | code-review | 75.0
Refactor monolithic handler to CQRS | refactoring | 33.5
Add slash commands and moderation to Discord bot | backend | 70.8
Replace console.log with structured logging | refactoring | 38.5
Build codebase indexer for LLM context windows | from-scratch | 40.2
Build CLI tool with subcommands and config | from-scratch | 35.8
Fix race conditions in order matching engine | backend | 81.5
Split 1100-line god file into proper modules | refactoring | 70.8
Add Google OAuth2 login to Express app | full-stack | 55.6
Fix flaky test suite | debugging | 69.5
Fix broken GitHub Actions CI pipeline | debugging | 67.2
Build distributed node cluster with gossip protocol | from-scratch | 40.1
Add streaming SSE endpoint for LLM chat | backend | 67.0
Add i18n with locale routing to Next.js app | full-stack | 66.8
Build production website with auth and members area | frontend | 52.0
Build MCP server for database management | backend | 53.5
Remove AI slop and over-engineering from codebase | refactoring | 72.9
Build REST API from scratch | from-scratch | 68.5
Add retry logic and dead letter queue to Python task queue | backend | 64.1
Optimize bloated React bundle under 500KB | frontend | 62.6
Zero-downtime schema migration | full-stack | 80.5
Fix broken responsive layout | frontend | 68.5
Build LLM evaluation harness with structured grading | backend | 50.0
Optimize slow Postgres queries in Flask app | backend | 53.0
Build RAG pipeline with vector search | backend | 46.2
Debug race condition in worker pool | debugging | 80.2
Add WebSocket real-time updates | full-stack | 56.6
Debug and fix 6 broken database triggers and constraints | debugging | 58.7
Build real-time portfolio risk calculator | backend | 29.9
Dockerize Node.js monorepo | full-stack | 65.8
Build terminal UI dashboard | from-scratch | 52.3
Add Redis caching layer to Express API | backend | 74.3
Add caching layer to eliminate slow SSR page loads | full-stack | 81.7
Harden insecure Docker setup with 12 vulnerabilities | code-review | 70.6
Fix data integrity bugs in denormalized e-commerce schema | debugging | 77.7
Implement Stripe webhook handler | backend | 59.3
Write tests for untested legacy Flask service | code-review | 32.1
Fix auth bypass vulnerability | debugging | 45.8
Fix deadlocking transaction patterns in Flask app | backend | 63.0