APEX
Back to models

Qwen3.6 Plus

OpenRouter

1000K context$0.33/M input$1.95/M output
1610peak 1627

Avg Score

77.8

Avg Cost

$0.41

Score/$

191.3

Runs

70

Win/Loss/Draw

Scoring Dimensions

Score Distribution

Category ELOs

refactoringexpert
2549
frontendexpert
2338
multi-languagehard
2063
backendeasy
2038
from-scratchhard
1849
from-scratchmedium
1832
debuggingmedium
1812
frontendhard
1811
code-reviewmedium
1797
backendexpert
1773
backendhard
1690
code-review
1681
from-scratch
1653
backend
1651
debugging
1600
from-scratchexpert
1594
backendmedium
1592
debuggingexpert
1588
full-stackmedium
1580
debugginghard
1564
refactoring
1559
frontend
1551
multi-language
1545
full-stack
1539
frontendmedium
1509
full-stackhard
1507
refactoringmedium
1444
code-reviewhard
1442
frontendmaster
1399
from-scratcheasy
1394
frontendeasy
1326
backendmaster
1263
multi-languageexpert
1035

All Results

TaskCategoryScore
Add file upload with S3 presigned URLsbackend62.1
Implement JWT auth middlewarebackend79.9
Migrate Express monolith to modular architecturebackend33.1
Build interactive data visualization dashboardfrontend49.9
Optimize bloated React bundle under 500KBfrontend81.0
Build 3D browser game with physics and multiplayer syncfrontend74.5
Add streaming SSE endpoint for LLM chatbackend72.8
Fix memory leak in event handlerdebugging80.0
Build multi-tool LLM agent runtimebackend81.0
Fix and extend Chrome browser extensionfrontend73.0
Convert React app to PWA with offline supportfrontend78.8
Build MCP server for database managementbackend82.1
Replace console.log with structured loggingrefactoring43.7
Build SaaS admin dashboard from scratchfrom-scratch72.9
Harden insecure Docker setup with 12 vulnerabilitiescode-review89.3
Migrate callback-hell Express app to async/awaitrefactoring87.5
Build RAG pipeline with vector searchbackend70.4
Fix hallucination and context window bugs in RAG agentbackend81.2
Fix Node.js stream backpressure causing OOM on large filesbackend88.7
Build distributed node cluster with gossip protocolfrom-scratch79.5
Build LLM evaluation harness with structured gradingbackend83.8
Write complex SQL report with window functionsbackend88.3
Build terminal UI dashboardfrom-scratch69.4
Implement zero-trust API authentication layerbackend76.1
Build CLI tool with subcommands and configfrom-scratch78.8
Add cursor-based pagination to REST APIbackend82.5
Build materialized view refresh pipeline for analyticsbackend76.7
Implement multi-tenant row-level security in Postgresbackend82.8
Implement transformer inference engine with KV cachefrom-scratch78.0
Add Redis caching layer to Express APIbackend81.6
Implement background job scheduler with persistencebackend71.2
Fix broken responsive layoutfrontend70.5
Find and patch all OWASP Top 10 vulnerabilitiesdebugging71.9
Write Kubernetes manifests for Node.js microservicefull-stack83.4
Fix broken GitHub Actions CI pipelinedebugging90.9
Fix race conditions in order matching enginebackend84.4
Add caching layer to eliminate slow SSR page loadsfull-stack83.7
Fix deadlocking transaction patterns in Flask appbackend86.8
Debug race condition in worker pooldebugging87.8
Build production website with auth and members areafrontend80.8
Fix 12 WCAG accessibility violations in checkout formfrontend84.4
Split 1100-line god file into proper modulesrefactoring76.8
Write integration tests for payment flowcode-review71.8
Add retry logic and dead letter queue to Python task queuebackend84.5
Add slash commands and moderation to Discord botbackend81.7
Remove AI slop and over-engineering from codebaserefactoring76.7
Debug and fix 6 broken database triggers and constraintsdebugging87.0
Build real-time portfolio risk calculatorbackend79.5
Build codebase indexer for LLM context windowsfrom-scratch72.5
Add Google OAuth2 login to Express appfull-stack61.4
Write tests for untested legacy Flask servicecode-review87.3
Add GraphQL layer over REST APImulti-language84.8
Build REST API from scratchfrom-scratch78.7
Add virtual scrolling to table rendering 5000 rowsfrontend74.0
Optimize slow Postgres queries in Flask appbackend87.3
Port Python CLI to Rustmulti-language45.2
Fix data integrity bugs in denormalized e-commerce schemadebugging81.6
Add i18n with locale routing to Next.js appfull-stack77.0
Refactor monolithic handler to CQRSrefactoring85.8
Code review: identify security vulnscode-review81.1
Add WebSocket real-time updatesfull-stack84.4
Fix N+1 query in dashboardbackend79.0
Fix auth bypass vulnerabilitydebugging85.0
Fix flaky test suitedebugging91.7
Find and fix 4 hidden backdoors in Flask appdebugging89.1
Add rate limiting middlewarebackend82.3
Zero-downtime schema migrationfull-stack72.8
Dockerize Node.js monorepofull-stack78.7
Fix React hydration mismatchfrontend70.2
Implement Stripe webhook handlerbackend86.0