APEX
Back to models

Qwen3.5 122b A10b

OpenRouter

262K context$0.40/M input$3.20/M output
1564peak 1574

Avg Score

69.2

Avg Cost

$0.38

Score/$

182.8

Runs

124

Win/Loss/Draw

Scoring Dimensions

Score Distribution

Category ELOs

multi-languagehard
1979
code-reviewmedium
1951
frontendhard
1887
refactoringexpert
1822
frontendexpert
1764
code-review
1758
from-scratchexpert
1733
debuggingexpert
1720
backendexpert
1684
backendhard
1624
backendeasy
1620
full-stackhard
1604
debuggingmedium
1603
debugging
1597
backend
1594
refactoring
1592
multi-language
1574
debugginghard
1561
refactoringmedium
1549
backendmedium
1548
frontend
1535
full-stack
1526
from-scratchmedium
1489
frontendeasy
1486
frontendmedium
1474
full-stackmedium
1473
code-reviewhard
1422
from-scratch
1406
from-scratchhard
1383
from-scratcheasy
1373
multi-languageexpert
1168

All Results

TaskCategoryScore
Build LLM evaluation harness with structured gradingbackend67.3
Fix React hydration mismatchfrontend79.5
Build SaaS admin dashboard from scratchfrom-scratch54.2
Add i18n with locale routing to Next.js appfull-stack70.0
Build distributed node cluster with gossip protocolfrom-scratch30.6
Add rate limiting middlewarebackend35.3
Add streaming SSE endpoint for LLM chatbackend82.5
Add GraphQL layer over REST APImulti-language80.5
Add caching layer to eliminate slow SSR page loadsfull-stack77.3
Implement multi-tenant row-level security in Postgresbackend34.6
Add slash commands and moderation to Discord botbackend60.6
Build REST API from scratchfrom-scratch74.3
Port Python CLI to Rustmulti-language33.8
Add virtual scrolling to table rendering 5000 rowsfrontend82.8
Code review: identify security vulnscode-review37.2
Build codebase indexer for LLM context windowsfrom-scratch37.3
Fix Node.js stream backpressure causing OOM on large filesbackend88.3
Dockerize Node.js monorepofull-stack67.1
Fix broken GitHub Actions CI pipelinedebugging85.9
Optimize slow Postgres queries in Flask appbackend71.1
Convert React app to PWA with offline supportfrontend71.6
Replace console.log with structured loggingrefactoring54.5
Build MCP server for database managementbackend82.4
Implement Stripe webhook handlerbackend61.5
Refactor monolithic handler to CQRSrefactoring49.7
Add Redis caching layer to Express APIbackend84.2
Fix broken responsive layoutfrontend70.0
Implement JWT auth middlewarebackend47.9
Add retry logic and dead letter queue to Python task queuebackend67.2
Implement transformer inference engine with KV cachefrom-scratch80.7
Implement background job scheduler with persistencebackend50.0
Find and patch all OWASP Top 10 vulnerabilitiesdebugging65.5
Add cursor-based pagination to REST APIbackend82.8
Optimize bloated React bundle under 500KBfrontend64.5
Implement zero-trust API authentication layerbackend71.3
Fix deadlocking transaction patterns in Flask appbackend57.8
Zero-downtime schema migrationfull-stack82.9
Build real-time portfolio risk calculatorbackend55.6
Fix auth bypass vulnerabilitydebugging87.0
Find and fix 4 hidden backdoors in Flask appdebugging86.2
Write integration tests for payment flowcode-review75.5
Fix flaky test suitedebugging70.5
Fix race conditions in order matching enginebackend86.5
Split 1100-line god file into proper modulesrefactoring71.2
Harden insecure Docker setup with 12 vulnerabilitiescode-review81.0
Debug and fix 6 broken database triggers and constraintsdebugging90.9
Remove AI slop and over-engineering from codebaserefactoring87.5
Build RAG pipeline with vector searchbackend48.5
Fix N+1 query in dashboardbackend52.4
Add Google OAuth2 login to Express appfull-stack78.3
Migrate callback-hell Express app to async/awaitrefactoring63.5
Build terminal UI dashboardfrom-scratch49.1
Fix 12 WCAG accessibility violations in checkout formfrontend79.0
Fix data integrity bugs in denormalized e-commerce schemadebugging82.4
Write complex SQL report with window functionsbackend75.0
Add WebSocket real-time updatesfull-stack67.6
Debug race condition in worker pooldebugging88.8
Fix hallucination and context window bugs in RAG agentbackend48.3
Add file upload with S3 presigned URLsbackend82.0
Write tests for untested legacy Flask servicecode-review56.6
Fix memory leak in event handlerdebugging80.3
Build CLI tool with subcommands and configfrom-scratch42.6
Build production website with auth and members areafrontend62.5
Build materialized view refresh pipeline for analyticsbackend71.6
Write Kubernetes manifests for Node.js microservicefull-stack85.0
Port Python CLI to Rustmulti-language54.9
Migrate callback-hell Express app to async/awaitrefactoring79.7
Add GraphQL layer over REST APImulti-language75.5
Add WebSocket real-time updatesfull-stack71.5
Fix 12 WCAG accessibility violations in checkout formfrontend86.1
Code review: identify security vulnscode-review90.9
Add i18n with locale routing to Next.js appfull-stack66.9
Optimize bloated React bundle under 500KBfrontend69.0
Dockerize Node.js monorepofull-stack67.2
Write Kubernetes manifests for Node.js microservicefull-stack88.5
Find and patch all OWASP Top 10 vulnerabilitiesdebugging73.2
Fix broken responsive layoutfrontend72.3
Split 1100-line god file into proper modulesrefactoring68.6
Add file upload with S3 presigned URLsbackend70.5
Convert React app to PWA with offline supportfrontend50.8
Implement zero-trust API authentication layerbackend75.8
Remove AI slop and over-engineering from codebaserefactoring76.5
Build codebase indexer for LLM context windowsfrom-scratch41.9
Harden insecure Docker setup with 12 vulnerabilitiescode-review77.3
Replace console.log with structured loggingrefactoring53.7
Implement JWT auth middlewarebackend79.8
Implement multi-tenant row-level security in Postgresbackend64.5
Add caching layer to eliminate slow SSR page loadsfull-stack74.3
Write complex SQL report with window functionsbackend89.3
Write integration tests for payment flowcode-review33.5
Build real-time portfolio risk calculatorbackend74.8
Fix race conditions in order matching enginebackend84.5
Build CLI tool with subcommands and configfrom-scratch37.6
Build SaaS admin dashboard from scratchfrom-scratch45.1
Build distributed node cluster with gossip protocolfrom-scratch53.2
Implement background job scheduler with persistencebackend42.3
Fix N+1 query in dashboardbackend67.2
Debug and fix 6 broken database triggers and constraintsdebugging89.3
Zero-downtime schema migrationfull-stack79.5
Add retry logic and dead letter queue to Python task queuebackend73.1
Build MCP server for database managementbackend74.3
Implement transformer inference engine with KV cachefrom-scratch74.0
Fix data integrity bugs in denormalized e-commerce schemadebugging88.0
Write tests for untested legacy Flask servicecode-review84.8
Fix Node.js stream backpressure causing OOM on large filesbackend74.2
Add Google OAuth2 login to Express appfull-stack68.8
Add virtual scrolling to table rendering 5000 rowsfrontend76.1
Add rate limiting middlewarebackend76.3
Build terminal UI dashboardfrom-scratch60.1
Fix React hydration mismatchfrontend68.9
Fix memory leak in event handlerdebugging66.2
Build REST API from scratchfrom-scratch80.0
Build LLM evaluation harness with structured gradingbackend73.6
Fix hallucination and context window bugs in RAG agentbackend76.0
Build materialized view refresh pipeline for analyticsbackend71.1
Optimize slow Postgres queries in Flask appbackend78.5
Build production website with auth and members areafrontend61.0
Find and fix 4 hidden backdoors in Flask appdebugging87.0
Add slash commands and moderation to Discord botbackend73.0
Build RAG pipeline with vector searchbackend68.5
Fix deadlocking transaction patterns in Flask appbackend89.7
Fix flaky test suitedebugging74.5
Refactor monolithic handler to CQRSrefactoring69.1
Debug race condition in worker pooldebugging67.0