APEX
Back to models

Qwen3.5 27b

OpenRouter

262K context$0.30/M input$2.40/M output
1573peak 1584

Avg Score

70.0

Avg Cost

$0.32

Score/$

217.0

Runs

122

Win/Loss/Draw

Scoring Dimensions

Score Distribution

Category ELOs

multi-languagehard
2101
refactoringexpert
2074
code-reviewhard
1948
frontendexpert
1838
backendexpert
1739
full-stackhard
1737
backendeasy
1714
debuggingmedium
1713
code-review
1698
code-reviewmedium
1698
refactoring
1670
from-scratchexpert
1658
debuggingexpert
1657
refactoringmedium
1640
multi-language
1638
full-stack
1633
frontendhard
1622
debugging
1619
backend
1607
debugginghard
1604
backendhard
1601
full-stackmedium
1581
backendmedium
1575
frontendeasy
1414
from-scratcheasy
1413
frontend
1400
from-scratch
1394
from-scratchhard
1384
from-scratchmedium
1380
frontendmedium
1292
multi-languageexpert
1227

All Results

TaskCategoryScore
Build production website with auth and members areafrontend66.1
Write integration tests for payment flowcode-review76.9
Build SaaS admin dashboard from scratchfrom-scratch73.0
Implement background job scheduler with persistencebackend43.5
Fix hallucination and context window bugs in RAG agentbackend35.2
Implement multi-tenant row-level security in Postgresbackend37.5
Write tests for untested legacy Flask servicecode-review50.3
Refactor monolithic handler to CQRSrefactoring84.3
Optimize bloated React bundle under 500KBfrontend69.5
Add virtual scrolling to table rendering 5000 rowsfrontend30.6
Add WebSocket real-time updatesfull-stack82.4
Fix broken responsive layoutfrontend62.3
Write Kubernetes manifests for Node.js microservicefull-stack80.5
Build distributed node cluster with gossip protocolfrom-scratch33.7
Add cursor-based pagination to REST APIbackend82.0
Build materialized view refresh pipeline for analyticsbackend72.7
Build CLI tool with subcommands and configfrom-scratch30.8
Fix flaky test suitedebugging87.5
Fix broken GitHub Actions CI pipelinedebugging85.5
Implement zero-trust API authentication layerbackend71.8
Remove AI slop and over-engineering from codebaserefactoring76.1
Add i18n with locale routing to Next.js appfull-stack69.3
Fix 12 WCAG accessibility violations in checkout formfrontend81.1
Build real-time portfolio risk calculatorbackend50.0
Add Redis caching layer to Express APIbackend84.9
Fix race conditions in order matching enginebackend87.6
Debug and fix 6 broken database triggers and constraintsdebugging88.8
Zero-downtime schema migrationfull-stack82.2
Add retry logic and dead letter queue to Python task queuebackend76.6
Dockerize Node.js monorepofull-stack67.2
Find and patch all OWASP Top 10 vulnerabilitiesdebugging67.8
Fix memory leak in event handlerdebugging80.1
Add file upload with S3 presigned URLsbackend80.8
Write complex SQL report with window functionsbackend60.9
Optimize slow Postgres queries in Flask appbackend81.0
Add caching layer to eliminate slow SSR page loadsfull-stack75.0
Build terminal UI dashboardfrom-scratch50.4
Replace console.log with structured loggingrefactoring53.8
Fix Node.js stream backpressure causing OOM on large filesbackend87.0
Build LLM evaluation harness with structured gradingbackend78.8
Add Google OAuth2 login to Express appfull-stack80.7
Fix auth bypass vulnerabilitydebugging90.8
Build codebase indexer for LLM context windowsfrom-scratch28.8
Fix N+1 query in dashboardbackend61.2
Add streaming SSE endpoint for LLM chatbackend82.7
Migrate callback-hell Express app to async/awaitrefactoring41.0
Build MCP server for database managementbackend51.4
Build RAG pipeline with vector searchbackend47.9
Add GraphQL layer over REST APImulti-language83.8
Add rate limiting middlewarebackend46.3
Implement transformer inference engine with KV cachefrom-scratch77.5
Harden insecure Docker setup with 12 vulnerabilitiescode-review70.0
Build REST API from scratchfrom-scratch74.6
Code review: identify security vulnscode-review77.0
Fix deadlocking transaction patterns in Flask appbackend72.3
Find and fix 4 hidden backdoors in Flask appdebugging79.3
Implement JWT auth middlewarebackend53.1
Fix data integrity bugs in denormalized e-commerce schemadebugging81.5
Fix React hydration mismatchfrontend58.0
Debug race condition in worker pooldebugging86.6
Implement Stripe webhook handlerbackend79.5
Split 1100-line god file into proper modulesrefactoring60.8
Add slash commands and moderation to Discord botbackend65.0
Convert React app to PWA with offline supportfrontend71.2
Port Python CLI to Rustmulti-language43.8
Fix 12 WCAG accessibility violations in checkout formfrontend82.3
Add WebSocket real-time updatesfull-stack73.6
Code review: identify security vulnscode-review88.1
Migrate callback-hell Express app to async/awaitrefactoring74.0
Port Python CLI to Rustmulti-language56.7
Debug and fix 6 broken database triggers and constraintsdebugging82.7
Convert React app to PWA with offline supportfrontend56.1
Harden insecure Docker setup with 12 vulnerabilitiescode-review73.1
Fix broken responsive layoutfrontend72.0
Build codebase indexer for LLM context windowsfrom-scratch47.3
Implement zero-trust API authentication layerbackend66.5
Split 1100-line god file into proper modulesrefactoring77.9
Implement JWT auth middlewarebackend81.0
Find and patch all OWASP Top 10 vulnerabilitiesdebugging68.8
Optimize bloated React bundle under 500KBfrontend69.8
Replace console.log with structured loggingrefactoring47.3
Add file upload with S3 presigned URLsbackend66.8
Implement multi-tenant row-level security in Postgresbackend71.3
Add i18n with locale routing to Next.js appfull-stack66.7
Dockerize Node.js monorepofull-stack71.5
Remove AI slop and over-engineering from codebaserefactoring90.2
Add caching layer to eliminate slow SSR page loadsfull-stack78.6
Write Kubernetes manifests for Node.js microservicefull-stack90.8
Write complex SQL report with window functionsbackend78.9
Find and fix 4 hidden backdoors in Flask appdebugging91.2
Write integration tests for payment flowcode-review82.5
Build real-time portfolio risk calculatorbackend79.8
Build CLI tool with subcommands and configfrom-scratch41.5
Fix Node.js stream backpressure causing OOM on large filesbackend82.4
Zero-downtime schema migrationfull-stack83.0
Implement transformer inference engine with KV cachefrom-scratch72.3
Build distributed node cluster with gossip protocolfrom-scratch25.3
Fix data integrity bugs in denormalized e-commerce schemadebugging88.5
Refactor monolithic handler to CQRSrefactoring84.8
Build terminal UI dashboardfrom-scratch57.5
Add Google OAuth2 login to Express appfull-stack61.5
Build REST API from scratchfrom-scratch81.1
Write tests for untested legacy Flask servicecode-review89.7
Build SaaS admin dashboard from scratchfrom-scratch72.6
Implement background job scheduler with persistencebackend53.5
Add retry logic and dead letter queue to Python task queuebackend77.8
Fix auth bypass vulnerabilitydebugging82.0
Build MCP server for database managementbackend82.3
Add Redis caching layer to Express APIbackend64.0
Fix hallucination and context window bugs in RAG agentbackend65.0
Add slash commands and moderation to Discord botbackend74.8
Fix N+1 query in dashboardbackend67.5
Fix race conditions in order matching enginebackend87.7
Add rate limiting middlewarebackend76.8
Build LLM evaluation harness with structured gradingbackend69.5
Build production website with auth and members areafrontend63.0
Optimize slow Postgres queries in Flask appbackend84.4
Build materialized view refresh pipeline for analyticsbackend77.0
Build RAG pipeline with vector searchbackend69.6
Fix deadlocking transaction patterns in Flask appbackend78.5
Fix flaky test suitedebugging79.8
Debug race condition in worker pooldebugging57.8