APEX
Back to models

Qwen3.5 27b

OpenRouter

262K context$0.30/M input$2.40/M output
1421peak 1435

Avg Score

67.9

Avg Cost

$0.12

Score/$

563.4

Runs

70

Win/Loss/Draw

Scoring Dimensions

Score Distribution

Category ELOs

refactoringexpert
2425
multi-languagehard
2030
code-reviewhard
1681
frontendhard
1632
debuggingmedium
1625
full-stackhard
1620
from-scratchexpert
1582
backendmedium
1561
debuggingexpert
1556
debugging
1533
debugginghard
1516
multi-language
1513
full-stack
1486
backendexpert
1464
backend
1439
backendmaster
1433
frontendexpert
1403
frontendmaster
1393
code-review
1388
refactoring
1357
code-reviewmedium
1323
frontend
1309
from-scratch
1302
backendhard
1299
full-stackmedium
1274
from-scratcheasy
1249
from-scratchhard
1198
frontendmedium
1191
refactoringmedium
1181
from-scratchmedium
1039
multi-languageexpert
934
frontendeasy
761
backendeasy
698

All Results

TaskCategoryScore
Fix and extend Chrome browser extensionfrontend32.0
Build interactive data visualization dashboardfrontend73.0
Build multi-tool LLM agent runtimebackend81.7
Migrate Express monolith to modular architecturebackend65.7
Build 3D browser game with physics and multiplayer syncfrontend77.8
Build production website with auth and members areafrontend66.1
Write integration tests for payment flowcode-review76.9
Build SaaS admin dashboard from scratchfrom-scratch73.0
Implement background job scheduler with persistencebackend43.5
Fix hallucination and context window bugs in RAG agentbackend35.2
Implement multi-tenant row-level security in Postgresbackend37.5
Write tests for untested legacy Flask servicecode-review50.3
Refactor monolithic handler to CQRSrefactoring84.3
Optimize bloated React bundle under 500KBfrontend69.5
Add virtual scrolling to table rendering 5000 rowsfrontend30.6
Add WebSocket real-time updatesfull-stack82.4
Fix broken responsive layoutfrontend62.3
Write Kubernetes manifests for Node.js microservicefull-stack80.5
Build distributed node cluster with gossip protocolfrom-scratch40.5
Add cursor-based pagination to REST APIbackend82.0
Build materialized view refresh pipeline for analyticsbackend72.7
Build CLI tool with subcommands and configfrom-scratch38.8
Fix flaky test suitedebugging87.5
Fix broken GitHub Actions CI pipelinedebugging85.5
Implement zero-trust API authentication layerbackend71.8
Remove AI slop and over-engineering from codebaserefactoring76.1
Add i18n with locale routing to Next.js appfull-stack69.3
Fix 12 WCAG accessibility violations in checkout formfrontend81.1
Build real-time portfolio risk calculatorbackend50.0
Add Redis caching layer to Express APIbackend84.9
Fix race conditions in order matching enginebackend87.6
Debug and fix 6 broken database triggers and constraintsdebugging88.8
Zero-downtime schema migrationfull-stack82.2
Add retry logic and dead letter queue to Python task queuebackend76.6
Dockerize Node.js monorepofull-stack67.2
Find and patch all OWASP Top 10 vulnerabilitiesdebugging67.8
Fix memory leak in event handlerdebugging80.1
Add file upload with S3 presigned URLsbackend80.8
Write complex SQL report with window functionsbackend60.9
Optimize slow Postgres queries in Flask appbackend81.0
Add caching layer to eliminate slow SSR page loadsfull-stack75.0
Build terminal UI dashboardfrom-scratch56.0
Replace console.log with structured loggingrefactoring53.8
Fix Node.js stream backpressure causing OOM on large filesbackend87.0
Build LLM evaluation harness with structured gradingbackend78.8
Add Google OAuth2 login to Express appfull-stack80.7
Fix auth bypass vulnerabilitydebugging90.8
Build codebase indexer for LLM context windowsfrom-scratch28.8
Fix N+1 query in dashboardbackend61.2
Add streaming SSE endpoint for LLM chatbackend82.7
Migrate callback-hell Express app to async/awaitrefactoring41.0
Build MCP server for database managementbackend51.4
Build RAG pipeline with vector searchbackend47.9
Add GraphQL layer over REST APImulti-language83.8
Add rate limiting middlewarebackend46.3
Implement transformer inference engine with KV cachefrom-scratch77.5
Harden insecure Docker setup with 12 vulnerabilitiescode-review70.0
Build REST API from scratchfrom-scratch74.6
Code review: identify security vulnscode-review77.0
Fix deadlocking transaction patterns in Flask appbackend72.3
Find and fix 4 hidden backdoors in Flask appdebugging79.3
Implement JWT auth middlewarebackend53.1
Fix data integrity bugs in denormalized e-commerce schemadebugging81.5
Fix React hydration mismatchfrontend58.0
Debug race condition in worker pooldebugging86.6
Implement Stripe webhook handlerbackend79.5
Split 1100-line god file into proper modulesrefactoring60.8
Add slash commands and moderation to Discord botbackend65.0
Convert React app to PWA with offline supportfrontend71.2
Port Python CLI to Rustmulti-language43.8