APEX
Back to models

Qwen3.5 Plus 02.15

OpenRouter

1000K context$0.40/M input$2.40/M output
1511peak 1534

Avg Score

58.1

Avg Cost

$0.10

Score/$

568.2

Runs

122

Win/Loss/Draw

Scoring Dimensions

Score Distribution

Category ELOs

refactoringexpert
2010
from-scratchexpert
1809
full-stackmedium
1756
from-scratcheasy
1671
refactoring
1666
frontendeasy
1643
frontendexpert
1641
code-reviewmedium
1619
refactoringmedium
1608
backendexpert
1605
frontendhard
1590
from-scratchhard
1565
debuggingexpert
1553
backendmedium
1550
backend
1529
full-stack
1528
from-scratch
1521
code-review
1517
frontend
1505
backendhard
1501
debugginghard
1455
frontendmedium
1453
debugging
1447
full-stackhard
1302
backendeasy
1268
multi-languagehard
1268
debuggingmedium
1110
multi-language
1070
from-scratchmedium
961
code-reviewhard
307
multi-languageexpert
0

All Results

TaskCategoryScore
Fix hallucination and context window bugs in RAG agentbackend0.0
Implement background job scheduler with persistencebackend0.0
Add i18n with locale routing to Next.js appfull-stack28.0
Fix broken GitHub Actions CI pipelinedebugging
Build terminal UI dashboardfrom-scratch0.0
Add streaming SSE endpoint for LLM chatbackend
Fix broken responsive layoutfrontend22.0
Write complex SQL report with window functionsbackend0.0
Write tests for untested legacy Flask servicecode-review0.0
Find and patch all OWASP Top 10 vulnerabilitiesdebugging28.0
Add virtual scrolling to table rendering 5000 rowsfrontend
Fix flaky test suitedebugging28.0
Build CLI tool with subcommands and configfrom-scratch0.0
Dockerize Node.js monorepofull-stack28.0
Add Google OAuth2 login to Express appfull-stack
Fix auth bypass vulnerabilitydebugging
Build REST API from scratchfrom-scratch22.0
Implement Stripe webhook handlerbackend
Optimize bloated React bundle under 500KBfrontend28.0
Port Python CLI to Rustmulti-language0.0
Build RAG pipeline with vector searchbackend0.0
Fix 12 WCAG accessibility violations in checkout formfrontend0.0
Refactor monolithic handler to CQRSrefactoring
Add WebSocket real-time updatesfull-stack28.0
Build distributed node cluster with gossip protocolfrom-scratch0.0
Fix N+1 query in dashboardbackend0.0
Find and fix 4 hidden backdoors in Flask appdebugging28.0
Code review: identify security vulnscode-review0.0
Fix Node.js stream backpressure causing OOM on large filesbackend65.3
Build materialized view refresh pipeline for analyticsbackend85.6
Build LLM evaluation harness with structured gradingbackend52.5
Optimize slow Postgres queries in Flask appbackend77.7
Add file upload with S3 presigned URLsbackend53.3
Debug race condition in worker pooldebugging82.3
Build SaaS admin dashboard from scratchfrom-scratch44.3
Fix memory leak in event handlerdebugging49.3
Build production website with auth and members areafrontend60.5
Add rate limiting middlewarebackend66.3
Fix deadlocking transaction patterns in Flask appbackend83.1
Implement zero-trust API authentication layerbackend74.1
Migrate callback-hell Express app to async/awaitrefactoring74.1
Split 1100-line god file into proper modulesrefactoring74.8
Replace console.log with structured loggingrefactoring55.5
Add cursor-based pagination to REST APIbackend46.6
Fix React hydration mismatchfrontend53.9
Harden insecure Docker setup with 12 vulnerabilitiescode-review79.0
Write integration tests for payment flowcode-review45.8
Add slash commands and moderation to Discord botbackend65.8
Build real-time portfolio risk calculatorbackend44.0
Convert React app to PWA with offline supportfrontend77.3
Implement multi-tenant row-level security in Postgresbackend45.5
Debug and fix 6 broken database triggers and constraintsdebugging65.1
Write Kubernetes manifests for Node.js microservicefull-stack82.6
Build MCP server for database managementbackend78.5
Add Redis caching layer to Express APIbackend53.9
Fix data integrity bugs in denormalized e-commerce schemadebugging81.7
Add retry logic and dead letter queue to Python task queuebackend79.7
Implement transformer inference engine with KV cachefrom-scratch81.7
Remove AI slop and over-engineering from codebaserefactoring85.0
Implement JWT auth middlewarebackend55.0
Add GraphQL layer over REST APImulti-language53.6
Add caching layer to eliminate slow SSR page loadsfull-stack82.7
Build codebase indexer for LLM context windowsfrom-scratch46.0
Fix race conditions in order matching enginebackend79.5
Zero-downtime schema migrationfull-stack72.6
Split 1100-line god file into proper modulesrefactoring67.9
Remove AI slop and over-engineering from codebaserefactoring84.6
Fix broken responsive layoutfrontend78.0
Implement JWT auth middlewarebackend87.9
Convert React app to PWA with offline supportfrontend67.5
Implement multi-tenant row-level security in Postgresbackend82.3
Harden insecure Docker setup with 12 vulnerabilitiescode-review85.4
Add caching layer to eliminate slow SSR page loadsfull-stack83.2
Optimize bloated React bundle under 500KBfrontend67.1
Implement zero-trust API authentication layerbackend71.1
Dockerize Node.js monorepofull-stack81.6
Find and patch all OWASP Top 10 vulnerabilitiesdebugging72.1
Replace console.log with structured loggingrefactoring53.5
Add i18n with locale routing to Next.js appfull-stack67.7
Build codebase indexer for LLM context windowsfrom-scratch44.4
Write Kubernetes manifests for Node.js microservicefull-stack91.3
Build production website with auth and members areafrontend66.0
Build SaaS admin dashboard from scratchfrom-scratch67.7
Implement background job scheduler with persistencebackend56.1
Build MCP server for database managementbackend83.0
Implement transformer inference engine with KV cachefrom-scratch51.5
Build CLI tool with subcommands and configfrom-scratch67.0
Fix hallucination and context window bugs in RAG agentbackend71.5
Build real-time portfolio risk calculatorbackend59.1
Build LLM evaluation harness with structured gradingbackend48.0
Build materialized view refresh pipeline for analyticsbackend66.5
Fix race conditions in order matching enginebackend70.7
Fix data integrity bugs in denormalized e-commerce schemadebugging68.5
Write complex SQL report with window functionsbackend69.7
Debug and fix 6 broken database triggers and constraintsdebugging79.5
Fix deadlocking transaction patterns in Flask appbackend66.8
Find and fix 4 hidden backdoors in Flask appdebugging93.3
Add Redis caching layer to Express APIbackend74.9
Write tests for untested legacy Flask servicecode-review69.5
Optimize slow Postgres queries in Flask appbackend80.2
Add slash commands and moderation to Discord botbackend72.4
Fix 12 WCAG accessibility violations in checkout formfrontend81.0
Add retry logic and dead letter queue to Python task queuebackend72.5
Add virtual scrolling to table rendering 5000 rowsfrontend69.0
Fix Node.js stream backpressure causing OOM on large filesbackend74.7
Build distributed node cluster with gossip protocolfrom-scratch55.6
Fix auth bypass vulnerabilitydebugging84.0
Write integration tests for payment flowcode-review36.8
Add GraphQL layer over REST APImulti-language67.2
Add rate limiting middlewarebackend65.6
Implement Stripe webhook handlerbackend44.6
Zero-downtime schema migrationfull-stack61.0
Fix flaky test suitedebugging64.5
Refactor monolithic handler to CQRSrefactoring73.5
Add cursor-based pagination to REST APIbackend62.4
Code review: identify security vulnscode-review74.9
Fix N+1 query in dashboardbackend74.5
Fix memory leak in event handlerdebugging58.8
Debug race condition in worker pooldebugging86.9
Build terminal UI dashboardfrom-scratch50.3
Fix React hydration mismatchfrontend73.5
Build REST API from scratchfrom-scratch84.6