APEX

Qwen3.5 35b A3b

OpenRouter

262K context, $0.25/M input, $2.00/M output
ELO 1445 (peak 1456)

Avg Score: 63.4
Avg Cost: $0.08
Score/$: 831.6
Runs: 121
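
The page does not state how Score/$ is derived. As a minimal sketch, assuming it is simply the average score divided by the average cost per run in USD (the function name `score_per_dollar` is hypothetical), the displayed figures can be cross-checked. Dividing by the rounded $0.08 gives about 792.5 rather than 831.6, which would be consistent with the page computing the ratio from an unrounded cost of roughly $0.0762.

```python
def score_per_dollar(avg_score: float, avg_cost: float) -> float:
    """Assumed definition: average score divided by average cost per run (USD)."""
    if avg_cost <= 0:
        raise ValueError("avg_cost must be positive")
    return avg_score / avg_cost


# Ratio computed from the rounded figures shown on the page.
rounded = score_per_dollar(63.4, 0.08)

# Cost that the displayed Score/$ of 831.6 would imply under this assumption.
implied_cost = 63.4 / 831.6

print(f"{rounded:.1f}")       # 792.5
print(f"{implied_cost:.4f}")  # 0.0762
```

If the site instead averages per-run score/cost ratios, the numbers would differ again; only the two displayed averages are available here, so that variant cannot be checked.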

Category ELOs

Category | Difficulty | ELO
multi-language | hard | 2051
debugging | medium | 1792
code-review | medium | 1598
backend | expert | 1587
code-review | - | 1571
frontend | hard | 1562
full-stack | medium | 1561
code-review | hard | 1533
full-stack | - | 1529
debugging | - | 1523
debugging | expert | 1523
full-stack | hard | 1516
backend | easy | 1509
frontend | expert | 1471
refactoring | - | 1460
backend | medium | 1458
refactoring | medium | 1450
frontend | medium | 1440
frontend | - | 1434
multi-language | - | 1428
from-scratch | expert | 1414
backend | - | 1404
debugging | hard | 1401
frontend | easy | 1315
from-scratch | easy | 1290
from-scratch | hard | 1266
from-scratch | - | 1260
backend | hard | 1247
refactoring | expert | 1156
from-scratch | medium | 1093
multi-language | expert | 1020

All Results

Task | Category | Score
Build MCP server for database management | backend | 70.2
Convert React app to PWA with offline support | frontend | 62.9
Build real-time portfolio risk calculator | backend | 46.5
Build CLI tool with subcommands and config | from-scratch | 30.0
Add i18n with locale routing to Next.js app | full-stack | 46.1
Split 1100-line god file into proper modules | refactoring | 34.5
Write complex SQL report with window functions | backend | 59.7
Add caching layer to eliminate slow SSR page loads | full-stack | 77.8
Find and fix 4 hidden backdoors in Flask app | debugging | 61.1
Add retry logic and dead letter queue to Python task queue | backend | 76.5
Fix data integrity bugs in denormalized e-commerce schema | debugging | 85.7
Fix N+1 query in dashboard | backend | 56.5
Fix race conditions in order matching engine | backend | 84.9
Fix hallucination and context window bugs in RAG agent | backend | 30.3
Fix memory leak in event handler | debugging | 81.8
Debug race condition in worker pool | debugging | 81.1
Fix broken GitHub Actions CI pipeline | debugging | 81.3
Add streaming SSE endpoint for LLM chat | backend | 82.5
Fix React hydration mismatch | frontend | 78.2
Build LLM evaluation harness with structured grading | backend | 63.5
Implement Stripe webhook handler | backend | 68.3
Build distributed node cluster with gossip protocol | from-scratch | 26.2
Migrate callback-hell Express app to async/await | refactoring | 62.4
Build SaaS admin dashboard from scratch | from-scratch | 58.3
Find and patch all OWASP Top 10 vulnerabilities | debugging | 60.4
Debug and fix 6 broken database triggers and constraints | debugging | 83.3
Add Google OAuth2 login to Express app | full-stack | 75.4
Fix Node.js stream backpressure causing OOM on large files | backend | 86.7
Harden insecure Docker setup with 12 vulnerabilities | code-review | 77.5
Implement background job scheduler with persistence | backend | 26.1
Fix auth bypass vulnerability | debugging | 87.0
Optimize slow Postgres queries in Flask app | backend | 78.0
Build REST API from scratch | from-scratch | 60.9
Add Redis caching layer to Express API | backend | 76.6
Fix 12 WCAG accessibility violations in checkout form | frontend | 80.5
Build materialized view refresh pipeline for analytics | backend | 71.0
Optimize bloated React bundle under 500KB | frontend | 44.4
Add file upload with S3 presigned URLs | backend | 71.9
Write Kubernetes manifests for Node.js microservice | full-stack | 77.5
Fix flaky test suite | debugging | 86.6
Build RAG pipeline with vector search | backend | 39.0
Build codebase indexer for LLM context windows | from-scratch | 35.5
Implement multi-tenant row-level security in Postgres | backend | 53.0
Build terminal UI dashboard | from-scratch | 46.5
Add cursor-based pagination to REST API | backend | 80.5
Write integration tests for payment flow | code-review | 64.1
Implement zero-trust API authentication layer | backend | 69.3
Dockerize Node.js monorepo | full-stack | 59.4
Add WebSocket real-time updates | full-stack | 65.3
Add rate limiting middleware | backend | 70.5
Implement transformer inference engine with KV cache | from-scratch | 73.8
Implement JWT auth middleware | backend | 50.0
Add GraphQL layer over REST API | multi-language | 82.5
Add virtual scrolling to table rendering 5000 rows | frontend | 76.5
Remove AI slop and over-engineering from codebase | refactoring | 77.8
Write tests for untested legacy Flask service | code-review | 42.6
Build production website with auth and members area | frontend | 57.1
Zero-downtime schema migration | full-stack | 85.8
Replace console.log with structured logging | refactoring | 78.1
Add slash commands and moderation to Discord bot | backend | 63.3
Fix deadlocking transaction patterns in Flask app | backend | 76.7
Fix broken responsive layout | frontend | 64.2
Code review: identify security vulns | code-review | 34.1
Port Python CLI to Rust | multi-language | 29.3
Refactor monolithic handler to CQRS | refactoring | 37.6
Port Python CLI to Rust | multi-language | 29.3
Fix 12 WCAG accessibility violations in checkout form | frontend | 68.9
Code review: identify security vulns | code-review | 79.0
Add slash commands and moderation to Discord bot | backend | 61.8
Add WebSocket real-time updates | full-stack | 71.8
Implement multi-tenant row-level security in Postgres | backend | 69.5
Find and patch all OWASP Top 10 vulnerabilities | debugging | 62.5
Optimize bloated React bundle under 500KB | frontend | 67.1
Remove AI slop and over-engineering from codebase | refactoring | 73.6
Write Kubernetes manifests for Node.js microservice | full-stack | 86.7
Add streaming SSE endpoint for LLM chat | backend | 74.7
Build codebase indexer for LLM context windows | from-scratch | 31.0
Implement zero-trust API authentication layer | backend | 66.5
Convert React app to PWA with offline support | frontend | 67.8
Add caching layer to eliminate slow SSR page loads | full-stack | 68.8
Harden insecure Docker setup with 12 vulnerabilities | code-review | 89.2
Split 1100-line god file into proper modules | refactoring | 67.0
Fix broken responsive layout | frontend | 60.5
Add i18n with locale routing to Next.js app | full-stack | 61.5
Implement JWT auth middleware | backend | 76.5
Replace console.log with structured logging | refactoring | 62.3
Dockerize Node.js monorepo | full-stack | 74.8
Find and fix 4 hidden backdoors in Flask app | debugging | 77.1
Write complex SQL report with window functions | backend | 75.6
Build real-time portfolio risk calculator | backend | 28.2
Build LLM evaluation harness with structured grading | backend | 61.0
Build SaaS admin dashboard from scratch | from-scratch | 35.0
Optimize slow Postgres queries in Flask app | backend | 74.5
Debug and fix 6 broken database triggers and constraints | debugging | 87.7
Fix hallucination and context window bugs in RAG agent | backend | 34.0
Fix N+1 query in dashboard | backend | 50.2
Add rate limiting middleware | backend | 31.4
Fix data integrity bugs in denormalized e-commerce schema | debugging | 73.2
Write tests for untested legacy Flask service | code-review | 55.6
Fix memory leak in event handler | debugging | 70.7
Build CLI tool with subcommands and config | from-scratch | 47.9
Build MCP server for database management | backend | 57.1
Build RAG pipeline with vector search | backend | 29.4
Refactor monolithic handler to CQRS | refactoring | 69.2
Zero-downtime schema migration | full-stack | 61.9
Build terminal UI dashboard | from-scratch | 32.0
Fix auth bypass vulnerability | debugging | 77.6
Implement background job scheduler with persistence | backend | 27.6
Implement transformer inference engine with KV cache | from-scratch | 55.6
Fix React hydration mismatch | frontend | 73.3
Write integration tests for payment flow | code-review | 77.0
Build distributed node cluster with gossip protocol | from-scratch | 21.1
Fix race conditions in order matching engine | backend | 79.2
Add virtual scrolling to table rendering 5000 rows | frontend | 51.8
Build production website with auth and members area | frontend | 62.4
Build materialized view refresh pipeline for analytics | backend | 66.2
Add retry logic and dead letter queue to Python task queue | backend | 69.1
Fix deadlocking transaction patterns in Flask app | backend | 88.2
Fix flaky test suite | debugging | 88.9
Build REST API from scratch | from-scratch | 76.8
Debug race condition in worker pool | debugging | 66.9