APEX
Back to models

Qwen3.5 Flash 02.23

OpenRouter

1000K context$0.10/M input$0.40/M output
1341peak 1357

Avg Score

64.5

Avg Cost

$0.02

Score/$

4107.4

Runs

65

Win/Loss/Draw

Scoring Dimensions

Score Distribution

Category ELOs

multi-languagehard
1954
frontendmedium
1535
multi-language
1521
debuggingmedium
1519
backendmedium
1484
code-reviewhard
1455
debugging
1441
frontend
1438
debugginghard
1435
frontendhard
1435
backendeasy
1419
debuggingexpert
1415
backend
1380
backendhard
1315
from-scratchexpert
1311
backendexpert
1283
full-stackhard
1265
full-stack
1251
code-review
1249
full-stackmedium
1185
code-reviewmedium
1161
multi-languageexpert
1115
frontendexpert
1114
from-scratch
1109
from-scratcheasy
1106
refactoring
1092
refactoringmedium
1084
frontendeasy
904
from-scratchhard
872
from-scratchmedium
794
refactoringexpert
210

All Results

TaskCategoryScore
Add caching layer to eliminate slow SSR page loadsfull-stack78.9
Build CLI tool with subcommands and configfrom-scratch32.7
Add rate limiting middlewarebackend72.2
Debug race condition in worker pooldebugging83.6
Add retry logic and dead letter queue to Python task queuebackend79.9
Add i18n with locale routing to Next.js appfull-stack62.6
Fix flaky test suitedebugging88.3
Build distributed node cluster with gossip protocolfrom-scratch30.6
Fix Node.js stream backpressure causing OOM on large filesbackend85.7
Fix N+1 query in dashboardbackend47.8
Fix 12 WCAG accessibility violations in checkout formfrontend77.5
Fix hallucination and context window bugs in RAG agentbackend19.8
Fix data integrity bugs in denormalized e-commerce schemadebugging82.3
Build real-time portfolio risk calculatorbackend42.9
Add slash commands and moderation to Discord botbackend69.0
Build codebase indexer for LLM context windowsfrom-scratch34.8
Optimize slow Postgres queries in Flask appbackend78.0
Build REST API from scratchfrom-scratch71.7
Debug and fix 6 broken database triggers and constraintsdebugging75.7
Add file upload with S3 presigned URLsbackend77.3
Fix auth bypass vulnerabilitydebugging88.8
Fix deadlocking transaction patterns in Flask appbackend70.2
Port Python CLI to Rustmulti-language46.7
Build MCP server for database managementbackend77.3
Add WebSocket real-time updatesfull-stack66.4
Build SaaS admin dashboard from scratchfrom-scratch35.3
Build terminal UI dashboardfrom-scratch52.5
Fix broken GitHub Actions CI pipelinedebugging76.1
Fix broken responsive layoutfrontend64.2
Fix race conditions in order matching enginebackend78.7
Optimize bloated React bundle under 500KBfrontend73.1
Implement transformer inference engine with KV cachefrom-scratch74.3
Convert React app to PWA with offline supportfrontend79.7
Find and patch all OWASP Top 10 vulnerabilitiesdebugging65.5
Add streaming SSE endpoint for LLM chatbackend80.8
Write tests for untested legacy Flask servicecode-review42.8
Implement zero-trust API authentication layerbackend70.6
Add cursor-based pagination to REST APIbackend76.1
Implement JWT auth middlewarebackend50.5
Add virtual scrolling to table rendering 5000 rowsfrontend78.7
Fix React hydration mismatchfrontend77.2
Dockerize Node.js monorepofull-stack62.0
Build production website with auth and members areafrontend60.5
Add Google OAuth2 login to Express appfull-stack67.0
Replace console.log with structured loggingrefactoring43.2
Implement background job scheduler with persistencebackend31.1
Build RAG pipeline with vector searchbackend49.3
Add Redis caching layer to Express APIbackend77.3
Refactor monolithic handler to CQRSrefactoring36.5
Code review: identify security vulnscode-review47.4
Build materialized view refresh pipeline for analyticsbackend73.8
Implement multi-tenant row-level security in Postgresbackend41.0
Implement Stripe webhook handlerbackend82.5
Write complex SQL report with window functionsbackend59.5
Build LLM evaluation harness with structured gradingbackend53.5
Migrate callback-hell Express app to async/awaitrefactoring53.9
Write Kubernetes manifests for Node.js microservicefull-stack72.8
Split 1100-line god file into proper modulesrefactoring51.4
Fix memory leak in event handlerdebugging83.3
Remove AI slop and over-engineering from codebaserefactoring74.3
Write integration tests for payment flowcode-review72.0
Add GraphQL layer over REST APImulti-language82.4
Harden insecure Docker setup with 12 vulnerabilitiescode-review74.0
Zero-downtime schema migrationfull-stack68.1
Find and fix 4 hidden backdoors in Flask appdebugging58.1