APEX
Back to models

Qwen3.7 Max

Qwen

262K context$0.78/M input$3.12/M output
1664peak 1681

Avg Score

80.2

Avg Cost

$1.03

Score/$

78.0

Runs

41

Win/Loss/Draw

Scoring Dimensions

Score Distribution

Category ELOs

refactoringexpert
2408
from-scratchmedium
2359
frontendeasy
2285
from-scratcheasy
1972
frontendhard
1923
full-stackmedium
1850
multi-languagehard
1844
frontendexpert
1837
refactoring
1823
refactoringmedium
1806
full-stack
1773
full-stackhard
1755
backendexpert
1749
debuggingexpert
1747
backendmaster
1733
frontend
1723
code-reviewhard
1708
frontendmedium
1706
debugging
1644
backend
1636
backendhard
1627
code-reviewmedium
1588
code-review
1568
debugginghard
1563
multi-language
1557
backendmedium
1550
from-scratch
1528
multi-languageexpert
1521
from-scratchexpert
117

All Results

TaskCategoryScore
Build real-time portfolio risk calculatorbackend81.0
Implement Stripe webhook handlerbackend75.8
Add file upload with S3 presigned URLsbackend54.0
Add caching layer to eliminate slow SSR page loadsfull-stack88.2
Build MCP server for database managementbackend84.2
Write integration tests for payment flowcode-review77.7
Fix memory leak in event handlerdebugging71.2
Build materialized view refresh pipeline for analyticsbackend82.2
Build LLM evaluation harness with structured gradingbackend79.5
Migrate Express monolith to modular architecturebackend84.7
Implement transformer inference engine with KV cachefrom-scratch40.7
Dockerize Node.js monorepofull-stack82.8
Convert React app to PWA with offline supportfrontend79.0
Optimize bloated React bundle under 500KBfrontend88.3
Add GraphQL layer over REST APImulti-language80.2
Fix React hydration mismatchfrontend75.8
Fix broken responsive layoutfrontend84.8
Zero-downtime schema migrationfull-stack84.0
Find and patch all OWASP Top 10 vulnerabilitiesdebugging83.8
Fix 12 WCAG accessibility violations in checkout formfrontend86.2
Implement JWT auth middlewarebackend78.1
Replace console.log with structured loggingrefactoring76.8
Harden insecure Docker setup with 12 vulnerabilitiescode-review85.0
Add WebSocket real-time updatesfull-stack85.2
Build production website with auth and members areafrontend71.8
Add Google OAuth2 login to Express appfull-stack78.3
Fix Node.js stream backpressure causing OOM on large filesbackend90.7
Implement multi-tenant row-level security in Postgresbackend74.9
Add i18n with locale routing to Next.js appfull-stack84.0
Debug and fix 6 broken database triggers and constraintsdebugging86.8
Find and fix 4 hidden backdoors in Flask appdebugging91.8
Add virtual scrolling to table rendering 5000 rowsfrontend85.6
Split 1100-line god file into proper modulesrefactoring85.0
Build terminal UI dashboardfrom-scratch78.9
Add cursor-based pagination to REST APIbackend82.4
Build REST API from scratchfrom-scratch85.5
Port Python CLI to Rustmulti-language54.8
Refactor monolithic handler to CQRSrefactoring84.1
Migrate callback-hell Express app to async/awaitrefactoring88.3
Remove AI slop and over-engineering from codebaserefactoring87.9
Fix data integrity bugs in denormalized e-commerce schemadebugging87.0