APEX
Back to models

Qwen3.5 122b A10b [Q4_K_XL]

LM Studio

262K context<$0.01/M input<$0.01/M output
1493peak 1509

Avg Score

72.5

Avg Cost

$0.07

Score/$

1102.7

Runs

65

Win/Loss/Draw

Scoring Dimensions

Score Distribution

Category ELOs

multi-languagehard
1910
frontendeasy
1771
refactoringexpert
1757
debuggingexpert
1740
from-scratchexpert
1735
full-stackhard
1674
full-stack
1621
frontendhard
1605
debugging
1598
frontendmedium
1584
debugginghard
1568
debuggingmedium
1561
full-stackmedium
1554
frontend
1547
backendmedium
1528
backendexpert
1517
multi-language
1505
backend
1481
refactoring
1478
backendhard
1455
refactoringmedium
1434
from-scratch
1363
from-scratchhard
1323
code-review
1298
code-reviewmedium
1290
from-scratchmedium
1262
frontendexpert
1221
code-reviewhard
1164
multi-languageexpert
1115
from-scratcheasy
917
backendeasy
698

All Results

TaskCategoryScore
Migrate callback-hell Express app to async/awaitrefactoring75.8
Build production website with auth and members areafrontend61.7
Build SaaS admin dashboard from scratchfrom-scratch57.0
Fix broken responsive layoutfrontend79.7
Add GraphQL layer over REST APImulti-language81.8
Fix race conditions in order matching enginebackend84.8
Fix 12 WCAG accessibility violations in checkout formfrontend80.1
Implement multi-tenant row-level security in Postgresbackend42.0
Fix broken GitHub Actions CI pipelinedebugging84.1
Debug and fix 6 broken database triggers and constraintsdebugging90.7
Add cursor-based pagination to REST APIbackend82.8
Add caching layer to eliminate slow SSR page loadsfull-stack80.5
Harden insecure Docker setup with 12 vulnerabilitiescode-review78.5
Convert React app to PWA with offline supportfrontend69.6
Fix auth bypass vulnerabilitydebugging84.8
Find and fix 4 hidden backdoors in Flask appdebugging89.5
Write tests for untested legacy Flask servicecode-review49.2
Write complex SQL report with window functionsbackend62.0
Implement zero-trust API authentication layerbackend69.7
Add Google OAuth2 login to Express appfull-stack84.6
Build CLI tool with subcommands and configfrom-scratch63.5
Implement Stripe webhook handlerbackend82.0
Implement JWT auth middlewarebackend51.9
Replace console.log with structured loggingrefactoring47.7
Refactor monolithic handler to CQRSrefactoring73.7
Add virtual scrolling to table rendering 5000 rowsfrontend82.8
Build terminal UI dashboardfrom-scratch61.0
Fix Node.js stream backpressure causing OOM on large filesbackend86.3
Add retry logic and dead letter queue to Python task queuebackend62.0
Fix N+1 query in dashboardbackend52.9
Fix hallucination and context window bugs in RAG agentbackend57.2
Build REST API from scratchfrom-scratch69.3
Fix data integrity bugs in denormalized e-commerce schemadebugging92.8
Build materialized view refresh pipeline for analyticsbackend87.8
Port Python CLI to Rustmulti-language46.8
Remove AI slop and over-engineering from codebaserefactoring95.3
Add slash commands and moderation to Discord botbackend84.1
Write Kubernetes manifests for Node.js microservicefull-stack85.0
Debug race condition in worker pooldebugging90.9
Add Redis caching layer to Express APIbackend82.7
Build distributed node cluster with gossip protocolfrom-scratch61.0
Build LLM evaluation harness with structured gradingbackend84.0
Add file upload with S3 presigned URLsbackend74.1
Fix React hydration mismatchfrontend79.3
Write integration tests for payment flowcode-review67.4
Split 1100-line god file into proper modulesrefactoring49.3
Build MCP server for database managementbackend74.5
Add i18n with locale routing to Next.js appfull-stack80.9
Fix memory leak in event handlerdebugging76.0
Code review: identify security vulnscode-review66.5
Build real-time portfolio risk calculatorbackend53.9
Build codebase indexer for LLM context windowsfrom-scratch37.1
Implement background job scheduler with persistencebackend52.5
Implement transformer inference engine with KV cachefrom-scratch80.9
Add rate limiting middlewarebackend47.4
Find and patch all OWASP Top 10 vulnerabilitiesdebugging70.7
Optimize bloated React bundle under 500KBfrontend80.8
Add WebSocket real-time updatesfull-stack81.8
Fix flaky test suitedebugging85.0
Optimize slow Postgres queries in Flask appbackend76.0
Add streaming SSE endpoint for LLM chatbackend84.6
Fix deadlocking transaction patterns in Flask appbackend81.7
Dockerize Node.js monorepofull-stack79.7
Build RAG pipeline with vector searchbackend66.3
Zero-downtime schema migrationfull-stack74.8