APEX
Back to models

Qwen3.5 397b A17b Q4 K XL [Q4_K_XL]

LM Studio

262K context<$0.01/M input<$0.01/M output
1524peak 1541

Avg Score

74.8

Avg Cost

$0.87

Score/$

85.8

Runs

70

Win/Loss/Draw

Scoring Dimensions

Score Distribution

Category ELOs

from-scratcheasy
2138
frontendexpert
2005
from-scratchmedium
1928
backendeasy
1918
from-scratchhard
1914
refactoringexpert
1741
from-scratch
1706
frontendeasy
1679
full-stackmedium
1647
multi-languagehard
1622
refactoringmedium
1607
refactoring
1605
multi-languageexpert
1591
backendhard
1567
debuggingmedium
1551
frontend
1549
frontendmedium
1546
debuggingexpert
1539
code-reviewmedium
1518
full-stack
1511
backendexpert
1506
multi-language
1497
backend
1496
debugging
1475
frontendmaster
1451
code-review
1426
debugginghard
1424
full-stackhard
1410
backendmedium
1402
frontendhard
1366
backendmaster
1295
from-scratchexpert
1003
code-reviewhard
581

All Results

TaskCategoryScore
Fix and extend Chrome browser extensionfrontend49.0
Implement JWT auth middlewarebackend77.5
Add WebSocket real-time updatesfull-stack84.3
Zero-downtime schema migrationfull-stack66.8
Add Google OAuth2 login to Express appfull-stack65.7
Write integration tests for payment flowcode-review57.4
Add i18n with locale routing to Next.js appfull-stack68.0
Optimize bloated React bundle under 500KBfrontend76.0
Add caching layer to eliminate slow SSR page loadsfull-stack83.8
Build RAG pipeline with vector searchbackend56.1
Build LLM evaluation harness with structured gradingbackend64.8
Implement zero-trust API authentication layerbackend53.6
Build distributed node cluster with gossip protocolfrom-scratch78.9
Fix auth bypass vulnerabilitydebugging77.7
Add cursor-based pagination to REST APIbackend72.2
Migrate callback-hell Express app to async/awaitrefactoring83.0
Implement transformer inference engine with KV cachefrom-scratch69.9
Fix memory leak in event handlerdebugging65.0
Implement multi-tenant row-level security in Postgresbackend78.5
Fix deadlocking transaction patterns in Flask appbackend61.2
Fix Node.js stream backpressure causing OOM on large filesbackend78.7
Fix broken responsive layoutfrontend77.4
Migrate Express monolith to modular architecturebackend62.5
Add file upload with S3 presigned URLsbackend66.6
Build interactive data visualization dashboardfrontend73.0
Convert React app to PWA with offline supportfrontend75.2
Build MCP server for database managementbackend82.4
Replace console.log with structured loggingrefactoring75.7
Add slash commands and moderation to Discord botbackend76.8
Add GraphQL layer over REST APImulti-language76.2
Write complex SQL report with window functionsbackend87.5
Add Redis caching layer to Express APIbackend77.7
Fix N+1 query in dashboardbackend63.7
Write tests for untested legacy Flask servicecode-review81.3
Add virtual scrolling to table rendering 5000 rowsfrontend82.3
Build terminal UI dashboardfrom-scratch71.2
Build production website with auth and members areafrontend74.0
Build multi-tool LLM agent runtimebackend77.8
Build 3D browser game with physics and multiplayer syncfrontend78.7
Split 1100-line god file into proper modulesrefactoring72.7
Fix 12 WCAG accessibility violations in checkout formfrontend76.2
Add retry logic and dead letter queue to Python task queuebackend71.9
Write Kubernetes manifests for Node.js microservicefull-stack87.7
Code review: identify security vulnscode-review74.7
Fix flaky test suitedebugging86.8
Port Python CLI to Rustmulti-language57.5
Fix data integrity bugs in denormalized e-commerce schemadebugging74.2
Build real-time portfolio risk calculatorbackend81.7
Implement Stripe webhook handlerbackend67.2
Build codebase indexer for LLM context windowsfrom-scratch70.9
Build materialized view refresh pipeline for analyticsbackend78.5
Find and patch all OWASP Top 10 vulnerabilitiesdebugging75.8
Add streaming SSE endpoint for LLM chatbackend60.0
Fix broken GitHub Actions CI pipelinedebugging80.3
Add rate limiting middlewarebackend80.3
Build SaaS admin dashboard from scratchfrom-scratch83.3
Implement background job scheduler with persistencebackend70.8
Fix hallucination and context window bugs in RAG agentbackend76.3
Remove AI slop and over-engineering from codebaserefactoring82.2
Refactor monolithic handler to CQRSrefactoring73.3
Harden insecure Docker setup with 12 vulnerabilitiescode-review77.0
Dockerize Node.js monorepofull-stack79.3
Optimize slow Postgres queries in Flask appbackend76.8
Fix React hydration mismatchfrontend75.7
Find and fix 4 hidden backdoors in Flask appdebugging89.7
Fix race conditions in order matching enginebackend87.5
Debug and fix 6 broken database triggers and constraintsdebugging86.8
Debug race condition in worker pooldebugging84.5
Build CLI tool with subcommands and configfrom-scratch78.5
Build REST API from scratchfrom-scratch89.3