APEX

Qwen3.5 27b [Q4_K_M]

LM Studio

262K context, <$0.01/M input, <$0.01/M output
ELO 1468 (peak 1481)

Avg Score: 62.8
Avg Cost: $0.18
Score/$: 340.6
Runs: 117

Category ELOs

debugging (medium): 2057
frontend (expert): 1878
from-scratch (medium): 1865
refactoring (expert): 1734
frontend (hard): 1721
frontend (easy): 1631
debugging (hard): 1599
frontend: 1586
backend (expert): 1580
debugging: 1576
frontend (medium): 1573
refactoring: 1526
code-review (medium): 1506
refactoring (medium): 1502
backend: 1453
full-stack (medium): 1449
backend (medium): 1442
full-stack: 1417
code-review: 1417
full-stack (hard): 1396
backend (hard): 1384
debugging (expert): 1373
from-scratch (easy): 1373
from-scratch: 1303
multi-language (expert): 1302
from-scratch (hard): 1279
multi-language: 1271
multi-language (hard): 948
code-review (hard): 669
backend (easy): 502
from-scratch (expert): 0

All Results

Task | Category | Score
Implement background job scheduler with persistence | backend | 53.0
Fix data integrity bugs in denormalized e-commerce schema | debugging | 71.8
Build RAG pipeline with vector search | backend | 42.6
Migrate callback-hell Express app to async/await | refactoring | 54.1
Build terminal UI dashboard | from-scratch | 57.1
Build real-time portfolio risk calculator | backend | 56.6
Implement multi-tenant row-level security in Postgres | backend | 40.3
Build production website with auth and members area | frontend | 67.3
Optimize bloated React bundle under 500KB | frontend | 70.8
Fix auth bypass vulnerability | debugging | 28.0
Add file upload with S3 presigned URLs | backend | 80.9
Write Kubernetes manifests for Node.js microservice | full-stack | 81.1
Fix React hydration mismatch | frontend | 73.9
Write tests for untested legacy Flask service | code-review | 50.9
Write complex SQL report with window functions | backend | 50.1
Build CLI tool with subcommands and config | from-scratch | 34.3
Fix N+1 query in dashboard | backend | 55.9
Optimize slow Postgres queries in Flask app | backend | 74.1
Implement Stripe webhook handler | backend | 78.7
Add i18n with locale routing to Next.js app | full-stack | 63.8
Build codebase indexer for LLM context windows | from-scratch | 38.8
Build distributed node cluster with gossip protocol | from-scratch | 31.8
Add streaming SSE endpoint for LLM chat | backend | 85.3
Add retry logic and dead letter queue to Python task queue | backend | 9.6
Add rate limiting middleware | backend | 44.3
Remove AI slop and over-engineering from codebase | refactoring | 78.3
Debug and fix 6 broken database triggers and constraints | debugging | 75.5
Fix flaky test suite | debugging | 89.8
Find and fix 4 hidden backdoors in Flask app | debugging | 78.7
Add slash commands and moderation to Discord bot | backend | 69.8
Build REST API from scratch | from-scratch | 75.1
Write integration tests for payment flow | code-review | 66.0
Harden insecure Docker setup with 12 vulnerabilities | code-review | 83.5
Add caching layer to eliminate slow SSR page loads | full-stack | 82.4
Zero-downtime schema migration | full-stack | 70.5
Fix broken responsive layout | frontend | 71.3
Implement JWT auth middleware | backend | 45.7
Add WebSocket real-time updates | full-stack | 73.9
Build SaaS admin dashboard from scratch | from-scratch | 47.5
Build MCP server for database management | backend | 55.8
Add GraphQL layer over REST API | multi-language | 62.7
Fix hallucination and context window bugs in RAG agent | backend | 63.0
Fix Node.js stream backpressure causing OOM on large files | backend | 79.3
Fix deadlocking transaction patterns in Flask app | backend | 72.8
Implement transformer inference engine with KV cache | from-scratch | 43.0
Replace console.log with structured logging | refactoring | 36.4
Find and patch all OWASP Top 10 vulnerabilities | debugging | 66.3
Add Google OAuth2 login to Express app | full-stack | 66.3
Debug race condition in worker pool | debugging | 82.4
Fix race conditions in order matching engine | backend | 56.4
Build materialized view refresh pipeline for analytics | backend | 74.8
Add Redis caching layer to Express API | backend | 81.7
Add cursor-based pagination to REST API | backend | 47.3
Dockerize Node.js monorepo | full-stack | 66.6
Split 1100-line god file into proper modules | refactoring | 50.3
Fix memory leak in event handler | debugging | 44.9
Fix broken GitHub Actions CI pipeline | debugging | 93.0
Fix 12 WCAG accessibility violations in checkout form | frontend | 83.3
Convert React app to PWA with offline support | frontend | 75.9
Add virtual scrolling to table rendering 5000 rows | frontend | 45.5
Implement zero-trust API authentication layer | backend | 28.0
Port Python CLI to Rust | multi-language | 35.5
Code review: identify security vulns | code-review | 49.1
Add GraphQL layer over REST API | multi-language | 44.5
Migrate callback-hell Express app to async/await | refactoring | 75.4
Implement multi-tenant row-level security in Postgres | backend | 75.8
Optimize bloated React bundle under 500KB | frontend | 71.5
Convert React app to PWA with offline support | frontend | 44.2
Fix broken responsive layout | frontend | 64.7
Dockerize Node.js monorepo | full-stack | 67.9
Harden insecure Docker setup with 12 vulnerabilities | code-review | 89.6
Build codebase indexer for LLM context windows | from-scratch | 35.8
Replace console.log with structured logging | refactoring | 54.2
Find and patch all OWASP Top 10 vulnerabilities | debugging | 70.2
Split 1100-line god file into proper modules | refactoring | 71.9
Implement JWT auth middleware | backend | 77.5
Add caching layer to eliminate slow SSR page loads | full-stack | 81.2
Add i18n with locale routing to Next.js app | full-stack | 67.3
Implement zero-trust API authentication layer | backend | 73.2
Remove AI slop and over-engineering from codebase | refactoring | 80.0
Write Kubernetes manifests for Node.js microservice | full-stack | 84.9
Build distributed node cluster with gossip protocol | from-scratch | 60.0
Build MCP server for database management | backend | 58.8
Build CLI tool with subcommands and config | from-scratch | 35.5
Build production website with auth and members area | frontend | 53.3
Implement background job scheduler with persistence | backend | 20.5
Fix hallucination and context window bugs in RAG agent | backend | 67.8
Build LLM evaluation harness with structured grading | backend | 47.8
Implement transformer inference engine with KV cache | from-scratch | 12.7
Build real-time portfolio risk calculator | backend | 79.0
Fix race conditions in order matching engine | backend | 84.3
Fix data integrity bugs in denormalized e-commerce schema | debugging | 70.5
Build materialized view refresh pipeline for analytics | backend | 52.0
Fix deadlocking transaction patterns in Flask app | backend | 50.6
Debug and fix 6 broken database triggers and constraints | debugging | 47.4
Write complex SQL report with window functions | backend | 58.4
Find and fix 4 hidden backdoors in Flask app | debugging | 91.5
Write tests for untested legacy Flask service | code-review | 60.4
Add Google OAuth2 login to Express app | full-stack | 62.9
Optimize slow Postgres queries in Flask app | backend | 62.9
Add slash commands and moderation to Discord bot | backend | 44.8
Add retry logic and dead letter queue to Python task queue | backend | 68.9
Fix Node.js stream backpressure causing OOM on large files | backend | 70.9
Add virtual scrolling to table rendering 5000 rows | frontend | 78.5
Fix 12 WCAG accessibility violations in checkout form | frontend | 77.8
Fix auth bypass vulnerability | debugging | 94.5
Write integration tests for payment flow | code-review | 60.8
Zero-downtime schema migration | full-stack | 66.5
Add rate limiting middleware | backend | 38.6
Fix flaky test suite | debugging | 91.8
Fix N+1 query in dashboard | backend | 63.4
Fix memory leak in event handler | debugging | 71.5
Refactor monolithic handler to CQRS | refactoring | 67.6
Debug race condition in worker pool | debugging | 81.7
Fix React hydration mismatch | frontend | 76.5
Build terminal UI dashboard | from-scratch | 44.0
Build REST API from scratch | from-scratch | 79.8