APEX

Qwen3.5 27b [Q4_K_M]

LM Studio

262K context | <$0.01/M input | <$0.01/M output
1334 (peak 1351)

Avg Score: 61.5
Avg Cost: $0.17
Score/$: 369.9
Runs: 65
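The page does not say how Score/$ is derived. A minimal sketch, assuming it is simply average score divided by average cost, shows that the displayed figures only agree if the $0.17 average cost is a rounded value. The variable names are illustrative, not taken from the site.

```python
# Figures exactly as displayed in the summary above.
avg_score = 61.5
avg_cost_displayed = 0.17            # dollars, presumably rounded to cents
score_per_dollar_displayed = 369.9

# Assumption: Score/$ = average score / average cost.
naive_ratio = avg_score / avg_cost_displayed

# Average cost that the displayed Score/$ would imply under that assumption.
implied_avg_cost = avg_score / score_per_dollar_displayed

print(f"score / displayed cost: {naive_ratio:.1f}")
print(f"implied average cost:   ${implied_avg_cost:.4f}")
```

Under this assumption the displayed cost gives a ratio of about 361.8 rather than 369.9, while the implied average cost rounds to $0.17, which is consistent with the ratio having been computed from an unrounded cost.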

Category ELOs

Category | Difficulty | ELO
debugging | medium | 1792
frontend | hard | 1775
refactoring | expert | 1496
frontend | expert | 1465
frontend | - | 1416
backend | medium | 1408
debugging | - | 1403
full-stack | medium | 1387
full-stack | - | 1362
frontend | medium | 1351
frontend | easy | 1351
code-review | medium | 1329
full-stack | hard | 1325
backend | - | 1324
code-review | - | 1317
backend | hard | 1316
refactoring | - | 1295
debugging | expert | 1283
debugging | hard | 1276
from-scratch | easy | 1249
refactoring | medium | 1225
from-scratch | medium | 1212
from-scratch | - | 1191
backend | expert | 1173
code-review | hard | 1088
from-scratch | hard | 1070
multi-language | - | 1024
backend | easy | 628
multi-language | expert | 471
multi-language | hard | 243
from-scratch | expert | 172

All Results

Task | Category | Score
Implement background job scheduler with persistence | backend | 53.0
Fix data integrity bugs in denormalized e-commerce schema | debugging | 71.8
Build RAG pipeline with vector search | backend | 42.6
Migrate callback-hell Express app to async/await | refactoring | 75.4
Build terminal UI dashboard | from-scratch | 60.4
Build real-time portfolio risk calculator | backend | 56.6
Implement multi-tenant row-level security in Postgres | backend | 40.3
Build production website with auth and members area | frontend | 67.3
Optimize bloated React bundle under 500KB | frontend | 70.8
Fix auth bypass vulnerability | debugging | 28.0
Add file upload with S3 presigned URLs | backend | 80.9
Write Kubernetes manifests for Node.js microservice | full-stack | 81.1
Fix React hydration mismatch | frontend | 73.9
Write tests for untested legacy Flask service | code-review | 50.9
Write complex SQL report with window functions | backend | 50.1
Build CLI tool with subcommands and config | from-scratch | 42.3
Fix N+1 query in dashboard | backend | 55.9
Optimize slow Postgres queries in Flask app | backend | 74.1
Implement Stripe webhook handler | backend | 78.7
Add i18n with locale routing to Next.js app | full-stack | 63.8
Build codebase indexer for LLM context windows | from-scratch | 38.8
Build distributed node cluster with gossip protocol | from-scratch | 37.4
Add streaming SSE endpoint for LLM chat | backend | 85.3
Add retry logic and dead letter queue to Python task queue | backend | 9.6
Add rate limiting middleware | backend | 44.3
Remove AI slop and over-engineering from codebase | refactoring | 78.3
Debug and fix 6 broken database triggers and constraints | debugging | 75.5
Fix flaky test suite | debugging | 89.8
Find and fix 4 hidden backdoors in Flask app | debugging | 78.7
Add slash commands and moderation to Discord bot | backend | 69.8
Build REST API from scratch | from-scratch | 75.1
Write integration tests for payment flow | code-review | 66.0
Harden insecure Docker setup with 12 vulnerabilities | code-review | 83.5
Add caching layer to eliminate slow SSR page loads | full-stack | 82.4
Zero-downtime schema migration | full-stack | 70.5
Fix broken responsive layout | frontend | 71.3
Implement JWT auth middleware | backend | 45.7
Add WebSocket real-time updates | full-stack | 73.9
Build SaaS admin dashboard from scratch | from-scratch | 47.5
Build MCP server for database management | backend | 55.8
Add GraphQL layer over REST API | multi-language | 44.5
Fix hallucination and context window bugs in RAG agent | backend | 63.0
Fix Node.js stream backpressure causing OOM on large files | backend | 79.3
Fix deadlocking transaction patterns in Flask app | backend | 72.8
Implement transformer inference engine with KV cache | from-scratch | 43.0
Replace console.log with structured logging | refactoring | 36.4
Find and patch all OWASP Top 10 vulnerabilities | debugging | 66.3
Add Google OAuth2 login to Express app | full-stack | 66.3
Debug race condition in worker pool | debugging | 82.4
Fix race conditions in order matching engine | backend | 56.4
Build materialized view refresh pipeline for analytics | backend | 74.8
Add Redis caching layer to Express API | backend | 81.7
Add cursor-based pagination to REST API | backend | 47.3
Dockerize Node.js monorepo | full-stack | 66.6
Split 1100-line god file into proper modules | refactoring | 50.3
Fix memory leak in event handler | debugging | 44.9
Fix broken GitHub Actions CI pipeline | debugging | 93.0
Fix 12 WCAG accessibility violations in checkout form | frontend | 83.3
Convert React app to PWA with offline support | frontend | 75.9
Add virtual scrolling to table rendering 5000 rows | frontend | 45.5
Implement zero-trust API authentication layer | backend | 28.0
Port Python CLI to Rust | multi-language | 35.5
Code review: identify security vulns | code-review | 49.1
Build LLM evaluation harness with structured grading | backend | 47.8
Refactor monolithic handler to CQRS | refactoring | 67.6
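
As a cross-check, the Avg Score figure in the summary can be reproduced from the 65 per-task scores listed above. The sketch below assumes Avg Score is a plain unweighted mean of those scores, which the page does not state. The list simply copies the Score column in order.

```python
from statistics import mean

# Per-task scores copied from the All Results list, in the order shown.
scores = [
    53.0, 71.8, 42.6, 75.4, 60.4, 56.6, 40.3, 67.3, 70.8, 28.0,
    80.9, 81.1, 73.9, 50.9, 50.1, 42.3, 55.9, 74.1, 78.7, 63.8,
    38.8, 37.4, 85.3, 9.6, 44.3, 78.3, 75.5, 89.8, 78.7, 69.8,
    75.1, 66.0, 83.5, 82.4, 70.5, 71.3, 45.7, 73.9, 47.5, 55.8,
    44.5, 63.0, 79.3, 72.8, 43.0, 36.4, 66.3, 66.3, 82.4, 56.4,
    74.8, 81.7, 47.3, 66.6, 50.3, 44.9, 93.0, 83.3, 75.9, 45.5,
    28.0, 35.5, 49.1, 47.8, 67.6,
]

run_count = len(scores)
# Assumption: Avg Score is the unweighted arithmetic mean of all runs.
avg_score = round(mean(scores), 1)

print(run_count, avg_score)
```

The count matches the 65 runs reported in the summary, and the unweighted mean rounds to the reported 61.5, so the listed results appear to be the complete set behind that figure.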