APEX
Qwen3.5 35b A3b [Q4_K_XL]

LM Studio

262K context, <$0.01/M input, <$0.01/M output
ELO 1240 (peak 1286)

Avg Score: 40.5
Avg Cost: $0.05
Score/$: 774.8
Runs: 105

Category ELOs

Category | Difficulty | ELO
multi-language | expert | 1754
multi-language | hard | 1572
debugging | hard | 1551
multi-language | - | 1519
full-stack | hard | 1487
backend | easy | 1455
debugging | - | 1336
frontend | medium | 1311
full-stack | - | 1310
backend | hard | 1294
from-scratch | easy | 1290
backend | - | 1236
frontend | - | 1216
backend | medium | 1160
backend | expert | 1150
code-review | - | 1132
code-review | medium | 1125
from-scratch | - | 1104
debugging | expert | 969
from-scratch | hard | 956
refactoring | medium | 942
refactoring | - | 912
frontend | hard | 725
from-scratch | expert | 475
frontend | expert | 473
code-review | hard | 363
full-stack | medium | 0
refactoring | expert | 0
from-scratch | medium | 0
debugging | medium | 0
frontend | easy | 0

All Results

Task | Category | Score
Write tests for untested legacy Flask service | code-review | 0.0
Add GraphQL layer over REST API | multi-language |
Build MCP server for database management | backend | 0.0
Implement Stripe webhook handler | backend |
Write Kubernetes manifests for Node.js microservice | full-stack | 28.0
Implement multi-tenant row-level security in Postgres | backend | 22.0
Harden insecure Docker setup with 12 vulnerabilities | code-review | 28.0
Remove AI slop and over-engineering from codebase | refactoring | 28.0
Add file upload with S3 presigned URLs | backend |
Fix hallucination and context window bugs in RAG agent | backend | 0.0
Split 1100-line god file into proper modules | refactoring | 28.0
Debug race condition in worker pool | debugging | 28.0
Add caching layer to eliminate slow SSR page loads | full-stack | 28.0
Build codebase indexer for LLM context windows | from-scratch | 0.0
Convert React app to PWA with offline support | frontend | 28.0
Fix memory leak in event handler | debugging |
Fix deadlocking transaction patterns in Flask app | backend | 22.0
Build terminal UI dashboard | from-scratch | 0.0
Fix Node.js stream backpressure causing OOM on large files | backend |
Build real-time portfolio risk calculator | backend | 0.0
Find and patch all OWASP Top 10 vulnerabilities | debugging | 28.0
Add retry logic and dead letter queue to Python task queue | backend | 28.0
Add Google OAuth2 login to Express app | full-stack |
Optimize slow Postgres queries in Flask app | backend | 28.0
Migrate callback-hell Express app to async/await | refactoring | 28.0
Add Redis caching layer to Express API | backend |
Build SaaS admin dashboard from scratch | from-scratch | 0.0
Implement transformer inference engine with KV cache | from-scratch | 28.0
Debug and fix 6 broken database triggers and constraints | debugging | 0.0
Implement background job scheduler with persistence | backend | 0.0
Code review: identify security vulns | code-review | 28.0
Build distributed node cluster with gossip protocol | from-scratch | 0.0
Fix flaky test suite | debugging | 28.0
Build REST API from scratch | from-scratch | 22.0
Fix 12 WCAG accessibility violations in checkout form | frontend | 28.0
Build LLM evaluation harness with structured grading | backend | 28.0
Fix N+1 query in dashboard | backend | 0.0
Optimize bloated React bundle under 500KB | frontend | 28.0
Fix race conditions in order matching engine | backend | 22.0
Implement zero-trust API authentication layer | backend | 0.0
Dockerize Node.js monorepo | full-stack | 28.0
Fix broken responsive layout | frontend | 22.0
Add i18n with locale routing to Next.js app | full-stack | 28.0
Build production website with auth and members area | frontend | 22.0
Fix data integrity bugs in denormalized e-commerce schema | debugging | 28.0
Fix auth bypass vulnerability | debugging |
Refactor monolithic handler to CQRS | refactoring | 46.2
Fix broken GitHub Actions CI pipeline | debugging | 47.9
Write complex SQL report with window functions | backend | 61.9
Implement JWT auth middleware | backend | 51.8
Zero-downtime schema migration | full-stack | 90.0
Build CLI tool with subcommands and config | from-scratch | 47.9
Port Python CLI to Rust | multi-language | 41.5
Add slash commands and moderation to Discord bot | backend | 68.0
Add streaming SSE endpoint for LLM chat | backend | 45.5
Add rate limiting middleware | backend | 69.4
Add WebSocket real-time updates | full-stack | 73.8
Write integration tests for payment flow | code-review | 46.0
Replace console.log with structured logging | refactoring | 42.3
Add virtual scrolling to table rendering 5000 rows | frontend | 42.5
Find and fix 4 hidden backdoors in Flask app | debugging | 87.4
Fix React hydration mismatch | frontend | 50.0
Build RAG pipeline with vector search | backend | 44.5
Build materialized view refresh pipeline for analytics | backend | 83.1
Add cursor-based pagination to REST API | backend | 42.0
Build MCP server for database management | backend | 54.0
Implement transformer inference engine with KV cache | from-scratch | 58.7
Build SaaS admin dashboard from scratch | from-scratch | 51.8
Implement background job scheduler with persistence | backend | 28.7
Build production website with auth and members area | frontend | 45.5
Build CLI tool with subcommands and config | from-scratch | 2.6
Fix hallucination and context window bugs in RAG agent | backend | 45.0
Build LLM evaluation harness with structured grading | backend | 37.7
Build real-time portfolio risk calculator | backend | 47.0
Fix race conditions in order matching engine | backend | 81.9
Fix data integrity bugs in denormalized e-commerce schema | debugging | 54.9
Build materialized view refresh pipeline for analytics | backend | 47.9
Fix deadlocking transaction patterns in Flask app | backend | 46.3
Debug and fix 6 broken database triggers and constraints | debugging | 52.1
Write complex SQL report with window functions | backend | 44.0
Find and fix 4 hidden backdoors in Flask app | debugging | 89.0
Add Redis caching layer to Express API | backend | 69.2
Write tests for untested legacy Flask service | code-review | 50.1
Optimize slow Postgres queries in Flask app | backend | 74.4
Add slash commands and moderation to Discord bot | backend | 36.6
Add retry logic and dead letter queue to Python task queue | backend | 66.8
Fix Node.js stream backpressure causing OOM on large files | backend | 73.8
Add virtual scrolling to table rendering 5000 rows | frontend | 74.1
Fix 12 WCAG accessibility violations in checkout form | frontend | 68.8
Build distributed node cluster with gossip protocol | from-scratch | 19.9
Fix auth bypass vulnerability | debugging | 92.3
Add GraphQL layer over REST API | multi-language | 69.8
Write integration tests for payment flow | code-review | 41.7
Zero-downtime schema migration | full-stack | 48.0
Add rate limiting middleware | backend | 66.1
Implement Stripe webhook handler | backend | 45.3
Fix flaky test suite | debugging | 41.0
Add cursor-based pagination to REST API | backend | 34.5
Fix N+1 query in dashboard | backend | 36.8
Fix memory leak in event handler | debugging | 56.7
Refactor monolithic handler to CQRS | refactoring | 31.8
Debug race condition in worker pool | debugging | 88.5
Fix React hydration mismatch | frontend | 72.5
Build terminal UI dashboard | from-scratch | 28.7
Build REST API from scratch | from-scratch | 77.9