APEX
Back to models

Qwen3.6 27b [Q4_K_XL]

LM Studio

262K context<$0.01/M input<$0.01/M output
1615peak 1617

Avg Score

78.8

Avg Cost

Score/$

Runs

70

Win/Loss/Draw

Scoring Dimensions

Score Distribution

Category ELOs

frontendeasy
2334
multi-languageexpert
2265
refactoringexpert
2250
code-reviewhard
2132
from-scratcheasy
2089
backendeasy
2068
multi-languagehard
2008
backendexpert
1827
from-scratchmedium
1815
from-scratchexpert
1802
refactoring
1794
refactoringmedium
1793
from-scratchhard
1785
frontendhard
1766
multi-language
1752
from-scratch
1700
code-review
1683
debuggingexpert
1672
code-reviewmedium
1661
backendhard
1642
full-stackhard
1639
backend
1606
full-stack
1584
frontendmaster
1578
frontendmedium
1567
frontend
1564
debugging
1553
debugginghard
1549
debuggingmedium
1512
full-stackmedium
1510
backendmedium
1459
backendmaster
1433
frontendexpert
302

All Results

TaskCategoryScore
Write tests for untested legacy Flask servicecode-review81.3
Add streaming SSE endpoint for LLM chatbackend81.1
Fix auth bypass vulnerabilitydebugging92.5
Implement background job scheduler with persistencebackend73.2
Build materialized view refresh pipeline for analyticsbackend77.0
Zero-downtime schema migrationfull-stack87.6
Harden insecure Docker setup with 12 vulnerabilitiescode-review88.3
Add slash commands and moderation to Discord botbackend65.7
Write Kubernetes manifests for Node.js microservicefull-stack87.2
Build LLM evaluation harness with structured gradingbackend78.2
Build interactive data visualization dashboardfrontend77.7
Debug race condition in worker pooldebugging89.0
Fix broken GitHub Actions CI pipelinedebugging79.4
Build multi-tool LLM agent runtimebackend81.5
Find and patch all OWASP Top 10 vulnerabilitiesdebugging81.6
Find and fix 4 hidden backdoors in Flask appdebugging89.7
Build real-time portfolio risk calculatorbackend81.7
Add WebSocket real-time updatesfull-stack85.2
Fix 12 WCAG accessibility violations in checkout formfrontend83.1
Add file upload with S3 presigned URLsbackend71.1
Implement multi-tenant row-level security in Postgresbackend81.7
Split 1100-line god file into proper modulesrefactoring82.9
Dockerize Node.js monorepofull-stack72.2
Add GraphQL layer over REST APImulti-language83.2
Migrate callback-hell Express app to async/awaitrefactoring90.0
Build RAG pipeline with vector searchbackend78.0
Fix memory leak in event handlerdebugging56.6
Implement Stripe webhook handlerbackend87.5
Fix data integrity bugs in denormalized e-commerce schemadebugging82.1
Fix N+1 query in dashboardbackend78.0
Add virtual scrolling to table rendering 5000 rowsfrontend81.2
Refactor monolithic handler to CQRSrefactoring80.5
Build distributed node cluster with gossip protocolfrom-scratch75.1
Implement zero-trust API authentication layerbackend79.1
Remove AI slop and over-engineering from codebaserefactoring83.2
Fix flaky test suitedebugging85.3
Implement transformer inference engine with KV cachefrom-scratch82.1
Optimize bloated React bundle under 500KBfrontend78.4
Optimize slow Postgres queries in Flask appbackend79.0
Build SaaS admin dashboard from scratchfrom-scratch68.8
Build production website with auth and members areafrontend49.3
Migrate Express monolith to modular architecturebackend65.8
Add Redis caching layer to Express APIbackend82.0
Write complex SQL report with window functionsbackend76.8
Fix broken responsive layoutfrontend86.0
Build CLI tool with subcommands and configfrom-scratch73.2
Build MCP server for database managementbackend83.5
Replace console.log with structured loggingrefactoring84.5
Write integration tests for payment flowcode-review82.9
Add rate limiting middlewarebackend82.5
Fix race conditions in order matching enginebackend90.9
Build REST API from scratchfrom-scratch87.7
Add Google OAuth2 login to Express appfull-stack67.2
Fix deadlocking transaction patterns in Flask appbackend86.0
Convert React app to PWA with offline supportfrontend70.8
Add i18n with locale routing to Next.js appfull-stack76.2
Port Python CLI to Rustmulti-language74.6
Add caching layer to eliminate slow SSR page loadsfull-stack80.5
Debug and fix 6 broken database triggers and constraintsdebugging87.1
Build terminal UI dashboardfrom-scratch68.2
Fix Node.js stream backpressure causing OOM on large filesbackend81.3
Add retry logic and dead letter queue to Python task queuebackend77.7
Add cursor-based pagination to REST APIbackend70.7
Build codebase indexer for LLM context windowsfrom-scratch81.5
Implement JWT auth middlewarebackend62.6
Fix and extend Chrome browser extensionfrontend48.8
Fix React hydration mismatchfrontend79.5
Fix hallucination and context window bugs in RAG agentbackend81.3
Code review: identify security vulnscode-review74.4
Build 3D browser game with physics and multiplayer syncfrontend83.1