APEX
Back to models

Grok 4.1 Fast

OpenRouter

2000K context$0.20/M input$0.50/M output
1397peak 1413

Avg Score

65.8

Avg Cost

$0.02

Score/$

4128.3

Runs

65

Win/Loss/Draw

Scoring Dimensions

Score Distribution

Category ELOs

from-scratchmedium
1709
debuggingexpert
1685
multi-languagehard
1608
frontendeasy
1603
debuggingmedium
1578
debugging
1574
debugginghard
1531
backendeasy
1500
backendmedium
1455
full-stackhard
1425
backend
1421
full-stack
1408
backendhard
1402
frontendmedium
1398
frontend
1392
backendexpert
1373
full-stackmedium
1365
code-reviewmedium
1362
code-review
1355
multi-language
1347
code-reviewhard
1293
frontendhard
1261
refactoring
1254
refactoringmedium
1207
from-scratch
1152
frontendexpert
1114
refactoringexpert
1108
from-scratchexpert
1003
from-scratchhard
920
multi-languageexpert
808
from-scratcheasy
798

All Results

TaskCategoryScore
Implement background job scheduler with persistencebackend3.9
Build SaaS admin dashboard from scratchfrom-scratch23.8
Build CLI tool with subcommands and configfrom-scratch10.0
Fix N+1 query in dashboardbackend63.5
Fix flaky test suitedebugging93.2
Debug and fix 6 broken database triggers and constraintsdebugging90.8
Write complex SQL report with window functionsbackend70.5
Find and patch all OWASP Top 10 vulnerabilitiesdebugging69.7
Dockerize Node.js monorepofull-stack71.1
Optimize slow Postgres queries in Flask appbackend82.0
Find and fix 4 hidden backdoors in Flask appdebugging93.5
Add i18n with locale routing to Next.js appfull-stack64.7
Implement JWT auth middlewarebackend45.1
Build real-time portfolio risk calculatorbackend64.0
Code review: identify security vulnscode-review82.9
Fix data integrity bugs in denormalized e-commerce schemadebugging89.3
Write Kubernetes manifests for Node.js microservicefull-stack87.5
Fix React hydration mismatchfrontend78.3
Build terminal UI dashboardfrom-scratch66.3
Write integration tests for payment flowcode-review40.7
Add Google OAuth2 login to Express appfull-stack82.5
Add slash commands and moderation to Discord botbackend63.4
Fix Node.js stream backpressure causing OOM on large filesbackend80.1
Write tests for untested legacy Flask servicecode-review35.0
Build REST API from scratchfrom-scratch67.7
Fix broken GitHub Actions CI pipelinedebugging74.5
Fix hallucination and context window bugs in RAG agentbackend86.4
Add rate limiting middlewarebackend74.0
Build distributed node cluster with gossip protocolfrom-scratch41.5
Implement transformer inference engine with KV cachefrom-scratch70.8
Add Redis caching layer to Express APIbackend62.4
Build materialized view refresh pipeline for analyticsbackend54.5
Port Python CLI to Rustmulti-language41.0
Implement zero-trust API authentication layerbackend32.5
Build MCP server for database managementbackend88.2
Convert React app to PWA with offline supportfrontend71.8
Fix broken responsive layoutfrontend75.3
Replace console.log with structured loggingrefactoring62.7
Fix memory leak in event handlerdebugging75.6
Add WebSocket real-time updatesfull-stack67.3
Add cursor-based pagination to REST APIbackend70.8
Add retry logic and dead letter queue to Python task queuebackend81.3
Build codebase indexer for LLM context windowsfrom-scratch38.2
Build production website with auth and members areafrontend60.5
Fix 12 WCAG accessibility violations in checkout formfrontend74.7
Implement Stripe webhook handlerbackend77.5
Add caching layer to eliminate slow SSR page loadsfull-stack57.1
Add GraphQL layer over REST APImulti-language75.8
Remove AI slop and over-engineering from codebaserefactoring68.9
Add streaming SSE endpoint for LLM chatbackend84.4
Migrate callback-hell Express app to async/awaitrefactoring54.9
Add file upload with S3 presigned URLsbackend82.5
Split 1100-line god file into proper modulesrefactoring58.1
Refactor monolithic handler to CQRSrefactoring57.4
Debug race condition in worker pooldebugging93.7
Build RAG pipeline with vector searchbackend31.4
Fix race conditions in order matching enginebackend78.1
Fix deadlocking transaction patterns in Flask appbackend68.8
Fix auth bypass vulnerabilitydebugging71.7
Implement multi-tenant row-level security in Postgresbackend42.3
Optimize bloated React bundle under 500KBfrontend76.2
Harden insecure Docker setup with 12 vulnerabilitiescode-review77.6
Zero-downtime schema migrationfull-stack73.0
Build LLM evaluation harness with structured gradingbackend79.5
Add virtual scrolling to table rendering 5000 rowsfrontend43.0