APEX
Back to models

GLM 4.7 Flash

OpenRouter

203K context$0.06/M input$0.40/M output
1299peak 1315

Avg Score

55.4

Avg Cost

$0.02

Score/$

3241.2

Runs

107

Win/Loss/Draw

Scoring Dimensions

Score Distribution

Category ELOs

frontendeasy
2003
from-scratchmedium
1922
debuggingmedium
1656
frontendexpert
1655
refactoringexpert
1621
code-reviewhard
1558
frontendhard
1540
debugging
1361
code-review
1354
full-stackmedium
1342
refactoring
1336
debugginghard
1336
frontend
1330
full-stack
1322
full-stackhard
1308
code-reviewmedium
1294
from-scratcheasy
1290
backendmedium
1284
debuggingexpert
1281
backend
1265
backendhard
1258
refactoringmedium
1258
from-scratch
1229
backendexpert
1211
from-scratchhard
1138
frontendmedium
1132
multi-language
1054
multi-languageexpert
985
backendeasy
650
from-scratchexpert
406
multi-languagehard
58

All Results

TaskCategoryScore
Fix hallucination and context window bugs in RAG agentbackend41.6
Build distributed node cluster with gossip protocolfrom-scratch25.4
Port Python CLI to Rustmulti-language29.1
Replace console.log with structured loggingrefactoring56.1
Debug and fix 6 broken database triggers and constraintsdebugging74.5
Harden insecure Docker setup with 12 vulnerabilitiescode-review54.0
Fix flaky test suitedebugging37.5
Add rate limiting middlewarebackend43.6
Build codebase indexer for LLM context windowsfrom-scratch14.8
Fix data integrity bugs in denormalized e-commerce schemadebugging37.6
Optimize slow Postgres queries in Flask appbackend41.9
Code review: identify security vulnscode-review80.2
Write tests for untested legacy Flask servicecode-review22.8
Write complex SQL report with window functionsbackend60.8
Implement background job scheduler with persistencebackend40.7
Zero-downtime schema migrationfull-stack69.3
Add GraphQL layer over REST APImulti-language44.3
Add streaming SSE endpoint for LLM chatbackend81.8
Add file upload with S3 presigned URLsbackend0.0
Implement Stripe webhook handlerbackend85.2
Build CLI tool with subcommands and configfrom-scratch30.9
Add cursor-based pagination to REST APIbackend77.0
Add Redis caching layer to Express APIbackend68.7
Fix 12 WCAG accessibility violations in checkout formfrontend71.5
Refactor monolithic handler to CQRSrefactoring40.7
Find and patch all OWASP Top 10 vulnerabilitiesdebugging57.6
Add virtual scrolling to table rendering 5000 rowsfrontend52.5
Add caching layer to eliminate slow SSR page loadsfull-stack80.5
Migrate callback-hell Express app to async/awaitrefactoring30.7
Remove AI slop and over-engineering from codebaserefactoring67.3
Implement transformer inference engine with KV cachefrom-scratch29.8
Build production website with auth and members areafrontend61.9
Build materialized view refresh pipeline for analyticsbackend61.3
Build MCP server for database managementbackend57.4
Optimize bloated React bundle under 500KBfrontend66.8
Add i18n with locale routing to Next.js appfull-stack62.0
Fix N+1 query in dashboardbackend26.9
Add WebSocket real-time updatesfull-stack63.0
Implement JWT auth middlewarebackend46.6
Fix auth bypass vulnerabilitydebugging90.3
Implement multi-tenant row-level security in Postgresbackend47.8
Write integration tests for payment flowcode-review39.5
Fix React hydration mismatchfrontend67.7
Debug race condition in worker pooldebugging74.1
Fix deadlocking transaction patterns in Flask appbackend68.3
Build REST API from scratchfrom-scratch46.7
Build RAG pipeline with vector searchbackend37.3
Fix memory leak in event handlerdebugging46.7
Find and fix 4 hidden backdoors in Flask appdebugging52.4
Build LLM evaluation harness with structured gradingbackend29.3
Fix broken responsive layoutfrontend77.5
Build terminal UI dashboardfrom-scratch58.5
Split 1100-line god file into proper modulesrefactoring57.8
Implement zero-trust API authentication layerbackend31.0
Find and patch all OWASP Top 10 vulnerabilitiesdebugging68.8
Implement JWT auth middlewarebackend78.0
Convert React app to PWA with offline supportfrontend35.8
Build codebase indexer for LLM context windowsfrom-scratch24.0
Implement multi-tenant row-level security in Postgresbackend61.9
Replace console.log with structured loggingrefactoring53.0
Optimize bloated React bundle under 500KBfrontend65.7
Write Kubernetes manifests for Node.js microservicefull-stack78.1
Remove AI slop and over-engineering from codebaserefactoring77.2
Add i18n with locale routing to Next.js appfull-stack57.1
Harden insecure Docker setup with 12 vulnerabilitiescode-review58.6
Dockerize Node.js monorepofull-stack68.8
Add caching layer to eliminate slow SSR page loadsfull-stack71.7
Fix broken responsive layoutfrontend70.5
Split 1100-line god file into proper modulesrefactoring65.2
Implement transformer inference engine with KV cachefrom-scratch57.0
Add virtual scrolling to table rendering 5000 rowsfrontend43.8
Write tests for untested legacy Flask servicecode-review45.3
Build distributed node cluster with gossip protocolfrom-scratch20.8
Find and fix 4 hidden backdoors in Flask appdebugging74.2
Build real-time portfolio risk calculatorbackend40.0
Fix hallucination and context window bugs in RAG agentbackend52.5
Fix N+1 query in dashboardbackend42.5
Zero-downtime schema migrationfull-stack53.0
Implement background job scheduler with persistencebackend32.8
Optimize slow Postgres queries in Flask appbackend67.0
Fix Node.js stream backpressure causing OOM on large filesbackend83.7
Build MCP server for database managementbackend57.5
Build production website with auth and members areafrontend46.4
Write complex SQL report with window functionsbackend48.8
Build CLI tool with subcommands and configfrom-scratch44.5
Fix deadlocking transaction patterns in Flask appbackend45.3
Build LLM evaluation harness with structured gradingbackend44.0
Add slash commands and moderation to Discord botbackend57.0
Fix data integrity bugs in denormalized e-commerce schemadebugging45.0
Debug and fix 6 broken database triggers and constraintsdebugging41.8
Fix 12 WCAG accessibility violations in checkout formfrontend79.3
Add retry logic and dead letter queue to Python task queuebackend40.0
Build SaaS admin dashboard from scratchfrom-scratch47.5
Fix race conditions in order matching enginebackend82.2
Fix auth bypass vulnerabilitydebugging82.0
Add rate limiting middlewarebackend56.9
Refactor monolithic handler to CQRSrefactoring64.7
Write integration tests for payment flowcode-review63.5
Add cursor-based pagination to REST APIbackend59.5
Implement Stripe webhook handlerbackend78.0
Fix flaky test suitedebugging83.3
Build terminal UI dashboardfrom-scratch38.4
Code review: identify security vulnscode-review63.7
Fix memory leak in event handlerdebugging61.3
Debug race condition in worker pooldebugging80.4
Fix React hydration mismatchfrontend69.7
Build REST API from scratchfrom-scratch77.2