APEX

GLM 4.5

Z.ai

131K context · $0.60/M input · $2.20/M output
1448 (peak 1464)

Avg Score: 65.2
Avg Cost: $0.10
Score/$: 657.0
Runs: 109
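
The summary figures above do not divide exactly: 65.2 / 0.10 is 652, not 657.0. One plausible reading, and it is only an assumption since the page does not define the metric, is that Score/$ is the average score divided by an unrounded average cost. Under that assumption, the cost can be backed out as a quick sanity check:

```python
# Assumption (not stated on the page): Score/$ = Avg Score / Avg Cost,
# computed from an unrounded average cost and displayed to one decimal.
avg_score = 65.2          # "Avg Score" as displayed
score_per_dollar = 657.0  # "Score/$" as displayed

# Back out the average cost implied by the two displayed figures.
implied_avg_cost = avg_score / score_per_dollar

print(f"implied avg cost: ${implied_avg_cost:.4f}")
print(f"rounded to cents: ${implied_avg_cost:.2f}")
```

If the assumption holds, the implied cost is just under ten cents per run, which rounds to the displayed $0.10. If Score/$ is instead averaged per run, this back-calculation does not apply.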

[Charts not shown: Win/Loss/Draw, Scoring Dimensions, Score Distribution]

Category ELOs

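| Category | Difficulty | ELO |
| --- | --- | --- |
| multi-language | expert | 2128 |
| from-scratch | expert | 1863 |
| frontend | easy | 1817 |
| debugging | medium | 1791 |
| backend | easy | 1736 |
| backend | medium | 1552 |
| refactoring | expert | 1521 |
| backend | expert | 1499 |
| backend | - | 1498 |
| code-review | hard | 1477 |
| from-scratch | hard | 1472 |
| debugging | - | 1469 |
| backend | hard | 1441 |
| frontend | - | 1427 |
| frontend | medium | 1422 |
| debugging | hard | 1419 |
| from-scratch | - | 1402 |
| full-stack | hard | 1401 |
| full-stack | - | 1397 |
| code-review | - | 1390 |
| full-stack | medium | 1390 |
| from-scratch | easy | 1386 |
| debugging | expert | 1377 |
| refactoring | - | 1362 |
| code-review | medium | 1359 |
| refactoring | medium | 1322 |
| multi-language | - | 1297 |
| frontend | hard | 1205 |
| frontend | expert | 1003 |
| from-scratch | medium | 316 |
| multi-language | hard | 0 |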

All Results

| Task | Category | Score |
| --- | --- | --- |
| Implement background job scheduler with persistence | backend | 29.8 |
| Add caching layer to eliminate slow SSR page loads | full-stack | 76.8 |
| Write Kubernetes manifests for Node.js microservice | full-stack | 83.5 |
| Add virtual scrolling to table rendering 5000 rows | frontend | 72.9 |
| Build SaaS admin dashboard from scratch | from-scratch | 40.7 |
| Add cursor-based pagination to REST API | backend | 82.4 |
| Migrate callback-hell Express app to async/await | refactoring | 55.7 |
| Build codebase indexer for LLM context windows | from-scratch | 32.5 |
| Dockerize Node.js monorepo | full-stack | 68.8 |
| Fix memory leak in event handler | debugging | 70.9 |
| Build real-time portfolio risk calculator | backend | 54.9 |
| Implement transformer inference engine with KV cache | from-scratch | 84.0 |
| Write complex SQL report with window functions | backend | 77.4 |
| Write tests for untested legacy Flask service | code-review | 43.0 |
| Implement JWT auth middleware | backend | 47.1 |
| Fix race conditions in order matching engine | backend | 76.3 |
| Fix hallucination and context window bugs in RAG agent | backend | 34.7 |
| Convert React app to PWA with offline support | frontend | 67.7 |
| Build materialized view refresh pipeline for analytics | backend | 73.2 |
| Build LLM evaluation harness with structured grading | backend | 55.3 |
| Port Python CLI to Rust | multi-language | 45.9 |
| Zero-downtime schema migration | full-stack | 53.9 |
| Build production website with auth and members area | frontend | 52.1 |
| Fix 12 WCAG accessibility violations in checkout form | frontend | 72.0 |
| Implement Stripe webhook handler | backend | 72.7 |
| Build distributed node cluster with gossip protocol | from-scratch | 36.8 |
| Code review: identify security vulns | code-review | 78.7 |
| Build REST API from scratch | from-scratch | 75.7 |
| Fix deadlocking transaction patterns in Flask app | backend | 71.0 |
| Fix Node.js stream backpressure causing OOM on large files | backend | 84.5 |
| Find and patch all OWASP Top 10 vulnerabilities | debugging | 67.3 |
| Debug race condition in worker pool | debugging | 85.6 |
| Debug and fix 6 broken database triggers and constraints | debugging | 48.0 |
| Add Redis caching layer to Express API | backend | 77.4 |
| Add WebSocket real-time updates | full-stack | 73.1 |
| Write integration tests for payment flow | code-review | 76.5 |
| Fix N+1 query in dashboard | backend | 47.9 |
| Implement zero-trust API authentication layer | backend | 57.4 |
| Add i18n with locale routing to Next.js app | full-stack | 72.6 |
| Add streaming SSE endpoint for LLM chat | backend | 79.2 |
| Fix auth bypass vulnerability | debugging | 91.5 |
| Add rate limiting middleware | backend | 39.6 |
| Remove AI slop and over-engineering from codebase | refactoring | 77.9 |
| Find and fix 4 hidden backdoors in Flask app | debugging | 80.4 |
| Fix flaky test suite | debugging | 68.8 |
| Implement multi-tenant row-level security in Postgres | backend | 59.0 |
| Fix broken responsive layout | frontend | 76.8 |
| Fix broken GitHub Actions CI pipeline | debugging | 90.9 |
| Split 1100-line god file into proper modules | refactoring | 58.1 |
| Add Google OAuth2 login to Express app | full-stack | 30.4 |
| Add GraphQL layer over REST API | multi-language | 36.8 |
| Build RAG pipeline with vector search | backend | 77.7 |
| Port Python CLI to Rust | multi-language | 38.0 |
| Code review: identify security vulns | code-review | 70.3 |
| Add file upload with S3 presigned URLs | backend | 65.3 |
| Implement multi-tenant row-level security in Postgres | backend | 69.6 |
| Implement zero-trust API authentication layer | backend | 41.0 |
| Build codebase indexer for LLM context windows | from-scratch | 35.0 |
| Write Kubernetes manifests for Node.js microservice | full-stack | 74.8 |
| Add caching layer to eliminate slow SSR page loads | full-stack | 75.5 |
| Find and patch all OWASP Top 10 vulnerabilities | debugging | 70.0 |
| Add i18n with locale routing to Next.js app | full-stack | 72.6 |
| Remove AI slop and over-engineering from codebase | refactoring | 81.0 |
| Harden insecure Docker setup with 12 vulnerabilities | code-review | 74.7 |
| Split 1100-line god file into proper modules | refactoring | 70.7 |
| Optimize bloated React bundle under 500KB | frontend | 64.8 |
| Convert React app to PWA with offline support | frontend | 66.5 |
| Replace console.log with structured logging | refactoring | 47.8 |
| Fix broken responsive layout | frontend | 81.0 |
| Dockerize Node.js monorepo | full-stack | 68.9 |
| Write tests for untested legacy Flask service | code-review | 49.0 |
| Build CLI tool with subcommands and config | from-scratch | 55.0 |
| Build production website with auth and members area | frontend | 32.9 |
| Implement background job scheduler with persistence | backend | 62.5 |
| Build SaaS admin dashboard from scratch | from-scratch | 69.0 |
| Build MCP server for database management | backend | 78.2 |
| Implement transformer inference engine with KV cache | from-scratch | 73.3 |
| Build materialized view refresh pipeline for analytics | backend | 59.6 |
| Fix hallucination and context window bugs in RAG agent | backend | 53.8 |
| Build LLM evaluation harness with structured grading | backend | 23.5 |
| Fix race conditions in order matching engine | backend | 86.9 |
| Fix data integrity bugs in denormalized e-commerce schema | debugging | 78.3 |
| Build real-time portfolio risk calculator | backend | 52.1 |
| Fix deadlocking transaction patterns in Flask app | backend | 58.5 |
| Debug and fix 6 broken database triggers and constraints | debugging | 53.8 |
| Write complex SQL report with window functions | backend | 72.0 |
| Find and fix 4 hidden backdoors in Flask app | debugging | 84.0 |
| Add Redis caching layer to Express API | backend | 79.7 |
| Optimize slow Postgres queries in Flask app | backend | 66.0 |
| Add slash commands and moderation to Discord bot | backend | 75.5 |
| Add retry logic and dead letter queue to Python task queue | backend | 79.7 |
| Add virtual scrolling to table rendering 5000 rows | frontend | 62.2 |
| Fix 12 WCAG accessibility violations in checkout form | frontend | 74.3 |
| Fix Node.js stream backpressure causing OOM on large files | backend | 86.3 |
| Build distributed node cluster with gossip protocol | from-scratch | 43.3 |
| Fix auth bypass vulnerability | debugging | 92.2 |
| Write integration tests for payment flow | code-review | 65.9 |
| Implement Stripe webhook handler | backend | 63.8 |
| Add rate limiting middleware | backend | 78.1 |
| Zero-downtime schema migration | full-stack | 62.5 |
| Fix flaky test suite | debugging | 78.8 |
| Refactor monolithic handler to CQRS | refactoring | 63.5 |
| Add cursor-based pagination to REST API | backend | 64.4 |
| Fix N+1 query in dashboard | backend | 74.2 |
| Build terminal UI dashboard | from-scratch | 39.5 |
| Fix memory leak in event handler | debugging | 63.4 |
| Debug race condition in worker pool | debugging | 80.2 |
| Build REST API from scratch | from-scratch | 80.8 |
| Fix React hydration mismatch | frontend | 78.0 |