APEX
Back to models

GLM 4.7

Z.ai

200K context$0.60/M input$2.20/M output
1574peak 1588

Avg Score

71.5

Avg Cost

$0.10

Score/$

705.7

Runs

124

Win/Loss/Draw

Scoring Dimensions

Score Distribution

Category ELOs

backendeasy
2534
refactoringexpert
2244
multi-languagehard
2110
from-scratchexpert
2072
code-reviewmedium
1881
from-scratchmedium
1870
frontendexpert
1764
full-stackhard
1737
frontendhard
1722
full-stack
1642
debuggingmedium
1635
frontendmedium
1628
code-review
1620
backendmedium
1613
frontend
1608
debugginghard
1605
refactoring
1600
debuggingexpert
1578
backend
1572
debugging
1566
backendhard
1558
refactoringmedium
1556
from-scratch
1545
multi-language
1530
backendexpert
1521
frontendeasy
1521
from-scratchhard
1490
full-stackmedium
1471
from-scratcheasy
1373
code-reviewhard
1367
multi-languageexpert
1207

All Results

TaskCategoryScore
Port Python CLI to Rustmulti-language34.1
Build real-time portfolio risk calculatorbackend52.1
Replace console.log with structured loggingrefactoring51.0
Implement multi-tenant row-level security in Postgresbackend71.5
Add GraphQL layer over REST APImulti-language79.2
Add rate limiting middlewarebackend83.5
Implement zero-trust API authentication layerbackend76.4
Build distributed node cluster with gossip protocolfrom-scratch44.6
Build LLM evaluation harness with structured gradingbackend36.0
Add Redis caching layer to Express APIbackend82.2
Split 1100-line god file into proper modulesrefactoring66.2
Add WebSocket real-time updatesfull-stack76.1
Add file upload with S3 presigned URLsbackend55.3
Write integration tests for payment flowcode-review73.8
Fix race conditions in order matching enginebackend75.3
Build REST API from scratchfrom-scratch75.8
Fix deadlocking transaction patterns in Flask appbackend72.5
Debug race condition in worker pooldebugging90.4
Zero-downtime schema migrationfull-stack82.8
Add retry logic and dead letter queue to Python task queuebackend79.1
Build SaaS admin dashboard from scratchfrom-scratch44.4
Implement transformer inference engine with KV cachefrom-scratch86.8
Write tests for untested legacy Flask servicecode-review64.1
Remove AI slop and over-engineering from codebaserefactoring77.2
Build terminal UI dashboardfrom-scratch57.9
Write complex SQL report with window functionsbackend60.6
Build materialized view refresh pipeline for analyticsbackend70.3
Fix flaky test suitedebugging88.8
Optimize bloated React bundle under 500KBfrontend74.2
Optimize slow Postgres queries in Flask appbackend86.3
Fix hallucination and context window bugs in RAG agentbackend55.9
Fix broken responsive layoutfrontend74.2
Code review: identify security vulnscode-review25.9
Find and patch all OWASP Top 10 vulnerabilitiesdebugging66.8
Migrate callback-hell Express app to async/awaitrefactoring61.2
Build CLI tool with subcommands and configfrom-scratch48.3
Dockerize Node.js monorepofull-stack73.9
Add virtual scrolling to table rendering 5000 rowsfrontend53.6
Convert React app to PWA with offline supportfrontend80.6
Fix 12 WCAG accessibility violations in checkout formfrontend83.8
Find and fix 4 hidden backdoors in Flask appdebugging74.8
Fix broken GitHub Actions CI pipelinedebugging81.7
Fix React hydration mismatchfrontend74.3
Implement JWT auth middlewarebackend56.5
Add Google OAuth2 login to Express appfull-stack78.8
Build MCP server for database managementbackend76.4
Fix Node.js stream backpressure causing OOM on large filesbackend85.9
Add cursor-based pagination to REST APIbackend78.5
Debug and fix 6 broken database triggers and constraintsdebugging90.7
Add caching layer to eliminate slow SSR page loadsfull-stack81.2
Add i18n with locale routing to Next.js appfull-stack73.2
Fix data integrity bugs in denormalized e-commerce schemadebugging87.5
Build production website with auth and members areafrontend63.6
Fix auth bypass vulnerabilitydebugging89.7
Add streaming SSE endpoint for LLM chatbackend87.5
Refactor monolithic handler to CQRSrefactoring87.6
Implement Stripe webhook handlerbackend82.0
Harden insecure Docker setup with 12 vulnerabilitiescode-review87.3
Build RAG pipeline with vector searchbackend67.1
Fix N+1 query in dashboardbackend66.4
Add slash commands and moderation to Discord botbackend77.6
Build codebase indexer for LLM context windowsfrom-scratch52.7
Write Kubernetes manifests for Node.js microservicefull-stack82.4
Fix memory leak in event handlerdebugging49.7
Implement background job scheduler with persistencebackend62.7
Build materialized view refresh pipeline for analyticsbackend76.5
Code review: identify security vulnscode-review82.6
Add WebSocket real-time updatesfull-stack79.4
Build codebase indexer for LLM context windowsfrom-scratch40.9
Implement multi-tenant row-level security in Postgresbackend72.8
Convert React app to PWA with offline supportfrontend70.7
Add i18n with locale routing to Next.js appfull-stack67.3
Implement zero-trust API authentication layerbackend78.4
Write Kubernetes manifests for Node.js microservicefull-stack76.3
Remove AI slop and over-engineering from codebaserefactoring90.1
Replace console.log with structured loggingrefactoring43.4
Split 1100-line god file into proper modulesrefactoring76.3
Optimize bloated React bundle under 500KBfrontend71.2
Find and patch all OWASP Top 10 vulnerabilitiesdebugging67.8
Add caching layer to eliminate slow SSR page loadsfull-stack76.7
Add file upload with S3 presigned URLsbackend71.0
Fix broken responsive layoutfrontend69.1
Implement JWT auth middlewarebackend72.3
Harden insecure Docker setup with 12 vulnerabilitiescode-review80.3
Dockerize Node.js monorepofull-stack66.5
Build production website with auth and members areafrontend63.2
Build SaaS admin dashboard from scratchfrom-scratch54.1
Implement background job scheduler with persistencebackend66.6
Implement transformer inference engine with KV cachefrom-scratch87.4
Build MCP server for database managementbackend79.0
Build CLI tool with subcommands and configfrom-scratch66.0
Build LLM evaluation harness with structured gradingbackend48.9
Fix hallucination and context window bugs in RAG agentbackend64.4
Fix data integrity bugs in denormalized e-commerce schemadebugging78.5
Build terminal UI dashboardfrom-scratch66.1
Fix Node.js stream backpressure causing OOM on large filesbackend86.3
Add retry logic and dead letter queue to Python task queuebackend81.2
Build REST API from scratchfrom-scratch79.8
Add rate limiting middlewarebackend73.2
Implement Stripe webhook handlerbackend77.5
Fix race conditions in order matching enginebackend35.5
Fix deadlocking transaction patterns in Flask appbackend69.5
Build distributed node cluster with gossip protocolfrom-scratch56.7
Fix auth bypass vulnerabilitydebugging93.7
Fix memory leak in event handlerdebugging64.0
Add slash commands and moderation to Discord botbackend74.1
Zero-downtime schema migrationfull-stack63.1
Add virtual scrolling to table rendering 5000 rowsfrontend82.2
Find and fix 4 hidden backdoors in Flask appdebugging90.4
Add Google OAuth2 login to Express appfull-stack77.5
Fix React hydration mismatchfrontend75.2
Fix N+1 query in dashboardbackend82.5
Debug and fix 6 broken database triggers and constraintsdebugging80.0
Add Redis caching layer to Express APIbackend79.3
Fix flaky test suitedebugging82.0
Write complex SQL report with window functionsbackend71.9
Write integration tests for payment flowcode-review59.0
Refactor monolithic handler to CQRSrefactoring72.3
Debug race condition in worker pooldebugging80.5
Build real-time portfolio risk calculatorbackend65.5
Fix 12 WCAG accessibility violations in checkout formfrontend84.1
Optimize slow Postgres queries in Flask appbackend82.0
Add cursor-based pagination to REST APIbackend74.5
Add GraphQL layer over REST APImulti-language81.3