APEX

GLM 5.2

Z.ai Coding Plan

1000K context, $1.40/M input, $4.40/M output
1795 (peak 1892)

Avg Score: 85.3
Avg Cost: $0.39
Score/$: 217.1
Runs: 70

[Charts not shown: Win/Loss/Draw, Scoring Dimensions, Score Distribution]

Category ELOs

multi-language (expert): 2921
from-scratch (medium): 2769
refactoring (expert): 2569
frontend (expert): 2459
from-scratch (easy): 2388
multi-language (hard): 2349
code-review (hard): 2307
backend (easy): 2170
from-scratch (hard): 2123
multi-language: 2095
from-scratch: 1979
code-review (medium): 1973
backend (expert): 1969
code-review: 1943
frontend (hard): 1942
from-scratch (expert): 1922
refactoring: 1906
refactoring (medium): 1889
full-stack (hard): 1886
debugging (expert): 1867
full-stack: 1860
full-stack (medium): 1854
backend (hard): 1817
debugging (medium): 1776
backend: 1771
frontend (easy): 1771
frontend: 1732
debugging: 1719
frontend (medium): 1712
backend (medium): 1665
debugging (hard): 1656
frontend (master): 1604
backend (master): 1572

All Results

Task | Category | Score
Fix hallucination and context window bugs in RAG agent | backend | 87.9
Implement background job scheduler with persistence | backend | 81.5
Find and patch all OWASP Top 10 vulnerabilities | debugging | 88.1
Fix N+1 query in dashboard | backend | 77.7
Build CLI tool with subcommands and config | from-scratch | 88.4
Build materialized view refresh pipeline for analytics | backend | 85.5
Write Kubernetes manifests for Node.js microservice | full-stack | 90.3
Fix data integrity bugs in denormalized e-commerce schema | debugging | 90.5
Build 3D browser game with physics and multiplayer sync | frontend | 71.0
Fix memory leak in event handler | debugging | 78.1
Build interactive data visualization dashboard | frontend | 79.5
Implement multi-tenant row-level security in Postgres | backend | 87.0
Build MCP server for database management | backend | 83.5
Add virtual scrolling to table rendering 5000 rows | frontend | 85.6
Write complex SQL report with window functions | backend | 78.5
Optimize bloated React bundle under 500KB | frontend | 73.0
Fix broken responsive layout | frontend | 79.6
Implement Stripe webhook handler | backend | 87.9
Fix Node.js stream backpressure causing OOM on large files | backend | 91.9
Build SaaS admin dashboard from scratch | from-scratch | 87.9
Port Python CLI to Rust | multi-language | 89.0
Fix auth bypass vulnerability | debugging | 90.4
Add i18n with locale routing to Next.js app | full-stack | 81.0
Zero-downtime schema migration | full-stack | 88.5
Build real-time portfolio risk calculator | backend | 86.5
Optimize slow Postgres queries in Flask app | backend | 84.3
Build multi-tool LLM agent runtime | backend | 73.8
Remove AI slop and over-engineering from codebase | refactoring | 88.3
Add caching layer to eliminate slow SSR page loads | full-stack | 87.0
Add slash commands and moderation to Discord bot | backend | 87.3
Implement transformer inference engine with KV cache | from-scratch | 83.4
Build codebase indexer for LLM context windows | from-scratch | 75.3
Fix flaky test suite | debugging | 88.7
Implement zero-trust API authentication layer | backend | 86.5
Fix and extend Chrome browser extension | frontend | 75.0
Build RAG pipeline with vector search | backend | 87.0
Debug race condition in worker pool | debugging | 93.0
Fix deadlocking transaction patterns in Flask app | backend | 90.4
Fix 12 WCAG accessibility violations in checkout form | frontend | 86.5
Convert React app to PWA with offline support | frontend | 85.7
Split 1100-line god file into proper modules | refactoring | 86.8
Add cursor-based pagination to REST API | backend | 86.9
Add Google OAuth2 login to Express app | full-stack | 85.7
Migrate callback-hell Express app to async/await | refactoring | 83.9
Build production website with auth and members area | frontend | 83.5
Build distributed node cluster with gossip protocol | from-scratch | 87.3
Fix race conditions in order matching engine | backend | 92.3
Write integration tests for payment flow | code-review | 87.3
Debug and fix 6 broken database triggers and constraints | debugging | 90.7
Add file upload with S3 presigned URLs | backend | 87.5
Build REST API from scratch | from-scratch | 94.8
Fix broken GitHub Actions CI pipeline | debugging | 91.8
Implement JWT auth middleware | backend | 68.2
Dockerize Node.js monorepo | full-stack | 86.2
Migrate Express monolith to modular architecture | backend | 94.6
Add Redis caching layer to Express API | backend | 86.6
Add rate limiting middleware | backend | 86.3
Add GraphQL layer over REST API | multi-language | 90.0
Build LLM evaluation harness with structured grading | backend | 81.8
Fix React hydration mismatch | frontend | 84.5
Refactor monolithic handler to CQRS | refactoring | 86.5
Code review: identify security vulns | code-review | 92.5
Write tests for untested legacy Flask service | code-review | 87.5
Add retry logic and dead letter queue to Python task queue | backend | 84.9
Add streaming SSE endpoint for LLM chat | backend | 59.9
Replace console.log with structured logging | refactoring | 89.6
Build terminal UI dashboard | from-scratch | 85.8
Harden insecure Docker setup with 12 vulnerabilities | code-review | 92.2
Find and fix 4 hidden backdoors in Flask app | debugging | 90.3
Add WebSocket real-time updates | full-stack | 87.8