APEX
Back to models

GLM 4.7 [Q4_K_XL]

LM Studio

200K context$0.60/M input$2.20/M output
1451peak 1467

Avg Score

71.2

Avg Cost

$0.04

Score/$

2021.2

Runs

41

Win/Loss/Draw

Scoring Dimensions

Score Distribution

Category ELOs

from-scratchexpert
2103
from-scratcheasy
2051
frontendexpert
1888
backendeasy
1857
from-scratchmedium
1815
from-scratch
1659
multi-languagehard
1649
refactoringexpert
1607
from-scratchhard
1597
frontendhard
1586
frontend
1501
debugginghard
1486
refactoring
1481
multi-language
1460
backendmedium
1447
backend
1442
frontendmedium
1434
backendhard
1424
full-stack
1422
full-stackhard
1421
backendexpert
1391
debugging
1352
code-reviewmedium
1254
debuggingmedium
1253
code-review
1225
debuggingexpert
1078
code-reviewhard
581

All Results

TaskCategoryScore
Build production website with auth and members areafrontend72.5
Fix data integrity bugs in denormalized e-commerce schemadebugging63.4
Build LLM evaluation harness with structured gradingbackend74.4
Build MCP server for database managementbackend57.4
Implement background job scheduler with persistencebackend73.1
Fix hallucination and context window bugs in RAG agentbackend64.4
Build SaaS admin dashboard from scratchfrom-scratch73.6
Implement transformer inference engine with KV cachefrom-scratch87.5
Build real-time portfolio risk calculatorbackend58.9
Build CLI tool with subcommands and configfrom-scratch57.2
Fix race conditions in order matching enginebackend80.1
Fix deadlocking transaction patterns in Flask appbackend62.4
Debug and fix 6 broken database triggers and constraintsdebugging68.1
Write complex SQL report with window functionsbackend71.5
Find and fix 4 hidden backdoors in Flask appdebugging88.5
Add Redis caching layer to Express APIbackend50.9
Write tests for untested legacy Flask servicecode-review54.5
Add Google OAuth2 login to Express appfull-stack79.8
Optimize slow Postgres queries in Flask appbackend81.2
Add slash commands and moderation to Discord botbackend68.7
Add retry logic and dead letter queue to Python task queuebackend51.2
Fix Node.js stream backpressure causing OOM on large filesbackend89.8
Add virtual scrolling to table rendering 5000 rowsfrontend82.2
Fix 12 WCAG accessibility violations in checkout formfrontend80.0
Build distributed node cluster with gossip protocolfrom-scratch74.5
Fix auth bypass vulnerabilitydebugging88.5
Add GraphQL layer over REST APImulti-language77.5
Write integration tests for payment flowcode-review58.5
Zero-downtime schema migrationfull-stack61.8
Add rate limiting middlewarebackend79.0
Implement Stripe webhook handlerbackend70.5
Fix flaky test suitedebugging73.9
Add cursor-based pagination to REST APIbackend82.9
Fix N+1 query in dashboardbackend75.8
Fix memory leak in event handlerdebugging61.1
Code review: identify security vulnscode-review49.0
Refactor monolithic handler to CQRSrefactoring68.8
Debug race condition in worker pooldebugging85.0
Fix React hydration mismatchfrontend67.5
Build terminal UI dashboardfrom-scratch68.6
Build REST API from scratchfrom-scratch87.0