APEX
Back to models

Minimax M2.7 [NVFP4]

SGLang

197K context<$0.01/M input<$0.01/M output
1621peak 1638

Avg Score

79.4

Avg Cost

Score/$

Runs

46

Win/Loss/Draw

Scoring Dimensions

Score Distribution

Category ELOs

multi-languageexpert
2353
refactoringexpert
2296
from-scratcheasy
2126
backendeasy
1952
multi-language
1835
from-scratchhard
1834
code-reviewmedium
1751
refactoring
1734
full-stackhard
1705
refactoringmedium
1705
full-stack
1671
debuggingexpert
1657
backendhard
1651
backendexpert
1650
debugginghard
1648
debugging
1632
code-review
1610
backend
1609
frontendmedium
1608
from-scratch
1601
from-scratchmedium
1592
full-stackmedium
1590
backendmedium
1552
frontend
1549
frontendhard
1446
backendmaster
1424
code-reviewhard
1363
frontendmaster
1148
from-scratchexpert
1003

All Results

TaskCategoryScore
Migrate Express monolith to modular architecturebackend70.3
Implement JWT auth middlewarebackend86.7
Build REST API from scratchfrom-scratch88.7
Write integration tests for payment flowcode-review70.3
Build CLI tool with subcommands and configfrom-scratch74.3
Build codebase indexer for LLM context windowsfrom-scratch77.7
Fix and extend Chrome browser extensionfrontend52.0
Add Redis caching layer to Express APIbackend83.4
Optimize slow Postgres queries in Flask appbackend84.2
Implement multi-tenant row-level security in Postgresbackend78.8
Build materialized view refresh pipeline for analyticsbackend77.3
Fix N+1 query in dashboardbackend74.7
Fix React hydration mismatchfrontend85.8
Split 1100-line god file into proper modulesrefactoring84.0
Remove AI slop and over-engineering from codebaserefactoring81.9
Fix auth bypass vulnerabilitydebugging91.0
Harden insecure Docker setup with 12 vulnerabilitiescode-review86.5
Write Kubernetes manifests for Node.js microservicefull-stack86.3
Build multi-tool LLM agent runtimebackend79.9
Fix hallucination and context window bugs in RAG agentbackend80.8
Fix 12 WCAG accessibility violations in checkout formfrontend77.7
Refactor monolithic handler to CQRSrefactoring82.1
Write tests for untested legacy Flask servicecode-review84.3
Fix race conditions in order matching enginebackend85.8
Build terminal UI dashboardfrom-scratch64.9
Build MCP server for database managementbackend81.5
Debug and fix 6 broken database triggers and constraintsdebugging86.8
Migrate callback-hell Express app to async/awaitrefactoring85.1
Implement background job scheduler with persistencebackend68.3
Convert React app to PWA with offline supportfrontend74.3
Implement zero-trust API authentication layerbackend73.5
Implement transformer inference engine with KV cachefrom-scratch70.9
Build RAG pipeline with vector searchbackend76.5
Write complex SQL report with window functionsbackend82.4
Fix Node.js stream backpressure causing OOM on large filesbackend70.8
Zero-downtime schema migrationfull-stack81.5
Find and fix 4 hidden backdoors in Flask appdebugging91.3
Fix data integrity bugs in denormalized e-commerce schemadebugging85.2
Add rate limiting middlewarebackend80.8
Replace console.log with structured loggingrefactoring77.2
Add Google OAuth2 login to Express appfull-stack77.7
Optimize bloated React bundle under 500KBfrontend75.5
Add virtual scrolling to table rendering 5000 rowsfrontend81.8
Port Python CLI to Rustmulti-language77.5
Add cursor-based pagination to REST APIbackend77.0
Add WebSocket real-time updatesfull-stack86.8