APEX

Minimax M2.5

OpenRouter

205K context, $0.30/M input, $1.20/M output
1358 (peak 1373)

Avg Score: 64.7
Avg Cost: $0.09
Score/$: 712.3
Runs: 65
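
The summary figures above can be related to the listed token prices with a little arithmetic. The following is a minimal sketch, not the site's actual method: it assumes that a run's cost is simply token count times the listed price, and that Score/$ is the average score divided by the average cost. The function names and the token counts in the example are hypothetical.

```python
# Prices copied from the model card above (USD per million tokens).
INPUT_PRICE_PER_M = 0.30
OUTPUT_PRICE_PER_M = 1.20


def run_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one run from its token counts."""
    total = input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    return total / 1_000_000


def score_per_dollar(avg_score: float, avg_cost: float) -> float:
    """Assumed definition: average score divided by average cost per run."""
    return avg_score / avg_cost


# Hypothetical token counts, chosen only to illustrate the arithmetic.
example_cost = run_cost(input_tokens=100_000, output_tokens=50_000)
print(f"example run cost: ${example_cost:.2f}")  # $0.09

# Ratio computed from the rounded figures shown on the page.
print(f"score/$ from rounded figures: {score_per_dollar(64.7, 0.09):.1f}")  # 718.9

# Average cost that the reported Score/$ of 712.3 would imply.
print(f"implied avg cost: ${64.7 / 712.3:.4f}")  # $0.0908
```

The rounded figures give about 718.9 rather than the reported 712.3. One plausible reading, offered only as an assumption, is that the page computes the ratio from unrounded values, in which case the true average cost would be close to $0.0908.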

[Charts not captured: Win/Loss/Draw, Scoring Dimensions, Score Distribution]

Category ELOs

Category (difficulty) | ELO
from-scratch (expert) | 1977
frontend (hard) | 1677
debugging (hard) | 1595
code-review (hard) | 1518
frontend (expert) | 1465
frontend | 1448
backend (expert) | 1427
frontend (medium) | 1426
full-stack | 1423
full-stack (hard) | 1417
full-stack (medium) | 1414
debugging | 1405
backend (medium) | 1393
frontend (easy) | 1339
code-review | 1335
refactoring (medium) | 1325
backend | 1322
from-scratch | 1307
refactoring | 1305
code-review (medium) | 1285
debugging (expert) | 1278
from-scratch (easy) | 1249
from-scratch (hard) | 1215
multi-language | 1197
backend (hard) | 1163
debugging (medium) | 1146
multi-language (hard) | 955
multi-language (expert) | 743
backend (easy) | 582
refactoring (expert) | 387
from-scratch (medium) | 0

All Results

Task | Category | Score
Port Python CLI to Rust | multi-language | 39.6
Build SaaS admin dashboard from scratch | from-scratch | 53.0
Implement zero-trust API authentication layer | backend | 81.7
Build real-time portfolio risk calculator | backend | 79.0
Build codebase indexer for LLM context windows | from-scratch | 70.1
Code review: identify security vulns | code-review | 81.3
Split 1100-line god file into proper modules | refactoring | 71.5
Find and patch all OWASP Top 10 vulnerabilities | debugging | 65.8
Fix broken responsive layout | frontend | 71.1
Write complex SQL report with window functions | backend | 60.5
Optimize bloated React bundle under 500KB | frontend | 73.2
Implement background job scheduler with persistence | backend | 31.9
Build materialized view refresh pipeline for analytics | backend | 72.6
Optimize slow Postgres queries in Flask app | backend | 69.7
Fix 12 WCAG accessibility violations in checkout form | frontend | 81.6
Implement multi-tenant row-level security in Postgres | backend | 44.8
Fix race conditions in order matching engine | backend | 59.6
Debug race condition in worker pool | debugging | 85.1
Build MCP server for database management | backend | 53.5
Fix Node.js stream backpressure causing OOM on large files | backend | 32.8
Add cursor-based pagination to REST API | backend | 72.0
Fix auth bypass vulnerability | debugging | 89.7
Write integration tests for payment flow | code-review | 73.3
Fix flaky test suite | debugging | 64.5
Add slash commands and moderation to Discord bot | backend | 75.9
Add WebSocket real-time updates | full-stack | 73.3
Replace console.log with structured logging | refactoring | 54.6
Fix hallucination and context window bugs in RAG agent | backend | 45.6
Add GraphQL layer over REST API | multi-language | 63.8
Build REST API from scratch | from-scratch | 74.6
Add Redis caching layer to Express API | backend | 79.8
Remove AI slop and over-engineering from codebase | refactoring | 75.1
Add Google OAuth2 login to Express app | full-stack | 81.2
Convert React app to PWA with offline support | frontend | 80.2
Fix data integrity bugs in denormalized e-commerce schema | debugging | 74.5
Build terminal UI dashboard | from-scratch | 31.6
Fix N+1 query in dashboard | backend | 45.0
Implement Stripe webhook handler | backend | 53.3
Fix broken GitHub Actions CI pipeline | debugging | 75.7
Write tests for untested legacy Flask service | code-review | 38.1
Add virtual scrolling to table rendering 5000 rows | frontend | 40.3
Fix React hydration mismatch | frontend | 78.0
Build CLI tool with subcommands and config | from-scratch | 29.4
Find and fix 4 hidden backdoors in Flask app | debugging | 90.3
Dockerize Node.js monorepo | full-stack | 71.5
Implement JWT auth middleware | backend | 53.9
Implement transformer inference engine with KV cache | from-scratch | 85.4
Build RAG pipeline with vector search | backend | 42.0
Build LLM evaluation harness with structured grading | backend | 43.5
Add i18n with locale routing to Next.js app | full-stack | 68.3
Write Kubernetes manifests for Node.js microservice | full-stack | 82.2
Fix memory leak in event handler | debugging | 81.0
Add streaming SSE endpoint for LLM chat | backend | 77.3
Fix deadlocking transaction patterns in Flask app | backend | 78.0
Add rate limiting middleware | backend | 43.0
Debug and fix 6 broken database triggers and constraints | debugging | 72.2
Harden insecure Docker setup with 12 vulnerabilities | code-review | 68.0
Build production website with auth and members area | frontend | 67.3
Refactor monolithic handler to CQRS | refactoring | 40.6
Add file upload with S3 presigned URLs | backend | 77.3
Zero-downtime schema migration | full-stack | 63.0
Add caching layer to eliminate slow SSR page loads | full-stack | 80.1
Add retry logic and dead letter queue to Python task queue | backend | 74.8
Migrate callback-hell Express app to async/await | refactoring | 62.8
Build distributed node cluster with gossip protocol | from-scratch | 36.6
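
As a cross-check on the summary figures, the per-task scores listed under All Results can be averaged directly. This is a small sketch that assumes Avg Score is the unweighted mean of the listed runs; the page does not state how it is computed. The scores are transcribed from the rows above in page order.

```python
from statistics import mean

# Scores transcribed from the All Results rows, in page order.
scores = [
    39.6, 53.0, 81.7, 79.0, 70.1, 81.3, 71.5, 65.8, 71.1, 60.5,
    73.2, 31.9, 72.6, 69.7, 81.6, 44.8, 59.6, 85.1, 53.5, 32.8,
    72.0, 89.7, 73.3, 64.5, 75.9, 73.3, 54.6, 45.6, 63.8, 74.6,
    79.8, 75.1, 81.2, 80.2, 74.5, 31.6, 45.0, 53.3, 75.7, 38.1,
    40.3, 78.0, 29.4, 90.3, 71.5, 53.9, 85.4, 42.0, 43.5, 68.3,
    82.2, 81.0, 77.3, 78.0, 43.0, 72.2, 68.0, 67.3, 40.6, 77.3,
    63.0, 80.1, 74.8, 62.8, 36.6,
]

print(len(scores))             # 65, matching the Runs figure
print(round(mean(scores), 1))  # 64.7, matching Avg Score
print(max(scores), min(scores))  # 90.3 29.4
```

Under that assumption the listed rows agree with both the Runs count of 65 and the Avg Score of 64.7.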