APEX

Minimax M2.5

OpenRouter

205K context | $0.30/M input | $1.20/M output
1453 (peak 1471)

Avg Score: 63.4
Avg Cost: $0.14
Score/$: 463.1
Runs: 123
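
The Score/$ figure ties the average score to the average cost per run. The page does not state its formula, so the following is only a minimal sketch that assumes Score/$ is the score divided by the cost in dollars; the function names are hypothetical. Dividing the displayed averages gives about 452.9 rather than 463.1, which would be consistent with the displayed $0.14 being rounded from roughly $0.137.

```python
def score_per_dollar(avg_score: float, avg_cost_usd: float) -> float:
    """Assumed definition: points of score bought per dollar spent."""
    if avg_cost_usd <= 0:
        raise ValueError("cost must be positive")
    return avg_score / avg_cost_usd


def implied_cost(avg_score: float, score_per_usd: float) -> float:
    """Cost that would reproduce a given Score/$ under the same assumption."""
    if score_per_usd <= 0:
        raise ValueError("score per dollar must be positive")
    return avg_score / score_per_usd


if __name__ == "__main__":
    # Values copied from the summary above.
    print(round(score_per_dollar(63.4, 0.14), 1))
    print(round(implied_cost(63.4, 463.1), 4))
```

If the site instead averages per-run score/cost ratios, the result would also differ from the ratio of averages, so this check cannot distinguish the two explanations.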

[Charts not reproduced: Win/Loss/Draw, Scoring Dimensions, Score Distribution]

Category ELOs

Category | Difficulty | ELO
multi-language | hard | 2110
from-scratch | expert | 2072
frontend | expert | 1878
from-scratch | easy | 1671
frontend | hard | 1641
frontend | easy | 1631
refactoring | expert | 1621
full-stack | medium | 1602
from-scratch | medium | 1593
full-stack | - | 1555
debugging | hard | 1553
full-stack | hard | 1548
frontend | - | 1533
frontend | medium | 1513
debugging | medium | 1506
from-scratch | - | 1483
debugging | - | 1474
backend | hard | 1452
refactoring | - | 1448
debugging | expert | 1444
backend | medium | 1432
backend | - | 1414
refactoring | medium | 1411
code-review | - | 1379
code-review | medium | 1377
multi-language | - | 1372
code-review | hard | 1367
backend | expert | 1329
from-scratch | hard | 1325
backend | easy | 1278
multi-language | expert | 518
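
The page does not say how these ratings are computed. As a rough aid for reading rating gaps, here is a minimal sketch of the standard Elo expected-score formula, assuming the conventional 400-point scale; whether APEX uses this exact variant is an assumption, and the function name is hypothetical.

```python
def elo_expected_score(rating_a: float, rating_b: float, scale: float = 400.0) -> float:
    """Expected score of A against B under the standard Elo model.

    Returns a value in (0, 1); 0.5 means evenly matched.
    """
    return 1.0 / (1.0 + 10 ** ((rating_b - rating_a) / scale))


if __name__ == "__main__":
    # Current rating versus peak rating, both taken from the header above.
    print(round(elo_expected_score(1453, 1471), 3))
```

Under this model an 18-point gap, such as the one between the current and peak ratings, moves the expected score only slightly away from 0.5.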

All Results

Task | Category | Score
Split 1100-line god file into proper modules | refactoring | 71.5
Build SaaS admin dashboard from scratch | from-scratch | 36.5
Build codebase indexer for LLM context windows | from-scratch | 46.9
Find and patch all OWASP Top 10 vulnerabilities | debugging | 65.8
Fix broken responsive layout | frontend | 71.1
Write complex SQL report with window functions | backend | 60.5
Optimize bloated React bundle under 500KB | frontend | 73.2
Implement background job scheduler with persistence | backend | 31.9
Build materialized view refresh pipeline for analytics | backend | 74.5
Optimize slow Postgres queries in Flask app | backend | 69.7
Fix 12 WCAG accessibility violations in checkout form | frontend | 81.6
Implement multi-tenant row-level security in Postgres | backend | 44.8
Fix race conditions in order matching engine | backend | 59.6
Debug race condition in worker pool | debugging | 85.1
Build MCP server for database management | backend | 53.5
Fix Node.js stream backpressure causing OOM on large files | backend | 32.8
Build real-time portfolio risk calculator | backend | 17.6
Add cursor-based pagination to REST API | backend | 72.0
Fix auth bypass vulnerability | debugging | 89.7
Write integration tests for payment flow | code-review | 73.3
Fix flaky test suite | debugging | 64.5
Add slash commands and moderation to Discord bot | backend | 75.9
Add WebSocket real-time updates | full-stack | 73.3
Replace console.log with structured logging | refactoring | 54.6
Fix hallucination and context window bugs in RAG agent | backend | 45.6
Add GraphQL layer over REST API | multi-language | 63.8
Build REST API from scratch | from-scratch | 74.6
Implement zero-trust API authentication layer | backend | 38.5
Add Redis caching layer to Express API | backend | 79.8
Remove AI slop and over-engineering from codebase | refactoring | 75.1
Add Google OAuth2 login to Express app | full-stack | 81.2
Convert React app to PWA with offline support | frontend | 80.2
Fix data integrity bugs in denormalized e-commerce schema | debugging | 74.5
Build terminal UI dashboard | from-scratch | 26.4
Fix N+1 query in dashboard | backend | 45.0
Implement Stripe webhook handler | backend | 53.3
Fix broken GitHub Actions CI pipeline | debugging | 75.7
Write tests for untested legacy Flask service | code-review | 38.1
Add virtual scrolling to table rendering 5000 rows | frontend | 40.3
Fix React hydration mismatch | frontend | 78.0
Build CLI tool with subcommands and config | from-scratch | 23.9
Find and fix 4 hidden backdoors in Flask app | debugging | 90.3
Dockerize Node.js monorepo | full-stack | 71.5
Implement JWT auth middleware | backend | 53.9
Implement transformer inference engine with KV cache | from-scratch | 85.4
Build RAG pipeline with vector search | backend | 42.0
Build LLM evaluation harness with structured grading | backend | 43.5
Add i18n with locale routing to Next.js app | full-stack | 68.3
Write Kubernetes manifests for Node.js microservice | full-stack | 82.2
Fix memory leak in event handler | debugging | 81.0
Add streaming SSE endpoint for LLM chat | backend | 77.3
Fix deadlocking transaction patterns in Flask app | backend | 78.0
Add rate limiting middleware | backend | 43.0
Debug and fix 6 broken database triggers and constraints | debugging | 72.2
Harden insecure Docker setup with 12 vulnerabilities | code-review | 68.0
Build production website with auth and members area | frontend | 67.3
Refactor monolithic handler to CQRS | refactoring | 40.6
Add file upload with S3 presigned URLs | backend | 77.3
Port Python CLI to Rust | multi-language | 20.1
Zero-downtime schema migration | full-stack | 63.0
Add caching layer to eliminate slow SSR page loads | full-stack | 80.1
Add retry logic and dead letter queue to Python task queue | backend | 74.8
Migrate callback-hell Express app to async/await | refactoring | 62.8
Build distributed node cluster with gossip protocol | from-scratch | 29.9
Code review: identify security vulns | code-review | 11.7
Build materialized view refresh pipeline for analytics | backend | 72.6
Fix broken responsive layout | frontend | 59.3
Harden insecure Docker setup with 12 vulnerabilities | code-review | 79.0
Add caching layer to eliminate slow SSR page loads | full-stack | 81.9
Replace console.log with structured logging | refactoring | 40.0
Implement multi-tenant row-level security in Postgres | backend | 70.8
Implement JWT auth middleware | backend | 70.3
Add i18n with locale routing to Next.js app | full-stack | 69.0
Find and patch all OWASP Top 10 vulnerabilities | debugging | 74.0
Build codebase indexer for LLM context windows | from-scratch | 33.0
Add file upload with S3 presigned URLs | backend | 66.0
Split 1100-line god file into proper modules | refactoring | 71.8
Write Kubernetes manifests for Node.js microservice | full-stack | 86.1
Dockerize Node.js monorepo | full-stack | 77.3
Remove AI slop and over-engineering from codebase | refactoring | 78.0
Optimize bloated React bundle under 500KB | frontend | 75.3
Convert React app to PWA with offline support | frontend | 64.9
Implement zero-trust API authentication layer | backend | 67.7
Build CLI tool with subcommands and config | from-scratch | 36.5
Build production website with auth and members area | frontend | 67.2
Build MCP server for database management | backend | 50.1
Implement transformer inference engine with KV cache | from-scratch | 87.7
Build SaaS admin dashboard from scratch | from-scratch | 66.3
Implement background job scheduler with persistence | backend | 62.6
Build real-time portfolio risk calculator | backend | 46.4
Fix hallucination and context window bugs in RAG agent | backend | 69.8
Build LLM evaluation harness with structured grading | backend | 48.5
Fix race conditions in order matching engine | backend | 70.0
Fix data integrity bugs in denormalized e-commerce schema | debugging | 62.9
Write complex SQL report with window functions | backend | 48.8
Fix deadlocking transaction patterns in Flask app | backend | 58.8
Debug and fix 6 broken database triggers and constraints | debugging | 57.0
Find and fix 4 hidden backdoors in Flask app | debugging | 55.8
Write tests for untested legacy Flask service | code-review | 36.6
Add Google OAuth2 login to Express app | full-stack | 69.2
Optimize slow Postgres queries in Flask app | backend | 66.8
Add slash commands and moderation to Discord bot | backend | 67.2
Add retry logic and dead letter queue to Python task queue | backend | 57.6
Fix Node.js stream backpressure causing OOM on large files | backend | 71.2
Fix 12 WCAG accessibility violations in checkout form | frontend | 80.8
Add virtual scrolling to table rendering 5000 rows | frontend | 65.8
Build distributed node cluster with gossip protocol | from-scratch | 31.9
Write integration tests for payment flow | code-review | 55.8
Add GraphQL layer over REST API | multi-language | 81.3
Fix auth bypass vulnerability | debugging | 92.6
Add rate limiting middleware | backend | 69.8
Zero-downtime schema migration | full-stack | 69.0
Implement Stripe webhook handler | backend | 67.5
Fix flaky test suite | debugging | 80.9
Add cursor-based pagination to REST API | backend | 63.6
Fix N+1 query in dashboard | backend | 78.2
Build terminal UI dashboard | from-scratch | 61.5
Fix memory leak in event handler | debugging | 62.8
Code review: identify security vulns | code-review | 78.3
Refactor monolithic handler to CQRS | refactoring | 64.7
Debug race condition in worker pool | debugging | 80.8
Fix React hydration mismatch | frontend | 74.8
Build REST API from scratch | from-scratch | 84.6
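
Many tasks in the results above appear more than once, and the scores for the same task can differ widely between runs. The following is a minimal sketch of how repeated runs could be grouped to see that spread. It uses a handful of rows copied from the results; the tuple layout and the function name are hypothetical, not part of the site.

```python
from collections import defaultdict
from statistics import mean

# (task, category, score) rows copied from the results above.
ROWS = [
    ("Code review: identify security vulns", "code-review", 11.7),
    ("Code review: identify security vulns", "code-review", 78.3),
    ("Fix auth bypass vulnerability", "debugging", 89.7),
    ("Fix auth bypass vulnerability", "debugging", 92.6),
    ("Port Python CLI to Rust", "multi-language", 20.1),
]


def summarize_by_task(rows):
    """Group scores by task and report run count, mean, and max-min spread."""
    grouped = defaultdict(list)
    for task, _category, score in rows:
        grouped[task].append(score)
    return {
        task: {
            "runs": len(scores),
            "mean": round(mean(scores), 2),
            "spread": round(max(scores) - min(scores), 2),
        }
        for task, scores in grouped.items()
    }


if __name__ == "__main__":
    for task, stats in summarize_by_task(ROWS).items():
        print(task, stats)
```

A large spread on a task with only two runs, as with the security code review, suggests its mean should be read with caution.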