APEX
Back to models

Minimax M3

OpenRouter

1049K context$0.30/M input$1.20/M output
1741peak 1756

Avg Score

83.9

Avg Cost

$0.15

Score/$

577.7

Runs

70

Win/Loss/Draw

Scoring Dimensions

Score Distribution

Category ELOs

multi-languageexpert
2921
refactoringexpert
2805
code-reviewhard
2385
from-scratchmedium
2359
frontendexpert
2268
multi-languagehard
2213
frontendeasy
2191
backendeasy
2119
multi-language
2013
refactoring
1964
refactoringmedium
1931
from-scratchhard
1902
debuggingexpert
1890
backendexpert
1876
from-scratcheasy
1856
code-review
1848
frontendmedium
1845
code-reviewmedium
1824
full-stackhard
1777
from-scratch
1767
frontend
1744
backend
1737
full-stack
1735
from-scratchexpert
1735
backendhard
1722
full-stackmedium
1698
backendmaster
1697
backendmedium
1678
debugging
1642
debugginghard
1592
debuggingmedium
1561
frontendmaster
1542
frontendhard
1498

All Results

TaskCategoryScore
Build 3D browser game with physics and multiplayer syncfrontend70.7
Add streaming SSE endpoint for LLM chatbackend90.4
Fix memory leak in event handlerdebugging72.7
Port Python CLI to Rustmulti-language88.6
Implement zero-trust API authentication layerbackend83.4
Build RAG pipeline with vector searchbackend86.0
Fix N+1 query in dashboardbackend75.5
Migrate Express monolith to modular architecturebackend87.3
Add i18n with locale routing to Next.js appfull-stack80.7
Fix and extend Chrome browser extensionfrontend68.0
Build interactive data visualization dashboardfrontend79.5
Add WebSocket real-time updatesfull-stack86.3
Implement multi-tenant row-level security in Postgresbackend87.3
Fix broken responsive layoutfrontend83.6
Build LLM evaluation harness with structured gradingbackend75.1
Build codebase indexer for LLM context windowsfrom-scratch73.2
Optimize slow Postgres queries in Flask appbackend74.9
Fix flaky test suitedebugging72.8
Fix 12 WCAG accessibility violations in checkout formfrontend78.5
Split 1100-line god file into proper modulesrefactoring86.5
Write integration tests for payment flowcode-review88.6
Build production website with auth and members areafrontend78.4
Build terminal UI dashboardfrom-scratch79.0
Add virtual scrolling to table rendering 5000 rowsfrontend87.3
Build SaaS admin dashboard from scratchfrom-scratch72.8
Refactor monolithic handler to CQRSrefactoring91.5
Convert React app to PWA with offline supportfrontend81.6
Optimize bloated React bundle under 500KBfrontend84.7
Write tests for untested legacy Flask servicecode-review89.9
Harden insecure Docker setup with 12 vulnerabilitiescode-review92.3
Fix Node.js stream backpressure causing OOM on large filesbackend82.1
Fix hallucination and context window bugs in RAG agentbackend74.8
Add Redis caching layer to Express APIbackend90.3
Implement transformer inference engine with KV cachefrom-scratch81.0
Find and fix 4 hidden backdoors in Flask appdebugging88.7
Write complex SQL report with window functionsbackend90.9
Add retry logic and dead letter queue to Python task queuebackend80.8
Remove AI slop and over-engineering from codebaserefactoring87.7
Write Kubernetes manifests for Node.js microservicefull-stack90.8
Fix deadlocking transaction patterns in Flask appbackend88.3
Build distributed node cluster with gossip protocolfrom-scratch83.3
Add cursor-based pagination to REST APIbackend78.3
Fix data integrity bugs in denormalized e-commerce schemadebugging91.5
Implement background job scheduler with persistencebackend83.5
Debug and fix 6 broken database triggers and constraintsdebugging90.3
Fix auth bypass vulnerabilitydebugging91.0
Build CLI tool with subcommands and configfrom-scratch83.0
Build MCP server for database managementbackend87.3
Find and patch all OWASP Top 10 vulnerabilitiesdebugging89.1
Fix broken GitHub Actions CI pipelinedebugging94.4
Implement JWT auth middlewarebackend90.0
Add rate limiting middlewarebackend84.4
Zero-downtime schema migrationfull-stack84.4
Build materialized view refresh pipeline for analyticsbackend76.8
Add file upload with S3 presigned URLsbackend81.3
Add Google OAuth2 login to Express appfull-stack81.6
Add caching layer to eliminate slow SSR page loadsfull-stack82.8
Build real-time portfolio risk calculatorbackend86.2
Dockerize Node.js monorepofull-stack80.5
Add slash commands and moderation to Discord botbackend78.9
Fix React hydration mismatchfrontend89.1
Fix race conditions in order matching enginebackend90.3
Replace console.log with structured loggingrefactoring92.0
Migrate callback-hell Express app to async/awaitrefactoring90.8
Build multi-tool LLM agent runtimebackend84.9
Implement Stripe webhook handlerbackend85.7
Add GraphQL layer over REST APImulti-language87.0
Code review: identify security vulnscode-review79.3
Build REST API from scratchfrom-scratch84.1
Debug race condition in worker pooldebugging89.7