APEX
Back to models

Minimax M2.5 [Q4_K_XL]

LM Studio

197K context<$0.01/M input<$0.01/M output
1346peak 1362

Avg Score

63.9

Avg Cost

$0.03

Score/$

2128.9

Runs

39

Win/Loss/Draw

Scoring Dimensions

Score Distribution

Category ELOs

from-scratcheasy
2188
frontendexpert
2103
multi-languagehard
1844
frontendhard
1757
frontend
1608
multi-language
1585
code-reviewmedium
1559
frontendmedium
1541
backendeasy
1539
debugginghard
1463
backendhard
1343
from-scratch
1330
backendmedium
1313
code-review
1308
backend
1297
debugging
1287
refactoring
1247
full-stack
1171
full-stackhard
1160
from-scratchhard
1158
backendexpert
1088
from-scratchmedium
1039
debuggingexpert
941
from-scratchexpert
936
refactoringexpert
856
debuggingmedium
746
code-reviewhard
22

All Results

TaskCategoryScore
Implement background job scheduler with persistencebackend37.0
Build MCP server for database managementbackend47.0
Implement transformer inference engine with KV cachefrom-scratch67.7
Build CLI tool with subcommands and configfrom-scratch41.5
Build production website with auth and members areafrontend75.5
Build SaaS admin dashboard from scratchfrom-scratch63.2
Fix data integrity bugs in denormalized e-commerce schemadebugging52.9
Build terminal UI dashboardfrom-scratch55.6
Build real-time portfolio risk calculatorbackend21.8
Write tests for untested legacy Flask servicecode-review65.9
Add slash commands and moderation to Discord botbackend57.1
Fix deadlocking transaction patterns in Flask appbackend68.5
Implement Stripe webhook handlerbackend61.2
Build REST API from scratchfrom-scratch90.1
Fix N+1 query in dashboardbackend64.5
Fix 12 WCAG accessibility violations in checkout formfrontend82.9
Add retry logic and dead letter queue to Python task queuebackend63.5
Fix auth bypass vulnerabilitydebugging93.7
Refactor monolithic handler to CQRSrefactoring51.8
Fix hallucination and context window bugs in RAG agentbackend67.7
Fix race conditions in order matching enginebackend66.5
Debug and fix 6 broken database triggers and constraintsdebugging58.4
Add Redis caching layer to Express APIbackend75.8
Fix flaky test suitedebugging44.8
Optimize slow Postgres queries in Flask appbackend81.7
Fix Node.js stream backpressure causing OOM on large filesbackend81.8
Fix React hydration mismatchfrontend77.7
Build distributed node cluster with gossip protocolfrom-scratch30.5
Find and fix 4 hidden backdoors in Flask appdebugging92.5
Debug race condition in worker pooldebugging68.7
Write integration tests for payment flowcode-review39.5
Add virtual scrolling to table rendering 5000 rowsfrontend79.0
Build LLM evaluation harness with structured gradingbackend41.6
Fix memory leak in event handlerdebugging61.1
Write complex SQL report with window functionsbackend78.5
Add rate limiting middlewarebackend75.4
Add cursor-based pagination to REST APIbackend67.6
Zero-downtime schema migrationfull-stack62.4
Add GraphQL layer over REST APImulti-language80.5