APEX
Back to models

Minimax M2.5 [Q4_K_XL]

LM Studio

197K context<$0.01/M input<$0.01/M output
1395peak 1416

Avg Score

63.6

Avg Cost

$0.03

Score/$

2118.8

Runs

39

Win/Loss/Draw

Scoring Dimensions

Score Distribution

Category ELOs

frontendexpert
2182
multi-languagehard
2110
from-scratcheasy
1972
multi-language
1915
code-reviewmedium
1723
frontendhard
1688
frontend
1662
frontendmedium
1628
backendeasy
1594
debugginghard
1475
code-review
1473
backendhard
1453
from-scratch
1405
backendmedium
1363
backend
1357
debugging
1315
from-scratchmedium
1292
full-stack
1265
full-stackhard
1261
from-scratchhard
1227
refactoring
1200
backendexpert
1119
debuggingexpert
1066
from-scratchexpert
1024
refactoringexpert
872
code-reviewhard
307
debuggingmedium
0

All Results

TaskCategoryScore
Implement background job scheduler with persistencebackend37.0
Build MCP server for database managementbackend47.0
Implement transformer inference engine with KV cachefrom-scratch67.7
Build CLI tool with subcommands and configfrom-scratch39.5
Build production website with auth and members areafrontend72.5
Build SaaS admin dashboard from scratchfrom-scratch63.2
Fix data integrity bugs in denormalized e-commerce schemadebugging52.9
Build terminal UI dashboardfrom-scratch55.6
Build real-time portfolio risk calculatorbackend21.8
Write tests for untested legacy Flask servicecode-review65.9
Add slash commands and moderation to Discord botbackend57.1
Fix deadlocking transaction patterns in Flask appbackend68.5
Implement Stripe webhook handlerbackend61.2
Build REST API from scratchfrom-scratch90.1
Fix N+1 query in dashboardbackend64.5
Fix 12 WCAG accessibility violations in checkout formfrontend82.9
Add retry logic and dead letter queue to Python task queuebackend63.5
Fix auth bypass vulnerabilitydebugging93.7
Refactor monolithic handler to CQRSrefactoring51.8
Fix hallucination and context window bugs in RAG agentbackend67.7
Fix race conditions in order matching enginebackend66.5
Debug and fix 6 broken database triggers and constraintsdebugging58.4
Add Redis caching layer to Express APIbackend75.8
Fix flaky test suitedebugging44.8
Optimize slow Postgres queries in Flask appbackend81.7
Fix Node.js stream backpressure causing OOM on large filesbackend81.8
Fix React hydration mismatchfrontend77.7
Build distributed node cluster with gossip protocolfrom-scratch23.7
Find and fix 4 hidden backdoors in Flask appdebugging92.5
Debug race condition in worker pooldebugging68.7
Write integration tests for payment flowcode-review39.5
Add virtual scrolling to table rendering 5000 rowsfrontend79.0
Build LLM evaluation harness with structured gradingbackend41.6
Fix memory leak in event handlerdebugging61.1
Write complex SQL report with window functionsbackend78.5
Add rate limiting middlewarebackend75.4
Add cursor-based pagination to REST APIbackend67.6
Zero-downtime schema migrationfull-stack62.4
Add GraphQL layer over REST APImulti-language80.5