APEX

Kimi K2.5

OpenRouter

262K context, $0.45/M input, $2.25/M output
1493 (peak 1509)

Avg Score: 72.7
Avg Cost: $0.04
Score/$: 1791.8
Runs: 65
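The page does not say how Score/$ is derived. A minimal sketch, assuming it is the average score divided by the average cost per run, shows that the displayed (rounded) figures do not reproduce 1791.8 exactly, which would be consistent with an unrounded average cost of roughly $0.0406. The function name `score_per_dollar` is hypothetical and used only for illustration.

```python
# Figures as displayed on the page.
avg_score = 72.7
avg_cost_displayed = 0.04            # shown rounded to the cent
score_per_dollar_displayed = 1791.8


def score_per_dollar(score: float, cost: float) -> float:
    """Assumed definition: average score divided by average cost in dollars."""
    return score / cost


# Using the rounded cost gives a value above the displayed 1791.8.
naive = score_per_dollar(avg_score, avg_cost_displayed)

# Working backwards gives the average cost the displayed ratio would imply.
implied_cost = avg_score / score_per_dollar_displayed

print(round(naive, 1))         # 1817.5
print(round(implied_cost, 4))  # 0.0406
```

If the assumption holds, the gap between 1817.5 and 1791.8 comes only from rounding the average cost for display.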

Category ELOs

Category | Difficulty | ELO
multi-language | hard | 2030
from-scratch | expert | 2024
backend | easy | 1985
from-scratch | medium | 1928
frontend | hard | 1811
debugging | hard | 1635
full-stack | hard | 1632
full-stack | - | 1614
full-stack | medium | 1596
debugging | medium | 1595
code-review | hard | 1567
multi-language | - | 1553
debugging | - | 1542
backend | medium | 1526
backend | expert | 1513
backend | - | 1508
from-scratch | - | 1491
frontend | - | 1483
frontend | medium | 1473
backend | hard | 1461
debugging | expert | 1451
frontend | expert | 1440
code-review | - | 1435
code-review | medium | 1417
from-scratch | hard | 1376
from-scratch | easy | 1307
frontend | easy | 1288
refactoring | - | 1204
refactoring | medium | 1195
multi-language | expert | 1135
refactoring | expert | 475
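The page does not describe how these ratings are computed. As a rough guide to reading rating gaps, the sketch below uses the conventional Elo expected-score formula with a 400-point scale. That APEX uses this exact formula is an assumption, and the 1500-rated opponent is hypothetical.

```python
def expected_score(rating_a: float, rating_b: float, scale: float = 400.0) -> float:
    """Conventional Elo expected score of A against B.

    Assumption: standard logistic form with a 400-point scale; the page
    does not confirm which rating formula it uses.
    """
    return 1.0 / (1.0 + 10 ** ((rating_b - rating_a) / scale))


# Overall rating shown on the page against a hypothetical 1500-rated opponent.
overall = 1493
hypothetical_opponent = 1500
p = expected_score(overall, hypothetical_opponent)
print(f"expected score vs. {hypothetical_opponent}: {p:.3f}")

# Equal ratings always give an expected score of 0.5.
print(expected_score(1500, 1500))
```

Under this formula a small deficit such as 7 points gives an expected score just under 0.5, and a 400-point lead gives roughly 0.91.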

All Results

Task | Category | Score
Add Google OAuth2 login to Express app | full-stack | 82.5
Build LLM evaluation harness with structured grading | backend | 73.8
Find and patch all OWASP Top 10 vulnerabilities | debugging | 69.3
Write integration tests for payment flow | code-review | 75.0
Fix hallucination and context window bugs in RAG agent | backend | 49.1
Build distributed node cluster with gossip protocol | from-scratch | 66.0
Add i18n with locale routing to Next.js app | full-stack | 74.7
Implement multi-tenant row-level security in Postgres | backend | 73.7
Port Python CLI to Rust | multi-language | 47.9
Debug race condition in worker pool | debugging | 89.5
Build real-time portfolio risk calculator | backend | 74.9
Build SaaS admin dashboard from scratch | from-scratch | 55.6
Fix race conditions in order matching engine | backend | 74.5
Add cursor-based pagination to REST API | backend | 80.8
Fix broken responsive layout | frontend | 70.1
Find and fix 4 hidden backdoors in Flask app | debugging | 93.8
Convert React app to PWA with offline support | frontend | 66.3
Implement transformer inference engine with KV cache | from-scratch | 86.0
Fix N+1 query in dashboard | backend | 53.4
Fix deadlocking transaction patterns in Flask app | backend | 77.4
Add rate limiting middleware | backend | 81.7
Fix memory leak in event handler | debugging | 81.8
Add retry logic and dead letter queue to Python task queue | backend | 82.9
Fix Node.js stream backpressure causing OOM on large files | backend | 90.3
Write Kubernetes manifests for Node.js microservice | full-stack | 90.2
Add GraphQL layer over REST API | multi-language | 84.1
Code review: identify security vulns | code-review | 79.0
Zero-downtime schema migration | full-stack | 76.1
Debug and fix 6 broken database triggers and constraints | debugging | 71.9
Build terminal UI dashboard | from-scratch | 71.2
Fix 12 WCAG accessibility violations in checkout form | frontend | 84.3
Build MCP server for database management | backend | 84.0
Fix broken GitHub Actions CI pipeline | debugging | 83.3
Fix auth bypass vulnerability | debugging | 88.5
Split 1100-line god file into proper modules | refactoring | 63.8
Implement JWT auth middleware | backend | 53.8
Optimize slow Postgres queries in Flask app | backend | 78.4
Add slash commands and moderation to Discord bot | backend | 72.7
Add file upload with S3 presigned URLs | backend | 79.8
Fix React hydration mismatch | frontend | 83.3
Add WebSocket real-time updates | full-stack | 84.4
Build production website with auth and members area | frontend | 67.1
Build codebase indexer for LLM context windows | from-scratch | 27.5
Add virtual scrolling to table rendering 5000 rows | frontend | 54.0
Replace console.log with structured logging | refactoring | 44.7
Fix data integrity bugs in denormalized e-commerce schema | debugging | 85.0
Implement Stripe webhook handler | backend | 84.4
Harden insecure Docker setup with 12 vulnerabilities | code-review | 76.3
Dockerize Node.js monorepo | full-stack | 75.3
Implement zero-trust API authentication layer | backend | 67.5
Add streaming SSE endpoint for LLM chat | backend | 81.8
Optimize bloated React bundle under 500KB | frontend | 80.5
Build materialized view refresh pipeline for analytics | backend | 76.7
Fix flaky test suite | debugging | 87.5
Add caching layer to eliminate slow SSR page loads | full-stack | 81.2
Write tests for untested legacy Flask service | code-review | 53.3
Build RAG pipeline with vector search | backend | 49.5
Remove AI slop and over-engineering from codebase | refactoring | 74.8
Implement background job scheduler with persistence | backend | 58.0
Migrate callback-hell Express app to async/await | refactoring | 58.0
Write complex SQL report with window functions | backend | 66.8
Build CLI tool with subcommands and config | from-scratch | 74.5
Refactor monolithic handler to CQRS | refactoring | 42.6
Add Redis caching layer to Express API | backend | 85.2
Build REST API from scratch | from-scratch | 77.2