APEX
Back to models

Gemini 2.5 Flash Lite

Google

1000K context$0.10/M input$0.40/M output
1255peak 1271

Avg Score

57.1

Avg Cost

$0.02

Score/$

2552.2

Runs

65

Win/Loss/Draw

Scoring Dimensions

Score Distribution

Category ELOs

multi-languageexpert
1615
from-scratcheasy
1420
refactoringmedium
1304
code-reviewmedium
1299
debugginghard
1297
frontendmedium
1294
refactoring
1273
from-scratchhard
1259
backend
1257
backendhard
1249
multi-language
1244
debuggingmedium
1244
backendmedium
1241
backendexpert
1240
code-review
1237
from-scratch
1229
debugging
1227
full-stackmedium
1209
frontend
1199
full-stack
1156
debuggingexpert
1068
full-stackhard
1066
frontendeasy
761
backendeasy
698
frontendhard
635
frontendexpert
144
refactoringexpert
117
code-reviewhard
22
from-scratchexpert
6
from-scratchmedium
0
multi-languagehard
0

All Results

TaskCategoryScore
Add file upload with S3 presigned URLsbackend50.6
Fix auth bypass vulnerabilitydebugging93.0
Build CLI tool with subcommands and configfrom-scratch67.1
Add Redis caching layer to Express APIbackend65.3
Add virtual scrolling to table rendering 5000 rowsfrontend79.3
Build SaaS admin dashboard from scratchfrom-scratch38.9
Build RAG pipeline with vector searchbackend83.7
Fix Node.js stream backpressure causing OOM on large filesbackend93.2
Implement background job scheduler with persistencebackend59.1
Build materialized view refresh pipeline for analyticsbackend70.6
Add retry logic and dead letter queue to Python task queuebackend73.0
Add Google OAuth2 login to Express appfull-stack67.5
Code review: identify security vulnscode-review77.5
Fix broken GitHub Actions CI pipelinedebugging52.8
Implement zero-trust API authentication layerbackend62.3
Add caching layer to eliminate slow SSR page loadsfull-stack65.5
Add i18n with locale routing to Next.js appfull-stack19.3
Remove AI slop and over-engineering from codebaserefactoring65.0
Optimize bloated React bundle under 500KBfrontend68.5
Build codebase indexer for LLM context windowsfrom-scratch47.1
Add streaming SSE endpoint for LLM chatbackend67.5
Find and patch all OWASP Top 10 vulnerabilitiesdebugging37.8
Implement multi-tenant row-level security in Postgresbackend33.4
Split 1100-line god file into proper modulesrefactoring50.8
Harden insecure Docker setup with 12 vulnerabilitiescode-review70.0
Write Kubernetes manifests for Node.js microservicefull-stack80.4
Convert React app to PWA with offline supportfrontend40.9
Fix broken responsive layoutfrontend60.1
Replace console.log with structured loggingrefactoring69.3
Implement JWT auth middlewarebackend34.8
Dockerize Node.js monorepofull-stack67.0
Port Python CLI to Rustmulti-language58.9
Migrate callback-hell Express app to async/awaitrefactoring73.8
Write complex SQL report with window functionsbackend50.8
Build MCP server for database managementbackend51.8
Implement transformer inference engine with KV cachefrom-scratch37.5
Build production website with auth and members areafrontend45.3
Fix deadlocking transaction patterns in Flask appbackend68.0
Fix N+1 query in dashboardbackend45.0
Build real-time portfolio risk calculatorbackend57.2
Debug and fix 6 broken database triggers and constraintsdebugging74.4
Build LLM evaluation harness with structured gradingbackend44.9
Optimize slow Postgres queries in Flask appbackend61.1
Zero-downtime schema migrationfull-stack53.0
Add cursor-based pagination to REST APIbackend27.8
Write integration tests for payment flowcode-review37.8
Build distributed node cluster with gossip protocolfrom-scratch46.5
Add slash commands and moderation to Discord botbackend51.0
Write tests for untested legacy Flask servicecode-review45.5
Fix 12 WCAG accessibility violations in checkout formfrontend64.3
Fix flaky test suitedebugging81.2
Add rate limiting middlewarebackend46.3
Find and fix 4 hidden backdoors in Flask appdebugging67.3
Implement Stripe webhook handlerbackend48.3
Add GraphQL layer over REST APImulti-language29.8
Fix hallucination and context window bugs in RAG agentbackend29.5
Fix data integrity bugs in denormalized e-commerce schemadebugging45.5
Refactor monolithic handler to CQRSrefactoring34.7
Fix memory leak in event handlerdebugging59.9
Add WebSocket real-time updatesfull-stack62.7
Debug race condition in worker pooldebugging57.3
Fix React hydration mismatchfrontend65.3
Build terminal UI dashboardfrom-scratch21.3
Build REST API from scratchfrom-scratch79.5
Fix race conditions in order matching enginebackend77.3