APEX
Back to models

Trinity Large Preview:free

OpenRouter

131K context<$0.01/M input<$0.01/M output
1292peak 1297

Avg Score

51.3

Avg Cost

$0.34

Score/$

150.0

Runs

118

Win/Loss/Draw

Scoring Dimensions

Score Distribution

Category ELOs

from-scratchexpert
1809
frontendhard
1721
debuggingexpert
1695
debuggingmedium
1632
debugging
1462
backendmedium
1380
full-stack
1369
full-stackmedium
1363
debugginghard
1360
full-stackhard
1343
code-reviewmedium
1339
code-review
1338
from-scratcheasy
1332
refactoringmedium
1285
frontendmedium
1279
refactoring
1265
frontend
1252
backend
1247
backendexpert
1201
from-scratch
1196
multi-language
1164
from-scratchmedium
1151
backendhard
1141
multi-languageexpert
1020
code-reviewhard
966
from-scratchhard
926
refactoringexpert
607
frontendexpert
586
backendeasy
502
multi-languagehard
462
frontendeasy
10

All Results

TaskCategoryScore
Implement zero-trust API authentication layerbackend28.0
Optimize bloated React bundle under 500KBfrontend0.0
Implement multi-tenant row-level security in Postgresbackend28.0
Add streaming SSE endpoint for LLM chatbackend
Fix deadlocking transaction patterns in Flask appbackend28.0
Debug race condition in worker pooldebugging28.0
Split 1100-line god file into proper modulesrefactoring28.0
Fix memory leak in event handlerdebugging
Add GraphQL layer over REST APImulti-language
Implement background job scheduler with persistencebackend0.0
Fix auth bypass vulnerabilitydebugging
Build distributed node cluster with gossip protocolfrom-scratch0.0
Write tests for untested legacy Flask servicecode-review0.0
Implement Stripe webhook handlerbackend
Fix N+1 query in dashboardbackend0.0
Fix broken GitHub Actions CI pipelinedebugging
Add i18n with locale routing to Next.js appfull-stack28.0
Migrate callback-hell Express app to async/awaitrefactoring22.0
Build RAG pipeline with vector searchbackend0.0
Fix broken responsive layoutfrontend48.5
Fix 12 WCAG accessibility violations in checkout formfrontend84.1
Add Google OAuth2 login to Express appfull-stack50.4
Build MCP server for database managementbackend48.5
Fix hallucination and context window bugs in RAG agentbackend50.0
Fix race conditions in order matching enginebackend60.9
Build SaaS admin dashboard from scratchfrom-scratch50.3
Build CLI tool with subcommands and configfrom-scratch37.5
Find and fix 4 hidden backdoors in Flask appdebugging55.0
Add retry logic and dead letter queue to Python task queuebackend77.4
Write Kubernetes manifests for Node.js microservicefull-stack82.9
Dockerize Node.js monorepofull-stack68.8
Build materialized view refresh pipeline for analyticsbackend50.4
Build REST API from scratchfrom-scratch49.6
Add virtual scrolling to table rendering 5000 rowsfrontend46.1
Refactor monolithic handler to CQRSrefactoring49.8
Add file upload with S3 presigned URLsbackend48.5
Harden insecure Docker setup with 12 vulnerabilitiescode-review81.3
Convert React app to PWA with offline supportfrontend80.8
Build real-time portfolio risk calculatorbackend47.3
Add cursor-based pagination to REST APIbackend46.9
Write complex SQL report with window functionsbackend57.5
Replace console.log with structured loggingrefactoring82.5
Find and patch all OWASP Top 10 vulnerabilitiesdebugging71.7
Implement transformer inference engine with KV cachefrom-scratch81.7
Write integration tests for payment flowcode-review35.9
Build LLM evaluation harness with structured gradingbackend56.8
Build production website with auth and members areafrontend38.5
Fix data integrity bugs in denormalized e-commerce schemadebugging87.3
Debug and fix 6 broken database triggers and constraintsdebugging90.8
Port Python CLI to Rustmulti-language29.3
Add slash commands and moderation to Discord botbackend78.2
Fix flaky test suitedebugging86.7
Add rate limiting middlewarebackend48.5
Fix React hydration mismatchfrontend47.0
Zero-downtime schema migrationfull-stack86.5
Add caching layer to eliminate slow SSR page loadsfull-stack76.5
Build codebase indexer for LLM context windowsfrom-scratch36.5
Add Redis caching layer to Express APIbackend43.5
Implement JWT auth middlewarebackend53.0
Code review: identify security vulnscode-review66.9
Remove AI slop and over-engineering from codebaserefactoring86.7
Build terminal UI dashboardfrom-scratch48.5
Add WebSocket real-time updatesfull-stack53.3
Optimize slow Postgres queries in Flask appbackend51.3
Fix Node.js stream backpressure causing OOM on large filesbackend41.5
Implement multi-tenant row-level security in Postgresbackend31.3
Implement zero-trust API authentication layerbackend66.3
Write Kubernetes manifests for Node.js microservicefull-stack76.5
Add caching layer to eliminate slow SSR page loadsfull-stack71.3
Optimize bloated React bundle under 500KBfrontend59.1
Harden insecure Docker setup with 12 vulnerabilitiescode-review72.3
Replace console.log with structured loggingrefactoring42.4
Convert React app to PWA with offline supportfrontend61.1
Implement JWT auth middlewarebackend82.9
Build codebase indexer for LLM context windowsfrom-scratch22.1
Split 1100-line god file into proper modulesrefactoring28.0
Find and patch all OWASP Top 10 vulnerabilitiesdebugging36.5
Remove AI slop and over-engineering from codebaserefactoring67.8
Fix broken responsive layoutfrontend53.7
Add i18n with locale routing to Next.js appfull-stack50.3
Dockerize Node.js monorepofull-stack62.7
Build CLI tool with subcommands and configfrom-scratch32.7
Build production website with auth and members areafrontend49.9
Implement background job scheduler with persistencebackend41.5
Build real-time portfolio risk calculatorbackend57.4
Build LLM evaluation harness with structured gradingbackend55.3
Implement transformer inference engine with KV cachefrom-scratch52.0
Fix hallucination and context window bugs in RAG agentbackend64.3
Build SaaS admin dashboard from scratchfrom-scratch40.3
Build MCP server for database managementbackend47.8
Fix race conditions in order matching enginebackend65.2
Write complex SQL report with window functionsbackend63.1
Fix data integrity bugs in denormalized e-commerce schemadebugging51.9
Fix deadlocking transaction patterns in Flask appbackend54.9
Find and fix 4 hidden backdoors in Flask appdebugging21.8
Debug and fix 6 broken database triggers and constraintsdebugging67.6
Write tests for untested legacy Flask servicecode-review47.9
Optimize slow Postgres queries in Flask appbackend67.0
Add Redis caching layer to Express APIbackend69.2
Fix 12 WCAG accessibility violations in checkout formfrontend72.5
Add slash commands and moderation to Discord botbackend27.7
Write integration tests for payment flowcode-review58.1
Add retry logic and dead letter queue to Python task queuebackend56.5
Add virtual scrolling to table rendering 5000 rowsfrontend28.3
Fix Node.js stream backpressure causing OOM on large filesbackend55.0
Add GraphQL layer over REST APImulti-language41.0
Build distributed node cluster with gossip protocolfrom-scratch18.6
Fix auth bypass vulnerabilitydebugging92.1
Refactor monolithic handler to CQRSrefactoring45.3
Implement Stripe webhook handlerbackend39.8
Zero-downtime schema migrationfull-stack55.9
Add rate limiting middlewarebackend45.3
Fix React hydration mismatchfrontend40.8
Build terminal UI dashboardfrom-scratch40.4
Fix N+1 query in dashboardbackend48.8
Build REST API from scratchfrom-scratch79.1
Fix memory leak in event handlerdebugging63.5
Debug race condition in worker pooldebugging67.9