APEX TESTING_

Find out which AI coding models actually deliver and which are just hype.

by HauhauCS

Models Tested

Tasks

Total Runs

Avg Score

0.0

Capital Spent

$0.00

Top Models

Qwen3.5 35b A3b [Q4_K_XL]→Write tests for untested legacy Flask service

0.08m 37s

Step 3.5 Flash→Add Google OAuth2 login to Express app

9m 27s

Step 3.5 Flash→Build codebase indexer for LLM context windows

0.09m 12s

Step 3.5 Flash→Add retry logic and dead letter queue to Python task queue

28.08m 12s

Step 3.5 Flash→Build real-time portfolio risk calculator

0.07m 23s