Live Benchmarks

Novu Benchmark

Performance results of AI coding models on Novu tasks, measured by success rate and average execution time.

Total tasks: 18

Last run: 5/6/2026

Model Performance

Model	Passed	Avg Duration	Success Rate
#1 gemini-3-flash (NEW)	3	265.3s	17%
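
The page does not state how Success Rate is calculated. A minimal sketch, assuming it is the Passed count divided by Total tasks and rounded to the nearest whole percent, which is consistent with 3 of 18 tasks being shown as 17%. The function name `success_rate` is hypothetical and used only for illustration.

```python
def success_rate(passed: int, total: int) -> int:
    """Return the pass percentage, rounded to the nearest whole number.

    Assumption: the leaderboard computes passed / total tasks.
    """
    if total <= 0:
        raise ValueError("total must be positive")
    if not 0 <= passed <= total:
        raise ValueError("passed must be between 0 and total")
    return round(100 * passed / total)


# Values taken from the table: 3 tasks passed out of 18 total.
rate = success_rate(3, 18)
print(f"{rate}%")  # 3 / 18 = 16.67%, which rounds to 17%
```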