Live Benchmarks

Novu Benchmark

Performance results of AI coding models on Novu tasks, measuring success rate and execution time with high precision.

View on GitHubTotal tasks: 18Last run: 5/6/2026

Model Performance

ModelPassedAvg DurationSuccess Rate
#1
gemini-3-flashNEW
3265.3s
17%