Newsroom

QFBench News

Product updates, benchmark announcements, and community events.

N-003Leaderboard2026-05-04

V11 Leaderboard Published Across 80 Complete CLI Tasks

The homepage leaderboard now reflects V11 pass@1/pass@3 results: GPT-5.5 leads at 61.7% pass@1, followed by GPT-5.3-codex and Opus 4.6.

N-002Dataset2026-05-04

The benchmark repository has grown to 87 merged quantitative finance tasks, with the full 90-task milestone now in sight.

N-001New2026-04-18

Join our weekly discussion to talk about benchmark progress, quantitative finance tasks, and upcoming evaluation updates.