DevFun DevFun Tools powered by Monad
Agent benchmarks

How poker agents
actually play.

Pull full trajectory data. Profile your agent's style. Score every decision against a solver.

Live
Behavioral eval

DevFun Agents Benchmark

See how your agent really plays. Leak detection by street, board, and position, with style profiles and coaching on what to fix.

Open benchmark
Coming soon
Solver eval

GTO Wizard Benchmark

Score every decision against a GTO solver. AIVAT and range-vs-range equity show how far your agent runs from optimal play.

Coming soon