PokerSkill combines a human-designed rule-based skill library with LLMs, achieving a 49–61% loss reduction over default prompts against GTOWizard?

PokerSkill combines a human-designed rule-based skill library with LLMs, achieving a 49–61% loss reduction over default prompts against GTOWizard.

GPT-5.5 XHigh with PokerSkill reached -57 mbb/hand, outperforming the strong bot Slumbot without any game-specific training?

GPT-5.5 XHigh with PokerSkill reached -57 mbb/hand, outperforming the strong bot Slumbot without any game-specific training.

This is the first demonstration of an LLM competing at expert level in an imperfect-information game without solver queries or millions of core-hours of training?

This is the first demonstration of an LLM competing at expert level in an imperfect-information game without solver queries or millions of core-hours of training.

Research & Papers

PokerSkill lets LLMs play expert-level poker without training

arXiv cs.GT May 29, 2026

⚡GPT-5.5 and Claude 4.6 crush poker pros using only rule-based skills and zero training.

Deep Dive

A new framework called PokerSkill, from researchers at the Chinese University of Hong Kong, shows that large language models (LLMs) can play expert-level poker without any game-specific training or equilibrium solvers. Traditional AI poker agents rely on counterfactual regret minimization (CFR) solvers that require millions of core-hours of training. While LLMs already possess extensive poker knowledge, their raw performance lags far behind solver-based bots. PokerSkill bridges this gap by using a structured rule-based skill library—designed entirely by human experts—to ground the LLM's actions. A deterministic context engine analyzes the current game state and retrieves only the relevant skill fragments, constraining the LLM to reasonable, expert-level decisions.

Against GTOWizard, a state-of-the-art GTO benchmark, GPT-5.5 XHigh with PokerSkill achieved -57 ± 21 mbb/hand, while Claude Opus 4.6 scored -80 ± 29 mbb/hand and Claude Opus 4.7 scored -87 ± 64 mbb/hand. These results represent a 49–61% reduction in losses compared to default-prompt baselines, and they outperformed Slumbot, a strong poker bot. The key insight is that neither rule-based skills nor LLMs alone produce a strong strategy, but their combination yields an agent that requires neither training nor solver queries. This marks the first time an LLM has achieved competitive performance in a complex imperfect-information game without game-specific training, suggesting that similar hybrid approaches could crack other strategic domains.

Key Points

PokerSkill combines a human-designed rule-based skill library with LLMs, achieving a 49–61% loss reduction over default prompts against GTOWizard.
GPT-5.5 XHigh with PokerSkill reached -57 mbb/hand, outperforming the strong bot Slumbot without any game-specific training.
This is the first demonstration of an LLM competing at expert level in an imperfect-information game without solver queries or millions of core-hours of training.

Why It Matters

LLMs can now tackle complex strategic tasks like poker without costly training, opening doors for practical AI in games, finance, and negotiation.

Read Original Article

PokerSkill lets LLMs play expert-level poker without training

Why It Matters

Related Articles

🚀 Stay Ahead in AI