LLM trading agents show measurable pre-failure signatures before market drawdowns
New study finds LLM reasoning drifts before crashes, risk feedback can help.
A new study by Weicheng Xue (arXiv:2605.28850) introduces TradeArena, an auditable trading-agent testbed that lets researchers analyze how LLM agents reason and behave under market stress. The paper reveals measurable pre-failure signatures: planning embeddings drift away from normal-state centroids, and fused plan-risk representations separate normal from pre-drawdown states. The author uses 80 rolling failure anchors across 8 LLM trajectories and shows that effective-rank contraction persists across multiple embedding probes (hash, LSA, Transformer, white-box hidden states).
Key experiments show that structured risk feedback can act as an external alignment signal without fine-tuning, though it's not a universal performance booster: true audit feedback improves calibration for some models, while placebo feedback sometimes yields higher short-horizon returns but weaker alignment diagnostics. A 51-stock intraday experiment reveals a correlation blind spot—LLM rationales often justify concentrated exposure to coupled assets that the risk layer repeatedly clips. The takeaway is not profitability but auditable risk feedback and representation trajectories that reveal when LLM financial reasoning is aligning, drifting, or failing.
- TradeArena enables replayable trajectories and risk reports to study LLM trading agent behavior
- 80 failure anchors over 8 LLM trajectories show effective-rank contraction before drawdowns
- Structured risk feedback improves calibration for some models but placebo feedback can boost short-term returns
Why It Matters
LLM trading agents are risky; this study offers early warning signals and feedback alignment methods to improve safety.