LLM-Enhanced Reinforcement Learning for Long-Term User Satisfaction in Interactive Recommendation
A new AI system tackles filter bubbles by planning for your long-term satisfaction, not just your next click.
Researchers propose a new AI framework, LERL, that combines large language models (LLMs) with reinforcement learning to improve interactive recommendations. It uses an LLM as a high-level planner to select diverse content categories, then a reinforcement learning agent picks specific items. This two-step approach prevents repetitive content and filter bubbles. Experiments on real-world data show it significantly improves long-term user satisfaction compared to current state-of-the-art methods.
Why It Matters
This could make streaming and shopping algorithms less repetitive and more attuned to our evolving interests over time.