NotebookLM can now summarize research in ‘cinematic’ video overviews
The upgraded feature uses Gemini 3, Veo 3, and Nano Banana Pro to create fully animated summaries from text.
Google has significantly upgraded its AI-powered research assistant, NotebookLM, with a new 'cinematic' video overview feature that transforms text notes into fully animated video summaries. This represents a major leap from last year's basic narrated slideshows, as the system now employs a sophisticated pipeline of Google's latest AI models—including Gemini 3 for narrative structuring, Veo 3 for video generation, and Nano Banana Pro for optimization—to create cohesive, stylized animations directly from user-provided content. The feature marks another aggressive move in Google's expanding AI video toolkit, following recent upgrades to Veo and broader access to its Flow video generator, positioning the company against competitors like OpenAI's Sora and emerging text-to-video platforms.
The technical implementation involves Gemini 3 analyzing the uploaded notes to determine the optimal narrative flow, visual aesthetic, and format, then refining its own output for consistency before Veo 3 generates the corresponding animated sequences. This 'cinematic' capability is currently exclusive to English-speaking users over 18 with a Google AI Ultra subscription, and is subject to a daily limit of 20 generations to manage computational load. For researchers, students, and professionals, this means dense reports or complex project notes can be automatically converted into shareable, engaging video summaries, potentially saving hours of manual presentation work. However, the subscription gate and usage caps indicate Google is treating this as a premium, resource-intensive feature as it scales its AI video infrastructure.
- Upgraded from basic slideshows to fully animated videos using Gemini 3, Veo 3, and Nano Banana Pro AI models
- Currently limited to 20 generations per day for English-speaking Google AI Ultra subscribers aged 18+
- Gemini 3 autonomously determines narrative, visual style, and refines output for consistency without user input
Why It Matters
Automates the creation of engaging video summaries from dense text, saving professionals hours of manual presentation work.