Viral Wire

DeepSeek V4's 1M token context slashes costs for startups

DeepSeek V4 offers 1M tokens at $0.14/million input, reshaping AI economics.

Deep Dive

DeepSeek has launched preview versions of its V4 model family, featuring a 1 million token context window that dramatically expands what can be processed in a single prompt. Available as V4 Pro and V4 Flash, the models support reasoning, agentic workflows, and document-heavy use cases. The 1M context is a massive leap from V3's 128K tokens, enabling startups to handle entire codebases, large contract sets, or stacks of research files without resorting to chunking or retrieval-augmented generation. This simplification reduces engineering overhead and failure points, making long-context AI more accessible.

Pricing is equally disruptive: V4 Flash costs $0.14 per million input tokens and $0.28 per million output, while V4 Pro is offered at a 75% discount through May 31. These numbers undercut OpenAI and Anthropic, forcing them to justify premium prices beyond raw capability. For early-stage teams on tight budgets, this could mean the difference between shipping a product and staying in prototype. Available via OpenAI-format and Anthropic-format endpoints, DeepSeek V4 makes long-context AI a basic utility rather than a premium feature.

Key Points
  • 1 million token context window, up from 128K in V3, supporting full codebases and documents in one prompt.
  • V4 Flash priced at $0.14/M input tokens and $0.28/M output tokens; V4 Pro at 75% discount through May 31.
  • Compatible with OpenAI and Anthropic API formats, easing migration and integration for startups.

Why It Matters

Startups can now build long-context AI apps cheaply, forcing OpenAI and Anthropic to compete on price.