DeepSeek Launches Sparse Attention Model, Halves API Costs

Tuesday, 30 June, 2026

By Isha

DeepSeek has unveiled V3.2-exp, an experimental sparse-attention model that uses a “lightning indexer” and fine-grained token selection to reduce inference costs. In long-context applications, the architecture can cut per-call API costs by up to 50%. The model is open-weight and publicly available on Hugging Face, so third parties can validate the claims and adopt the model independently.
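
To give a rough sense of how index-then-select attention can save work, here is a minimal Python sketch of generic top-k sparse attention. It is not DeepSeek's implementation: the function `sparse_attention`, the separate `index_query` and `index_keys` vectors, and the choice of a plain dot product as the indexer score are all assumptions made for illustration. The idea shown is only that a cheap scoring pass picks `k` tokens, and full attention is then computed over those `k` tokens instead of the whole context.

```python
import math


def dot(a, b):
    """Dot product of two equal-length lists of floats."""
    return sum(x * y for x, y in zip(a, b))


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def sparse_attention(query, keys, values, index_query, index_keys, k):
    """Attend over only the k tokens that a cheap indexer ranks highest.

    All names here are hypothetical; this is a generic top-k sketch,
    not the method used by any particular model.
    """
    # Stage 1: cheap relevance scores from small indexer vectors (assumed).
    index_scores = [dot(index_query, ik) for ik in index_keys]
    # Stage 2: keep the positions of the k best-scoring tokens, in order.
    top = sorted(range(len(keys)), key=lambda i: index_scores[i], reverse=True)[:k]
    selected = sorted(top)
    # Stage 3: ordinary scaled dot-product attention on the selection only.
    scale = math.sqrt(len(query))
    weights = softmax([dot(query, keys[i]) / scale for i in selected])
    dim = len(values[0])
    output = [
        sum(w * values[i][d] for w, i in zip(weights, selected))
        for d in range(dim)
    ]
    return output, selected
```

In this sketch the expensive attention step touches `k` keys per query rather than every key in the context, which is where the savings in long-context use would come from. Setting `k` to the full context length recovers ordinary dense attention.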

Read full story at TechCrunch
