From DeepSeek V3 to V3.2: Architecture, Sparse Attention, and RL Updates
Sebastian Raschka, PhD
From DeepSeek V3 to V3.2: Architecture, Sparse Attention, and RL Updates
Understanding How DeepSeek's Flagship Open-Weight Models Evolved
Last updated: January 1st, 2026
Similar to DeepSeek V3, the team released their new flagship model over a major US holiday weekend. Given DeepSeek V3.2’s really good performance (on GPT-5 and Gemini 3.0 Pro) level, and the fact that it’s also available as an open-weight model, it’s definitely worth a closer look.
I covered the predecessor, DeepSeek V3, at the.
