The upcoming model promises improved coding capabilities and reasoning in multiple languages beyond English. DeepSeek’s competitive advantage stems from its parent company High-Flyer’s early investment in computing power, including two supercomputing clusters acquired before U.S. export bans on advanced Nvidia chips. The second cluster, Fire-Flyer II, comprised approximately 10,000 Nvidia A100 chips. DeepSeek’s cost-efficiency comes from innovative architecture choices like Mixture-of-Experts (MoE) and multihead latent attention (MLA).
According to Bernstein analysts, DeepSeek’s pricing was 20-40 times cheaper than OpenAI’s equivalent models. The competitive pressure has already forced OpenAI to cut prices and release a scaled-down model, while Google’s Gemini has introduced discounted access tiers.
Read more of this story at Slashdot.