Ingen varer
Kassen / rediger kurv
By creatively blending a variety of strategies and innovations like Mixture of Experts, Latent Attention, Multi-token Prediction, model distillation and efficient parallelisation, DeepSeek set a new standard for what’s possible in an open LLM.