articleGitHub Trending — Python
deepseek-ai/DeepSeek-V3
DeepSeek-V3 is a 671B Mixture-of-Experts LLM with 37B activated per token, featuring MLA and an auxiliary-load-free load balancing strategy, trained on 14.8T tokens. It achieves top open-source benchmarks and supports FP8-enabled backends (SGLang, LMDeploy, TRT-LLM, vLLM, LightLLM) with local run steps and a Multi-Token Prediction objective plus post-training distillation from DeepSeek-R1.
published APR 28, 2026★★★★★
Read the sourcegithub.com/deepseek-ai/DeepSeek-V3
[*] Opens in a new tab · no tracking on Lantern's side
- Source
- GitHub Trending — Python
- Ingested
- APR 28, 2026 · 08:40
- Editorial score
- 5.0 / 5