DeepSeek V3

Model              # Total Params   # Activated Params   Context Length   Download
DeepSeek-V3-Base   671B             37B                  128K             Hugging Face
DeepSeek-V3        671B             37B                  128K             Hugging Face

About DeepSeek V3

Released in December 2024, DeepSeek V3 is a mixture-of-experts (MoE) language model. It has 671 billion total parameters, of which 37 billion are activated for each token, and supports a context length of 128K tokens. Because only a fraction of the parameters are used per token, the model can be very large while keeping the compute per token closer to that of a much smaller model.
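
To make the mixture-of-experts idea concrete, here is a minimal sketch of generic top-k expert routing in plain Python. It is an illustration only and does not reproduce DeepSeek V3's actual router. The toy experts, the router scores, and the helper names (`route_top_k`, `moe_forward`) are all assumptions made for this example. The last lines compute the share of parameters active per token from the figures in the table above.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def route_top_k(router_scores, k):
    """Return (expert_index, weight) pairs for the k highest-scoring experts.

    Weights are renormalized so that the selected experts sum to 1.
    """
    probs = softmax(router_scores)
    top = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)[:k]
    norm = sum(probs[i] for i in top)
    return [(i, probs[i] / norm) for i in top]


def moe_forward(x, experts, router_scores, k):
    """Evaluate only the selected experts and mix their outputs."""
    return sum(w * experts[i](x) for i, w in route_top_k(router_scores, k))


# Hypothetical toy setup: 8 "experts", each a scalar function x -> s * x.
experts = [lambda x, s=s: s * x for s in range(1, 9)]
# Hypothetical router scores for one token.
router_scores = [0.1, 2.0, -1.0, 0.5, 3.0, 0.0, -0.5, 1.0]

selected = route_top_k(router_scores, k=2)
y = moe_forward(1.0, experts, router_scores, k=2)
print("selected experts:", [i for i, _ in selected])
print("output:", round(y, 4))

# Share of parameters active per token, using the table's figures.
TOTAL_PARAMS_B = 671
ACTIVATED_PARAMS_B = 37
active_fraction = ACTIVATED_PARAMS_B / TOTAL_PARAMS_B
print(f"active per token: {active_fraction:.1%}")
```

In this toy run only 2 of the 8 experts are evaluated for the token; the other 6 contribute no compute. The same principle is behind the gap between the 671B total and 37B activated parameter counts, which works out to roughly 5.5% of the parameters being used for any one token.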