deepseek-ai·Dec 26, 2024
DeepSeek: DeepSeek V3
deepseek/deepseek-chat
Context: 163.8K
Max Output: 163.8K
Input / 1M: $0.2002
Output / 1M: $0.8001
About
DeepSeek-V3 is a Mixture-of-Experts (MoE) model from the DeepSeek team with 671B total parameters, of which 37B are activated per token. It builds on the instruction-following and coding abilities of prior versions. It adopts Multi-head Latent Attention (MLA) and the DeepSeekMoE architecture, with an auxiliary-loss-free load-balancing strategy and a multi-token prediction objective. Pre-trained on nearly 15 trillion tokens, it reports 87.1% on MMLU, 89.3% on GSM8K, and 65.2% on HumanEval, results that the DeepSeek team describes as comparable to leading closed-source models.
Capabilities
- Context Length: 163.8K
- Max Output: 163.8K
- Reasoning: No
- In: text
- Out: text
Pricing
| Type | Price / 1M tokens |
|---|---|
| Input | $0.2002 |
| Output | $0.8001 |
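
As a rough illustration of how the per-million-token prices translate into a per-request cost, the sketch below multiplies token counts by the two listed rates. The function name `estimate_cost` is a hypothetical helper, not part of any SDK, and the calculation assumes billing is linear in tokens with no minimums, caching discounts, or rounding rules, none of which this page specifies.

```python
from decimal import Decimal

# Prices per 1M tokens, copied from the pricing table on this page.
INPUT_PRICE_PER_M = Decimal("0.2002")
OUTPUT_PRICE_PER_M = Decimal("0.8001")
TOKENS_PER_M = Decimal(1_000_000)


def estimate_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Estimate the USD cost of one request (hypothetical helper).

    Assumes cost scales linearly with token counts; actual billing
    rules may differ.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = INPUT_PRICE_PER_M * input_tokens + OUTPUT_PRICE_PER_M * output_tokens
    return total / TOKENS_PER_M


if __name__ == "__main__":
    # Example: a 10,000-token prompt with a 2,000-token completion.
    print(estimate_cost(10_000, 2_000))
```

Under these assumptions, a request with 10,000 input tokens and 2,000 output tokens comes to roughly $0.0036. `Decimal` is used instead of `float` so that small per-token amounts do not pick up binary rounding noise.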
OpenAI-compatible · Model ID deepseek/deepseek-chat
curl https://api.elliotgate.com/v1/chat/completions \
  -H "Authorization: Bearer sk-omg-your-api-key" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "deepseek/deepseek-chat",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'
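
The same request can be sketched in Python using only the standard library. The endpoint URL, headers, and model ID are taken from the curl example above; the `build_request` helper and the `ELLIOTGATE_API_KEY` environment variable name are assumptions made for illustration. The response parsing assumes the usual OpenAI-style `choices[0].message.content` shape, which this page implies ("OpenAI-compatible") but does not show.

```python
import json
import os
import urllib.request

# Endpoint and model ID as shown in the curl example on this page.
API_URL = "https://api.elliotgate.com/v1/chat/completions"
MODEL_ID = "deepseek/deepseek-chat"


def build_request(api_key: str, prompt: str) -> urllib.request.Request:
    """Build (but do not send) a chat completion request (hypothetical helper)."""
    payload = {
        "model": MODEL_ID,
        "messages": [{"role": "user", "content": prompt}],
    }
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


if __name__ == "__main__":
    # Hypothetical environment variable name; use whatever holds your key.
    key = os.environ.get("ELLIOTGATE_API_KEY")
    if key:
        with urllib.request.urlopen(build_request(key, "Hello!"), timeout=30) as resp:
            body = json.load(resp)
        # Assumes an OpenAI-style response body.
        print(body["choices"][0]["message"]["content"])
```

Separating request construction from sending keeps the payload easy to inspect or log before any network call is made.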