DeepSeek-V3 — Open-Weight 671B MoE Model with GPT-4o Quality
DeepSeek-V3 is a 671B-parameter mixture-of-experts (MoE) model that activates 37B parameters per token. It matches GPT-4o on benchmarks. The weights are MIT-licensed, and the hosted API costs $0.27 per 1M input tokens.
Safe staging for this asset
This asset is staged first. The copied prompt asks the agent to inspect the staged files before activating scripts, MCP config, or global config.
npx -y tokrepo@latest install 1b0d1ab2-1edb-49e1-9853-b02807a64140 --target codex
Stages the files first; activation requires reviewing the README and the staged plan.
Similar assets
DeepSeek-R1 — Open-Weight Reasoning Model Rivaling OpenAI o1
DeepSeek-R1 is an open-weight reasoning model that matches OpenAI o1 on math, code, and science benchmarks. Its chain-of-thought is streamed and visible. MIT-licensed.
DeepSeek Coder — Code-Specialized Model for Local Inference
DeepSeek Coder is a code-specialized open-weight model with fill-in-the-middle (FIM) support. Beats Codestral on HumanEval. Drops into Continue and Aider.
Open R1 — Fully Open Reproduction of DeepSeek-R1
A community effort by Hugging Face to reproduce and improve upon DeepSeek-R1 reasoning capabilities using fully open training recipes, datasets, and model weights.
Fireworks Inference — 100+ Open Models on OpenAI-Compat API
Fireworks runs Llama, Mixtral, DeepSeek, Qwen, and Phi via an OpenAI-compatible API. Sub-second time to first token (TTFT), with speculative decoding on flagship models.