NVIDIA Build (NIM API)

NVIDIA Build (NIM API)

80 pts
1 wins
VS
👑 DeepSeek

DeepSeek

95 pts
2 wins

🏆 Overall, DeepSeek offers more free value (2/6 categories)

📊 Side-by-Side

Category
NVIDIA Build (NIM API)
DeepSeek
Free Tier
✅ Unlimited (40 RPM rate limit)
✅ 50 requests/day
Free API
✅ 无限制(已取消额度限制)
✅ $5
Rate Limit
40 RPM(可申请提升到 200 RPM)
2 RPM
Open Source
❌ No
✅ Yes
Free Models
10
4
GitHub Stars
-
⭐ 102,923

🧠 Model Details

NVIDIA Build (NIM API) 10 models
MiniMax M2.7
📐 128k ⚡ 40 RPM
230B params, coding/reasoning/office all-rounder
Kimi K2.5
📐 1000k ⚡ 40 RPM
Moonshot native multimodal agentic model, 15T tokens training, 1M context, top Chinese ability
GLM-5.1
📐 128k ⚡ 40 RPM
Zhipu's latest flagship, GLM-5 upgrade, optimized for agentic coding/long-horizon reasoning. GLM-5 deprecated 2026-04-20
DeepSeek V3.2
📐 128k ⚡ 40 RPM
671B MoE, coding champion
DeepSeek R1
📐 64k ⚡ 40 RPM
671B MoE, reasoning champion
Gemma 4 31B-IT
📐 128k ⚡ 40 RPM
Google's latest open source, strong agentic capability, runs on consumer hardware
Nemotron-3-Super-120B
📐 1000k ⚡ 40 RPM
NVIDIA's own flagship, hybrid Mamba-Transformer MoE, 1M context, 7.5x throughput vs Qwen3.5-122B
Llama 4 Maverick
📐 128k ⚡ 40 RPM
Meta's latest open source LLM
Qwen 3.5
📐 128k ⚡ 40 RPM
Alibaba Qwen, native multimodal, 397B params with only 17B active, extremely efficient
Step 3.5 Flash
📐 128k ⚡ 40 RPM
StepFun, extremely fast
DeepSeek 4 models
DeepSeek-V4-Pro
📐 1000k ⚡ Rate limited
Released April 2026, 1.6T param MoE (49B active), 1M context, thinking/non-thinking modes
DeepSeek-V4-Flash
📐 1000k ⚡ Rate limited
Lightweight V4, 284B params (13B active), 1M context, excellent value
DeepSeek-V3
📐 64k ⚡ 50 RPD
Previous flagship, still available
DeepSeek-R1
📐 64k ⚡ 50 RPD
Reasoning model, strong at math/code
🐑 小羊助手