vLLM
🌍 International 📖 Open Source ✅ Free
UC Berkeley open-source high-throughput LLM inference engine with PagedAttention. Self-host any open-source model.
🎁 Free Tier
Daily Limit: Apache-2.0 open-source.
| Model | Context | Limit | Notes |
|---|
🔑 Free API
No free API available