DeepSeek V4 Pro 75% Off Promo
DeepSeek V4 Pro has an official limited-time 75% API discount through 2026-05-31 15:59 UTC. This is a pricing promo, not a verified free-credit grant.
AI DEAL COLLECTION
AI API deals where openllmapi can act as a unified fallback when official access, keys, or billing are inconvenient.
AI API deals where openllmapi can act as a unified fallback when official access, keys, or billing are inconvenient. It is useful for developers, indie hackers, and AI tool users who want to compare free credits, limits, and alternative routes quickly.
yangmao.ai refreshes free tiers, expiration dates, claim requirements, and accessibility signals through automated pipelines plus manual checks. Always verify the final claim page before use.
Check the same page for alternative providers, OpenAI-compatible APIs, China-friendly access, or evergreen free tiers instead of relying on one vendor.
DeepSeek V4 Pro has an official limited-time 75% API discount through 2026-05-31 15:59 UTC. This is a pricing promo, not a verified free-credit grant.
Google made Gemini 2.5 Flash publicly available with 1M token context, free tier included.
OpenAI announces a significant price cut for GPT-4.1 API, with input price reduced to $2/M tokens and output to $8/M tokens, offering better value than GPT-4o for large-scale API usage.
OpenAI released an updated GPT-4o mini model with improved performance and lower cost.
Anthropic confirmed a Claude Code and Claude API quota boost on 2026-05-06: doubled five-hour Claude Code limits for Pro, Max, Team, and seat-based Enterprise plans, removal of peak-hours reduction for Pro/Max, and higher Claude Opus API rate limits. This is a quota boost rather than a free subscription.
Claude for Startups is an official application-based promo for venture-backed startups connected with Anthropic VC partners. It offers free API credits and priority rate limits, but the public page does not state a fixed credit amount. Official terms include regional restrictions, so do not position it as China-accessible.
Anthropic for Startups is a high-confidence official path for startup API credits and priority rate limits, but it is not an unconditional signup bonus. It targets VC-backed startups working with Anthropic VC partners; the credit amount is not publicly fixed and depends on Anthropic approval.
Anyscale API has a recorded free trial: $10 free credits; rate limit: 30 RPM.
Anyscale has a recorded free tier: credit-based. Good for testing before upgrading.
Anyscale is recorded as supporting OpenAI-compatible API access. Free/trial info: $10 free credits. Useful for low-cost testing by swapping SDK base_url.
AutoDL has recorded free compute or trial credits: ¥10 credits. Useful for inference, deployment, or GPU experiments.
Baichuan AI API has a recorded free trial: 500万 tokens; rate limit: 5 RPM.
Baichuan AI has a recorded free tier: No explicit limit. Good for testing before upgrading.
Baichuan AI is recorded as supporting OpenAI-compatible API access. Free/trial info: 500万 tokens. Useful for low-cost testing by swapping SDK base_url.
Banana has recorded free compute or trial credits: trial credits. Useful for inference, deployment, or GPU experiments.
Cerebras API has a recorded free trial: 1M tokens/day; rate limit: 30 RPM / 60K TPM / 1M TPD.
Cerebras has a recorded free tier: 1M tokens/day. Good for testing before upgrading.
Cerebras is recorded as supporting OpenAI-compatible API access. Free/trial info: 1M tokens/day. Useful for low-cost testing by swapping SDK base_url.
ChatGPT (OpenAI) is recorded as supporting OpenAI-compatible API access. Free/trial info: $0. Useful for low-cost testing by swapping SDK base_url.
Malta's AI for All program treats ChatGPT Plus as a national AI literacy benefit: complete the course first, then receive one year of Plus. This is country-limited, not a general loophole, but an important AI public-benefit signal.
Anthropic API API has a recorded free trial: $5; rate limit: 5 RPM.
Cloudflare Workers AI API has a recorded free trial: 每天 10000 神经元(永久有效); rate limit: 10000 requests/day.
Cloudflare Workers AI has a recorded free tier: 10,000 free requests/day. Good for testing before upgrading.
Cloudflare Workers AI is recorded as supporting OpenAI-compatible API access. Free/trial info: 每天 10000 神经元(永久有效). Useful for low-cost testing by swapping SDK base_url.
Cohere launched Command A for enterprise RAG and tool use, with updated API pricing. See official website for details.
Cohere API has a recorded free trial: 1000 calls/month; rate limit: Trial rate limits.
Cohere has a recorded free tier: 1,000 calls/month (Trial Key). Good for testing before upgrading.
Coze (ByteDance) API has a recorded free trial: Free tier; rate limit: Varies.
Coze (ByteDance) has a recorded free tier: No explicit limit. Good for testing before upgrading.
DeepSeek's official docs confirm a capacity expansion request path for API accounts that need higher concurrency than the default limits. DeepSeek matches appropriate concurrency based on submitted business needs, with no additional cost for capacity expansion. This is for teams or businesses needing higher DeepSeek V4 Pro / V4 Flash concurrency; it is not free token credit and is not automatic access.
DeepSeek API has a recorded free trial: $5; rate limit: 2 RPM.
DeepSeek offers 50 free inferences daily (V3 + R1 models) plus $5 API credits on signup. R1 reasoning model excels at math and code, one of the best free AI options available.
DeepSeek has a recorded free tier: 50 requests/day. Good for testing before upgrading.
DeepSeek is recorded as supporting OpenAI-compatible API access. Free/trial info: $5. Useful for low-cost testing by swapping SDK base_url.
Doubao (ByteDance) API has a recorded free trial: 50万 tokens; rate limit: 5 RPM.
Doubao (ByteDance) is recorded as supporting OpenAI-compatible API access. Free/trial info: 50万 tokens. Useful for low-cost testing by swapping SDK base_url.
ElevenLabs API has a recorded free trial: 10K chars/month; rate limit: Varies.
ElevenLabs has a recorded free tier: 10,000 characters/month. Good for testing before upgrading.
ERNIE Bot (Baidu) API has a recorded free trial: Free tier; rate limit: 5 RPM.
ERNIE Bot (Baidu) has a recorded free tier: No explicit limit. Good for testing before upgrading.
fal.ai API has a recorded free trial: Promotional credits; rate limit: N/A.
fal.ai has a recorded free tier: Promotional credits on signup. Good for testing before upgrading.
Fireworks AI API has a recorded free trial: $1 free credits; rate limit: 600 RPM.
Fireworks AI has a recorded free tier: 600 RPM. Good for testing before upgrading.
Fireworks AI is recorded as supporting OpenAI-compatible API access. Free/trial info: $1 free credits. Useful for low-cost testing by swapping SDK base_url.
FLUX (Black Forest Labs) API has a recorded free trial: Free via platforms; rate limit: Varies.
The Gemini API free tier is suitable for developers, small projects, and prototypes. Actual free rate limits vary by model, project, and billing tier, so users should confirm current limits in AI Studio.
Gemini (Google) has a recorded free tier: No explicit limit. Good for testing before upgrading.
Local deployment resource for Gemma 4 31B: MLX and GGUF variants, Mac memory requirements, Ollama/LM Studio routes, and safety notes.
GitHub Education is one of the most reliable education-benefit paths for AI coding. Verified students and teachers can receive Copilot-related benefits for long-term learning and development.
Google AI Pro / Gemini Advanced often has student or region-limited offers. This entry tracks official eligibility, verification paths, and alternatives so users do not mistake limited campaigns for global offers.
Google Antigravity still has official free weekly limits and Pro/Ultra higher-limit signals, but the Ultra USD $100 bonus-credit sub-offer expired on 2026-05-25 and is no longer presented as claimable.
Google Cloud's official $300 free credit offer for new customers can support AI API and cloud POC workflows. Eligibility and regional availability should be checked in the Google Cloud signup flow.
Google AI (Gemini) API has a recorded free trial: 免费 API 无需信用卡; rate limit: 15 RPM (Flash).
Google AI (Gemini) has a recorded free tier: Gemini free tier unlimited. Good for testing before upgrading.
Google has updated the Gemini free tier quota. The Gemini 2.5 Flash model is now free on AI Studio with a rate limit of 30 requests per minute.
Google officially launched Gemini 2.5 Flash, supporting up to 1M token context window, priced lower than Pro, designed for developers with efficient reasoning capabilities.
Gemini 2.5 Flash introduces a controllable thinking mode, allowing users to adjust reasoning depth for a balance between speed and accuracy.
The official Gemini API / AI Studio no-card free tier now has an additional entry: beyond Gemini API Free Tier input/output tokens, Google's I/O 2026 Blog confirms that new AI Studio builders can deploy their first two apps to Google Cloud at no cost with no credit card required. Production use, higher limits, or projects with billing already enabled still follow Cloud Run / Paid Tier rules.
Gorilla has recorded free compute or trial credits: unlimited. Useful for inference, deployment, or GPU experiments.
Grok (xAI) API has a recorded free trial: $25/月; rate limit: Varies.
Grok (xAI) has a recorded free tier: Limited requests/day. Good for testing before upgrading.
xAI's Grok gives $25 API credits monthly, auto-reset. Supports Grok-2 models with OpenAI compatible format. One of the highest monthly free API credits available.
Grok (xAI) is recorded as supporting OpenAI-compatible API access. Free/trial info: $25/月. Useful for low-cost testing by swapping SDK base_url.
Groq launched DeepSeek R1 671B on its platform for high-speed inference, available via Groq API or interface.
Groq API has a recorded free trial: Free tier(永久免费); rate limit: 30 RPM / 6000 TPM.
Groq is one of today's most useful free inference deals: the free tier lets developers test Llama, Mixtral, Gemma and other models through an OpenAI-compatible API. It is best for AI agents, RAG summarization, and low-latency chat prototypes. China access may require additional verification or a relay.
Groq uses custom LPU (Language Processing Unit) chips for the fastest AI inference in the industry. Free models: - Llama 3.3 70B Versatile — 6000 TPM / 30 RPM - Llama 4 Scout 17B — 6000 TPM / 30 RPM - Llama 4 Maverick 17B — 6000 TPM / 30 RPM - Mixtral 8x7B — 5000 TPM / 30 RPM - Gemma 2 9B — 15000 TPM / 30 RPM - DeepSeek R1 Distill Llama 70B — 6000 TPM / 30 RPM Highlights: - 10x+ faster than GPU solutions, Llama 3.3 70B reaches 300+ tokens/sec - API keys start with gsk_, OpenAI-compatible - No total cap, rate-limited only - Requires proxy from China (use openllmapi.com)
Groq uses proprietary LPU (Language Processing Unit) chips for the world's fastest AI inference. Free tier requires no credit card. Free tier details: - Llama 3.3 70B: 30 RPM, 6000 tokens/min, 14400 requests/day - Llama 3.1 8B: 30 RPM, 20000 tokens/min - Gemma 2 9B: 30 RPM, 15000 tokens/min - Mixtral 8x7B: 30 RPM, 5000 tokens/min - Llama 4 Scout/Maverick (newly added) Why Groq is so fast: - Custom LPU chip designed specifically for LLM inference - Deterministic execution, no GPU memory bandwidth bottleneck - Llama 3.3 70B output at 300+ tokens/s (GPU typically 30-50 tokens/s) - Ultra-low time-to-first-token, ideal for real-time chat and streaming Best for: - Real-time AI chat (speed is the core experience) - Agent tool calls (low latency = faster multi-step reasoning) - Streaming output (buttery smooth typewriter effect) - Rapid prototyping China accessible. OpenAI-compatible API, base_url is https://api.groq.com/openai/v1.
Groq has a recorded free tier: 6000 tokens/min (Llama 3.3 70B). Good for testing before upgrading.
Groq deployed Meta's Llama 4 Scout and Llama 4 Maverick models with free API access.
Groq added Llama 4 Scout and Llama 4 Maverick models, available on free tier.
Groq is recorded as supporting OpenAI-compatible API access. Free/trial info: Free tier(永久免费). Useful for low-cost testing by swapping SDK base_url.
Hugging Face API has a recorded free trial: Free tier; rate limit: Varies.
Hugging Face has a recorded free tier: Varies by model. Good for testing before upgrading.
Tencent Hunyuan API has a recorded free trial: 100万 tokens; rate limit: 5 RPM.
Tencent Hunyuan is recorded as supporting OpenAI-compatible API access. Free/trial info: 100万 tokens. Useful for low-cost testing by swapping SDK base_url.
Kimi (Moonshot AI) API has a recorded free trial: ¥15 + 充 $5 送 $5; rate limit: 3 RPM.
Kimi (Moonshot AI) is recorded as supporting OpenAI-compatible API access. Free/trial info: ¥15 + 充 $5 送 $5. Useful for low-cost testing by swapping SDK base_url.
Official Kiro signup credit for new users: get up to $20 toward your first paid-plan upgrade. Kiro Pro includes premium models such as Claude Opus 4.7, Opus 4.6, and Sonnet 4.6. A card is required, so use a secondary/virtual card and confirm or cancel auto-renewal after activation.
Lambda Cloud has recorded free compute or trial credits: 无免费额度,但价格有竞争力. Useful for inference, deployment, or GPU experiments.
DGX Cloud Lepton (formerly Lepton AI) API has a recorded free trial: $10 free credits; rate limit: 10 RPM.
DGX Cloud Lepton (formerly Lepton AI) has a recorded free tier: 10M tokens/day. Good for testing before upgrading.
DGX Cloud Lepton (formerly Lepton AI) is recorded as supporting OpenAI-compatible API access. Free/trial info: $10 free credits. Useful for low-cost testing by swapping SDK base_url.
LM Studio API has a recorded free trial: Unlimited; rate limit: Local.
LM Studio is recorded as supporting OpenAI-compatible API access. Free/trial info: Unlimited. Useful for low-cost testing by swapping SDK base_url.
Million Engine is recorded as supporting OpenAI-compatible API access. Free/trial info: 按量付费. Useful for low-cost testing by swapping SDK base_url.
MiniMax API has a recorded free trial: ¥15; rate limit: Varies.
MiniMax is recorded as supporting OpenAI-compatible API access. Free/trial info: ¥15. Useful for low-cost testing by swapping SDK base_url.
Mistral AI API has a recorded free trial: Free tier; rate limit: 1 RPM.
Mistral AI has a recorded free tier: No explicit limit. Good for testing before upgrading.
Mistral AI is recorded as supporting OpenAI-compatible API access. Free/trial info: Free tier. Useful for low-cost testing by swapping SDK base_url.
Modal has recorded free compute or trial credits: $30/month credits. Useful for inference, deployment, or GPU experiments.
Novita AI API has a recorded free trial: $0.50 free credits; rate limit: 60 RPM.
Novita AI has a recorded free tier: credit-based. Good for testing before upgrading.
Novita AI is recorded as supporting OpenAI-compatible API access. Free/trial info: $0.50 free credits. Useful for low-cost testing by swapping SDK base_url.
NVIDIA Build (NIM API) API has a recorded free trial: 无限制(已取消额度限制); rate limit: 40 RPM(可申请提升到 200 RPM).
NVIDIA Build (NIM API) has a recorded free tier: Unlimited (40 RPM rate limit). Good for testing before upgrading.
NVIDIA Build (NIM API) is recorded as supporting OpenAI-compatible API access. Free/trial info: 无限制(已取消额度限制). Useful for low-cost testing by swapping SDK base_url.
NVIDIA NIM has recorded free compute or trial credits: 40 RPM (upgradable to 200). Useful for inference, deployment, or GPU experiments.
OctoAI API has a recorded free trial: $10 free credits; rate limit: 60 RPM.
OctoAI has a recorded free tier: credit-based. Good for testing before upgrading.
OctoAI is recorded as supporting OpenAI-compatible API access. Free/trial info: $10 free credits. Useful for low-cost testing by swapping SDK base_url.
Ollama API has a recorded free trial: Unlimited; rate limit: Local.
Ollama has a recorded free tier: Unlimited (runs locally). Good for testing before upgrading.
Ollama is recorded as supporting OpenAI-compatible API access. Free/trial info: Unlimited. Useful for low-cost testing by swapping SDK base_url.
OpenAI API has a recorded free trial: $5; rate limit: 3 RPM (free tier).
OpenAI has a recorded free tier: ChatGPT free tier unlimited. Good for testing before upgrading.
OpenAI announced a significant price reduction for GPT-4.1 API, with input price dropping to $2 per million tokens and output price to $8 per million tokens, approximately 26% cheaper than GPT-4o, greatly reducing developer costs.
OpenAI launches GPT-4.1 series API, approximately 26% cheaper than GPT-4o, with input at $2/M tokens and output at $8/M tokens. GPT-4.1 mini and nano are even more affordable for various use cases.
OpenAI announces significant price reduction for GPT-4.1 API, with input price dropping to $2 per million tokens and output to $8 per million tokens, approximately 50% lower than GPT-4o, greatly reducing developer costs.
OpenAI announced a significant price reduction for GPT-4.1 API, with input price dropping to $2 per million tokens and output price to $8 per million tokens, 26% cheaper than GPT-4o, offering developers more cost-effective AI capabilities.
OpenAI announces significant price reduction for GPT-4.1 API, with input price dropping to $2/M tokens and output to $8/M tokens, 26% cheaper than GPT-4o, greatly reducing developer costs.
OpenAI announces price reduction for GPT-4.1 API series, with input price dropping to $2 per million tokens and output to $8 per million tokens, offering better value than GPT-4o.
OpenAI is recorded as supporting OpenAI-compatible API access. Free/trial info: $5. Useful for low-cost testing by swapping SDK base_url.
OpenAI free benefits are expanding from individual trials to students, teachers, military cohorts, and country programs. This tracker consolidates eligibility, regions, duration, official paths, and alternatives.
OpenRouter API has a recorded free trial: Free models; rate limit: 20 RPM.
OpenRouter has a recorded free tier: Varies by model. Good for testing before upgrading.
OpenRouter is recorded as supporting OpenAI-compatible API access. Free/trial info: Free models. Useful for low-cost testing by swapping SDK base_url.
Paperspace has recorded free compute or trial credits: free GPU notebooks (6hr sessions). Useful for inference, deployment, or GPU experiments.
Perplexity AI is recorded as supporting OpenAI-compatible API access. Free/trial info: $0. Useful for low-cost testing by swapping SDK base_url.
Qwen (Alibaba) API has a recorded free trial: 7000 万 tokens(新用户一次性); rate limit: 按模型不同.
Alibaba's Qwen3.6-Plus is the strongest Chinese coding model. New Bailian users get 70M free tokens (one-time). Coding ability close to Claude Sonnet 4.6, priced at only ¥2/M tokens.
Qwen (Alibaba) is recorded as supporting OpenAI-compatible API access. Free/trial info: 7000 万 tokens(新用户一次性). Useful for low-cost testing by swapping SDK base_url.
Replicate API has a recorded free trial: Free tier; rate limit: Varies.
Replicate has a recorded free tier: Credit-based. Good for testing before upgrading.
RunPod has recorded free compute or trial credits: $1 credits. Useful for inference, deployment, or GPU experiments.
SaladCloud has recorded free compute or trial credits: trial credits. Useful for inference, deployment, or GPU experiments.
SambaNova Cloud offers the world's only free LLaMA 3.1 405B API access. Core advantages: - LLaMA 3.1 405B (405 billion parameters) completely free — the largest free open-source model - The only platform globally offering free 405B access, bar none - Custom RDU (Reconfigurable Dataflow Unit) chip acceleration, ultra-fast inference - 30 RPM rate limit, no total cap — thousands of calls per day - API keys start with sn-, OpenAI-compatible format Supported models: - LLaMA 3.1 405B (flagship, best for complex reasoning) - Llama 3.3 70B (best value) - DeepSeek R1/V3 (671B MoE) - Qwen 2.5 72B - More models added regularly 405B vs 70B difference: - Significantly better complex reasoning (math, logic, multi-step) - Stronger long-text understanding (128K context) - Higher code generation quality - More precise instruction following Requires proxy from China (use openllmapi.com). Ideal for developers needing large model capabilities on a budget.
SambaNova API has a recorded free trial: Free tier(永久免费); rate limit: 30 RPM.
SambaNova has a recorded free tier: 30 RPM (no total cap). Good for testing before upgrading.
SambaNova is recorded as supporting OpenAI-compatible API access. Free/trial info: Free tier(永久免费). Useful for low-cost testing by swapping SDK base_url.
SenseNova Token Plan beta is a lead for free DeepSeek-V4-Flash API access. Developers in China can test it for low-cost document handling, summarization, and simple agent subtasks. Current details come from a public article and platform entry; quotas and limits need re-verification.
SiliconFlow offers a 14-day free API trial for new users, supporting a variety of mainstream models, ideal for developers to quickly experience and test.
SiliconFlow API has a recorded free trial: ¥14; rate limit: Varies.
SiliconFlow offers 14 open-source model APIs completely free, including Qwen, DeepSeek, Llama. Direct China access, fast speed, OpenAI compatible. The most convenient free AI API for Chinese developers.
SiliconFlow has a recorded free tier: Varies by model. Good for testing before upgrading.
SiliconFlow is recorded as supporting OpenAI-compatible API access. Free/trial info: ¥14. Useful for low-cost testing by swapping SDK base_url.
iFlytek Spark API has a recorded free trial: 200万 tokens; rate limit: 5 RPM.
iFlytek Spark has a recorded free tier: No explicit limit. Good for testing before upgrading.
iFlytek Spark is recorded as supporting OpenAI-compatible API access. Free/trial info: 200万 tokens. Useful for low-cost testing by swapping SDK base_url.
StepFun API has a recorded free trial: ¥10; rate limit: 5 RPM.
StepFun is recorded as supporting OpenAI-compatible API access. Free/trial info: ¥10. Useful for low-cost testing by swapping SDK base_url.
textgen has recorded free compute or trial credits: 免费算力/额度. Useful for inference, deployment, or GPU experiments.
Tiangong AI API has a recorded free trial: Free tier; rate limit: Varies.
Together AI has recorded free compute or trial credits: $5 credits. Useful for inference, deployment, or GPU experiments.
Together AI offers $25 free API credits for new users, supporting 200+ open-source models. Key highlight: FLUX.1 Schnell Free image generation is completely free! - No credits consumed - Unlimited use - High-quality AI image generation - The only platform offering free high-quality AI image generation API LLM models: Llama 3.3 70B Turbo, Llama 4 Maverick, DeepSeek V3, Mixtral 8x22B, and 200+ more. API keys start with together-, OpenAI-compatible. base_url: https://api.together.xyz/v1 Requires proxy from China (use openllmapi.com).
Together AI API has a recorded free trial: $5(注册赠送); rate limit: Varies by model.
Together AI has a recorded free tier: Credit-based ($5 signup bonus). Good for testing before upgrading.
Together AI is recorded as supporting OpenAI-compatible API access. Free/trial info: $5(注册赠送). Useful for low-cost testing by swapping SDK base_url.
Vast.ai has recorded free compute or trial credits: $1 credits. Useful for inference, deployment, or GPU experiments.
Vidu API has a recorded free trial: $1; rate limit: N/A.
01.AI (Yi) API has a recorded free trial: ¥10; rate limit: 5 RPM.
01.AI (Yi) is recorded as supporting OpenAI-compatible API access. Free/trial info: ¥10. Useful for low-cost testing by swapping SDK base_url.
ChatGLM (Zhipu AI) API has a recorded free trial: 500万 tokens; rate limit: 5 RPM.
ChatGLM (Zhipu AI) has a recorded free tier: No explicit limit. Good for testing before upgrading.
ChatGLM (Zhipu AI) is recorded as supporting OpenAI-compatible API access. Free/trial info: 500万 tokens. Useful for low-cost testing by swapping SDK base_url.
🎁 Free Resource Pack
Free API credits list, AI business case studies, payment stack, risk checklist, and a monetization roadmap.