$2 Coupon
SiliconCloud offers new users a 14 RMB coupon for API calls, valid for 30 days.
AI DEAL COLLECTION
API credits, AI coding tools, cloud credits, and OpenAI-compatible routes for developers and indie builders.
API credits, AI coding tools, cloud credits, and OpenAI-compatible routes for developers and indie builders. It is useful for developers, indie hackers, and AI tool users who want to compare free credits, limits, and alternative routes quickly.
yangmao.ai refreshes free tiers, expiration dates, claim requirements, and accessibility signals through automated pipelines plus manual checks. Always verify the final claim page before use.
Check the same page for alternative providers, OpenAI-compatible APIs, China-friendly access, or evergreen free tiers instead of relying on one vendor.
SiliconCloud offers new users a 14 RMB coupon for API calls, valid for 30 days.
Cohere 为新用户提供100美元免费 API 额度,支持 Command R+ 等最新模型,适用于 RAG、摘要和分类任务,中国大陆需通过代理注册和使用。
DeepSeek released V3-0324 with improved performance and reasoning, API pricing unchanged.
DeepSeek V3 模型新注册用户赠送500万 token 免费额度,支持中文优化,中国大陆直接访问,无网络限制,适合文本生成和对话场景。
Google Gemini 2.5 Flash 模型提供免费 API 调用额度,每分钟最多1500次请求,适合开发者和中小应用集成,中国大陆可通过代理或 Google Cloud 端点访问。
Google launched Gemini 2.5 Flash, focusing on low latency and efficient inference with multimodal input.
Google launched Gemini 2.5 Flash with 1M token context, priced lower than Pro.
Groq added Llama 4 Scout and Llama 4 Maverick with ultra-low latency inference.
Groq 提供基于 LPU 推理引擎的免费 API,支持 Mixtral 8x7B 等模型,每日1440次请求限制,响应速度极快,中国大陆可通过代理访问。
Mistral AI 的 Le Chat 聊天机器人提供完全免费的无限对话额度,支持多语言和代码生成,无需绑定信用卡,中国大陆可直接访问网页版。
月之暗面 Kimi 大模型 API 新注册用户赠送 $10 额度,支持长上下文(128K),中国大陆可直接访问,适合文本生成和对话场景。
NVIDIA NIM microservices now support Llama 4 Scout and Maverick models, offering high-performance optimized inference for developers.
OpenAI announces significant price reduction for GPT-4.1 API, with input price dropping to $2 per million tokens and output to $8 per million tokens, approximately 26% cheaper than GPT-4o, offering developers more cost-effective AI capabilities.
OpenAI announces significant price reduction for GPT-4.1 API, with input price dropping to $2 per million tokens and output to $8 per million tokens, representing a 26%-50% decrease compared to GPT-4o.
OpenAI announces GPT-4.1 API price drop, with input price reduced to $2 per million tokens and output price reduced to $8 per million tokens, offering better value than GPT-4o.
OpenAI released the GPT-4.1 series, including GPT-4.1, GPT-4.1 mini, and GPT-4.1 nano, supporting up to 1 million token context windows with significantly reduced API pricing, offering developers more powerful and cost-effective AI capabilities.
OpenAI released the GPT-4.1 series, including GPT-4.1, GPT-4.1 mini, and GPT-4.1 nano, supporting up to 1 million token context windows with significantly reduced API pricing, offering developers more powerful and cost-effective AI capabilities.
OpenAI officially released the GPT-4.1 series, including GPT-4.1, GPT-4.1 mini, and GPT-4.1 nano, supporting up to 1 million token context windows, with API prices lower than GPT-4o, offering developers more powerful and cost-effective AI capabilities.
New users get 14 RMB (~$2) API credits for signing up, usable on multiple models.
SiliconCloud offers 20M free tokens for new users, supporting multiple models.
ActivePieces offers a free or open-source option: Open-source version free for self-hosting; cloud version offers free tier. Useful for low-cost developer testing.
LiveKit Agents offers a free or open-source option: Open-source framework is free; LiveKit cloud offers 1,000 free minutes/month. Useful for low-cost developer testing.
Free AI learning resource on yangmao.ai 原创: AI Coding — From Zero to Freelancing. Good for structured learning.
Free AI learning resource on 多平台整合: AI Monetization Guide — 10 Ways to Make Money with AI. Good for structured learning.
Free AI learning resource on yangmao.ai 原创: AI Presentations — Create Slides in 10 Minutes. Good for structured learning.
Free AI learning resource on yangmao.ai 原创: AI Side Hustle Guide for College Students. Good for structured learning.
Free AI learning resource on yangmao.ai 原创: AI Thesis Writing Guide — From Topic to Final Draft. Good for structured learning.
Simulates Gemini CLI, Antigravity, Codex, Grok, and Kiro client requests, compatible with the OpenAI API. Supports thousands of Gemini model requests per day with free built-in Claude model in Kiro. Easily connect to any client via API for efficient AI development.
Simulates Gemini CLI, Antigravity, Codex, Grok, and Kiro client requests, compatible with the OpenAI API. Supports thousands of Gemini model requests per day and offers free use of the built-in Claude model in Kiro. Easily connect to any client via the API, making AI development more efficient!
Aider offers a free or open-source option: Fully open-source and free, bring your own API key with no restrictions. Useful for low-cost developer testing.
Anthropic AI for Science Program is an official application-based API credits program for researchers attached to research institutions. Selected researchers can receive Anthropic API credits for high-impact scientific projects, especially biology / life sciences. The public page does not state a fixed amount and approval is reviewed by Anthropic.
Anthropic confirmed a Claude Code and Claude API quota boost on 2026-05-06: doubled five-hour Claude Code limits for Pro, Max, Team, and seat-based Enterprise plans, removal of peak-hours reduction for Pro/Max, and higher Claude Opus API rate limits. This is a quota boost rather than a free subscription.
Claude for Startups is an official application-based promo for venture-backed startups connected with Anthropic VC partners. It offers free API credits and priority rate limits, but the public page does not state a fixed credit amount. Official terms include regional restrictions, so do not position it as China-accessible.
Anthropic for Startups is a high-confidence official path for startup API credits and priority rate limits, but it is not an unconditional signup bonus. It targets VC-backed startups working with Anthropic VC partners; the credit amount is not publicly fixed and depends on Anthropic approval.
Anyscale API has a recorded free trial: $10 free credits; rate limit: 30 RPM.
Anyscale has a recorded free tier: credit-based. Good for testing before upgrading.
Anyscale is recorded as supporting OpenAI-compatible API access. Free/trial info: $10 free credits. Useful for low-cost testing by swapping SDK base_url.
Apfel offers a free or open-source option: 免费版. Useful for low-cost developer testing.
Athas offers a free or open-source option: 免费版. Useful for low-cost developer testing.
AutoDL has recorded free compute or trial credits: ¥10 credits. Useful for inference, deployment, or GPU experiments.
AutoGen offers a free or open-source option: Open-source project, self-hostable, no usage limits. Useful for low-cost developer testing.
Awesome n8n Templates offers a free or open-source option: Completely free, open-source template collection on GitHub. Useful for low-cost developer testing.
百川智能为新注册用户提供 100 万 token 免费 API 额度,支持 Baichuan4 系列模型,中国大陆直连,无需科学上网。
百川智能为 Baichuan4 模型提供新用户注册即送100万token免费API额度,支持中文优化,中国大陆直接访问,适合开发者快速集成。
Baichuan AI is recorded as China-friendly: Chinese platform, direct access.. Useful when you need access without complex network setup.
注册百川智能开放平台即送 100 万 token,支持 Baichuan4 和 Baichuan3-Turbo 模型,中国大陆直连,无需海外支付方式。
Baichuan AI API has a recorded free trial: 500万 tokens; rate limit: 5 RPM.
Baichuan AI has a recorded free tier: No explicit limit. Good for testing before upgrading.
百川智能为新注册用户提供 100万 token 免费额度,可用于调用 Baichuan4 系列模型 API,国内直连,注册即用,支持文本生成和对话场景。
Baichuan AI is recorded as open-source or providing open model resources. Useful for local deployment, customization, and low-cost evaluation.
Baichuan AI is recorded as supporting OpenAI-compatible API access. Free/trial info: 500万 tokens. Useful for low-cost testing by swapping SDK base_url.
百度千帆平台为注册用户提供每月 100 万 Token 的免费 API 额度,支持 ERNIE 系列模型,中国大陆直接访问,适合个人开发者和学生。
百度千帆大模型平台为新注册用户提供 100 万 token 的文本模型免费额度及 50 万次图片生成/理解额度,支持 ERNIE 系列模型,中国大陆用户可直接注册使用。
百度千帆大模型平台为新用户提供200万Token免费额度,支持ERNIE系列模型,国内直接访问,注册即可使用,无需海外环境。
百度千帆大模型平台为新用户提供100万Token免费调用额度(支持ERNIE 4.0、ERNIE Speed等),另赠50元体验金。中国大陆开发者可直接使用百度账号注册,API兼容OpenAI格式,迁移成本低。
百度千帆大模型平台为新用户提供 100 万 token 的免费调用额度,支持 ERNIE-Bot、ERNIE-Bot-turbo 等模型,中国大陆直接访问,注册即用,无需绑定支付方式。
百度千帆平台为新用户提供 ERNIE-Bot 系列模型免费调用额度,包含 100 万 tokens,支持 API 调用,中国大陆直接可用,无需海外支付方式。
百度千帆平台为新用户提供 ERNIE-Bot、ERNIE-3.5 等模型免费调用额度,每月基础免费额度充足,中国大陆直接使用,支持 SDK 和 REST API。
百度千帆平台近期调整免费政策,ERNIE-Bot、ERNIE-Bot-Turbo 等模型每日免费调用次数提升至 1000 次,注册即享,无需绑定银行卡,中国大陆开发者友好。
百度千帆大模型平台为新用户提供 200万 token 免费额度,支持 ERNIE-Bot、ERNIE-Bot-turbo 等模型,中国大陆网络直接使用,注册即送。
百度千帆大模型平台为新用户提供100万 token 免费额度,适用于 ERNIE 3.5 和 ERNIE 4.0 模型,支持文本生成、对话等场景。中国大陆直接访问,无需科学上网,注册即用。
Banana has recorded free compute or trial credits: trial credits. Useful for inference, deployment, or GPU experiments.
Bolt.new offers a free or open-source option: Free plan with limited daily tokens for generating and deploying apps. Useful for low-cost developer testing.
Free AI learning resource on 多平台整合: Build Your First LLM App with Free APIs. Good for structured learning.
Cerebras API has a recorded free trial: 1M tokens/day; rate limit: 30 RPM / 60K TPM / 1M TPD.
Cerebras uses proprietary WSE chips for the world's fastest inference (2000+ tokens/s, 20x faster than GPU). Free tier: 1M tokens/day, 30 RPM, no credit card. Models: Llama 3.3 70B, Llama 3.1 8B, Qwen 3.5, and more. OpenAI-compatible API. Best for latency-sensitive use cases: real-time chat, streaming, Agent tool calls. Competes with Groq on speed, but with a larger daily token budget.
Cerebras has a recorded free tier: 1M tokens/day. Good for testing before upgrading.
Cerebras is recorded as supporting OpenAI-compatible API access. Free/trial info: 1M tokens/day. Useful for low-cost testing by swapping SDK base_url.
ChatGPT (OpenAI) has a recorded free tier: Limited requests/day. Good for testing before upgrading.
ChatGPT (OpenAI) is recorded as supporting OpenAI-compatible API access. Free/trial info: $0. Useful for low-cost testing by swapping SDK base_url.
Free AI learning resource on DeepLearning.AI: ChatGPT Prompt Engineering for Developers. Good for structured learning.
Anthropic API API has a recorded free trial: $5; rate limit: 5 RPM.
A developer built a free local MCP server that significantly optimizes Claude Code's PR review process. The tool reduces token consumption per PR review from 63K to 8.7K, drastically lowering usage costs. Users need to set up the local server and integrate it into their Claude Code workflow. This solution is ideal for developers who frequently use Claude Code for code reviews.
Claude (Anthropic) has a recorded free tier: Limited messages/day. Good for testing before upgrading.
Reddit community users are compiling a hidden tips guide for Claude free tier users, focusing on advanced usage of Artifacts and Projects. These tips help users get a better experience within the free quota, including prompt optimization and using project features to manage conversation history. The guide is community-driven and continuously updated.
Cloudflare Workers AI is recorded as China-friendly: Direct access from China via Cloudflare edge network, low latency. Workers AI accelerated by global CDN.. Useful when you need access without complex network setup.
Cloudflare Workers AI API has a recorded free trial: 每天 10000 神经元(永久有效); rate limit: 10000 requests/day.
Cloudflare Workers AI has a recorded free tier: 10,000 free requests/day. Good for testing before upgrading.
Cloudflare Workers $5/mo plan includes Workers AI with 10,000 free AI calls per day (measured in neurons), permanently valid. 50+ open-source models: - LLM: Llama 3.1 8B, Llama 3.3 70B, Gemma, Mistral 7B, Phi-2 - Image generation: Stable Diffusion XL (completely free!) - Embeddings: BGE Base/Large (for RAG and semantic search) - Speech-to-text: Whisper Highlights: - Permanently valid, never expires - Inference on 300+ global edge nodes, ultra-low latency - Direct China access, no proxy needed - OpenAI-compatible via AI Gateway - Pay-as-you-go after free quota, no hard cutoff - If you already use Cloudflare Workers, this is essentially free Ideal for lightweight AI: blog writing, content tagging, summarization, embeddings, product image generation.
Cloudflare Workers AI is recorded as supporting OpenAI-compatible API access. Free/trial info: 每天 10000 神经元(永久有效). Useful for low-cost testing by swapping SDK base_url.
Code Relay is attractive because it may expose GPT-5.5 / Claude 4.7-style models through one relay, but it is still a third-party relay rather than an official provider. Security and long-term stability are materially weaker than official platforms. Treat it as a risk case or tiny test channel, not a production API.
Code2Prompt offers a free or open-source option: 免费版. Useful for low-cost developer testing.
Codeium offers a free or open-source option: Individual plan free forever with unlimited completions + AI chat, no credit card required. Useful for low-cost developer testing.
Cohere reduced Command R+ and Command R API prices by 50%, new Command R7B priced lower.
Cohere launched Command A for enterprise RAG and tool use, with updated API pricing. See official website for details.
Cohere released Command R7B with 7B parameters, focused on enterprise RAG and tool use, API price reduced.
Cohere API has a recorded free trial: 1000 calls/month; rate limit: Trial rate limits.
Cohere has a recorded free tier: 1,000 calls/month (Trial Key). Good for testing before upgrading.
新用户注册 Cohere 平台即获 $10 免费 API 额度,可用于 Command R+、Embed 等模型,支持 RAG 和分类任务,中国大陆需科学上网。
Cohere 为新注册用户提供 100 美元免费 API 额度,支持 Command R+、Embed 等模型,适合 RAG 和文本生成场景。需绑定信用卡验证身份,中国大陆用户可用虚拟卡。
Cohere offers a free Trial API Key with 1,000 calls/month across all models: - Command R+: top RAG and chat model - Rerank: document reranking for RAG pipelines - Embed: multilingual text embeddings No credit card required, resets monthly. Great for prototyping RAG projects. Note: Trial Key is not permitted for production use.
Cohere 为新注册用户提供 $20 免费 API 额度,可用于 Command R+、Embed 等模型,有效期 30 天,需绑定信用卡,中国大陆需科学上网。
Cohere 提供每月 100 万 token 免费额度,支持 Command R+、Embed 等模型,API 稳定,中国大陆需科学上网,适合 RAG 和文本生成场景。
Cohere 近期将免费试用额度从 40 万 token 提升至每月 100 万 token,支持 Command R、Embed 等模型 API,注册即享,中国大陆需科学上网访问。
ComfyUI-Copilot offers a free or open-source option: 免费版. Useful for low-cost developer testing.
Context7 offers a free or open-source option: Open-source, self-hostable or direct use, no limits. Useful for low-cost developer testing.
Continue offers a free or open-source option: Fully open-source and free, bring your own API key or use local models. Useful for low-cost developer testing.
Coze (ByteDance) is recorded as China-friendly: By ByteDance. China version coze.cn direct access. International coze.com needs proxy.. Useful when you need access without complex network setup.
Coze (ByteDance) API has a recorded free trial: Free tier; rate limit: Varies.
Coze offers a free or open-source option: China version (coze.cn) basic features free, international version (coze.com) has free quota. Useful for low-cost developer testing.
Coze (ByteDance) has a recorded free tier: No explicit limit. Good for testing before upgrading.
CrewAI offers a free or open-source option: Open-source framework, free to integrate, no commercial restrictions. Useful for low-cost developer testing.
Cursor is recorded as China-friendly: Cursor IDE accessible from China. AI features routed through Cursor servers.. Useful when you need access without complex network setup.
Cursor offers a free or open-source option: Free plan includes 2000 completions + 50 premium requests (GPT-4/Claude) per month. Useful for low-cost developer testing.
Cursor has a recorded free tier: 2000 completions + 50 premium requests/mo. Good for testing before upgrading.
Databricks announces the integration of OpenAI's GPT-5.5 model into its enterprise agent workflows. The model is designed for complex tasks, supporting multi-step reasoning and automated actions. Enterprise users can directly invoke it through the Databricks platform without additional configuration. This update marks a further expansion of OpenAI models in enterprise applications.
DeepSeek's official docs confirm a capacity expansion request path for API accounts that need higher concurrency than the default limits. DeepSeek matches appropriate concurrency based on submitted business needs, with no additional cost for capacity expansion. This is for teams or businesses needing higher DeepSeek V4 Pro / V4 Flash concurrency; it is not free token credit and is not automatic access.
DeepSeek 为新注册用户提供 500 万 token 免费 API 额度(含对话和代码模型),支持中国大陆直接访问,无需海外信用卡。
DeepSeek is recorded as China-friendly: Chinese service, direct access in China, very fast, no proxy needed. Useful when you need access without complex network setup.
注册即送 500 万 token,支持 DeepSeek-V2 和 DeepSeek-Coder 模型,兼容 OpenAI API 格式,中国大陆直连可用,无信用卡要求。
新注册用户可获得 500 万 token 免费额度,支持 DeepSeek-V2 和 DeepSeek-Coder 模型,中国大陆可直接访问。
DeepSeek 为新注册用户提供 500 万 token 的免费 API 额度(含输入和输出),支持 DeepSeek-V2 等模型,中国大陆可直接访问,无需海外信用卡。
DeepSeek API has a recorded free trial: $5; rate limit: 2 RPM.
DeepSeek 为新注册用户提供 500 万免费 tokens,支持 DeepSeek-V2 和 DeepSeek-Coder 模型,API 兼容 OpenAI 格式,中国大陆可直接访问,无需海外信用卡。
DeepSeek 为新注册用户提供 500万 token 的免费 API 调用额度,支持 DeepSeek-V2 和 DeepSeek-Coder 模型,中国大陆可直接访问,无需海外信用卡。
DeepSeek offers 50 free inferences daily (V3 + R1 models) plus $5 API credits on signup. R1 reasoning model excels at math and code, one of the best free AI options available.
DeepSeek increased free user daily conversation limit from 50 to 100, offering more free usage quota.
DeepSeek has a recorded free tier: 50 requests/day. Good for testing before upgrading.
DeepSeek continues to offer free API credits, new users receive 5 million tokens upon registration, allowing immediate use without payment.
DeepSeek 为新注册用户提供 500 万 token 的免费额度(含输入和输出),可用于 DeepSeek-V3 和 DeepSeek-R1 模型 API,有效期 30 天,支持中国大陆直接访问,无需翻墙。
DeepSeek 为新注册用户提供 500 万 Token 免费额度,可用于 DeepSeek-V2 和 DeepSeek-Coder 系列模型 API 调用,支持文本生成与代码补全,中国大陆直接访问,无需翻墙。
DeepSeek 为新注册用户提供500万Token免费额度,可用于其最新大模型API调用,支持文本生成、代码编写等,中国大陆可直接访问注册,无需海外信用卡。
Free AI learning resource on DeepSeek 官网: DeepSeek Official Tutorial. Good for structured learning.
DeepSeek is recorded as open-source or providing open model resources. Useful for local deployment, customization, and low-cost evaluation.
DeepSeek is recorded as supporting OpenAI-compatible API access. Free/trial info: $5. Useful for low-cost testing by swapping SDK base_url.
DeepSeek announced R1 model API pricing at $0.14/M tokens input and $0.28/M tokens output, highly competitive.
DeepSeek officially released the R1 reasoning model, achieving performance comparable to OpenAI o1 on multiple benchmarks, while offering more competitive API pricing, providing developers with cost-effective reasoning capabilities.
DeepSeek launched R1 reasoning model for complex tasks with competitive API pricing.
DeepSeek released an update to R1 reasoning model, improving math and code reasoning, API price unchanged.
新注册 DeepSeek 平台即赠送 500 万 token 免费额度,可用于调用 DeepSeek-V2 等模型 API,支持中国大陆网络直接使用,无需海外信用卡。
DeepSeek released V3-0324 with improved performance and reasoning, same API pricing.
DeepSeek-V3 input price dropped to $0.27/M tokens, output price dropped to $1.10/M tokens, applicable to all API users.
新注册用户赠送500万token免费额度,支持 DeepSeek V3 模型,中国大陆直接使用,无需翻墙。
DeepSeek-V4 is officially released with a million-token context window, greatly enhancing long-text processing capabilities. The model is optimized for agent applications, supporting more complex multi-step reasoning and tool calling. Developers can use it for free via the API with no additional cost. It is one of the longest-context open-source models available, suitable for document analysis, codebase understanding, and more.
The DeepSeek V4 Pro pricing transition is scheduled around May 31, 2026. The current 25%-of-list promotional window moves to the announced 1/4-of-list pricing basis, so users should verify the exact input/output rates in the DeepSeek console. Plan usage costs before high-volume API workloads.
Dify offers a free or open-source option: Open-source version fully free to self-host; cloud version with 200 GPT-4 calls per month. Useful for low-cost developer testing.
Domestic open-source large language model downloads have exceeded 10 billion, marking the vigorous growth of China's open-source AI ecosystem. These models include open-source versions released by multiple well-known vendors and institutions, covering different parameter scales from lightweight to large. Users can download model weights for free for academic research, commercial applications, or further development. This milestone reflects the widespread recognition and adoption of domestic AI technology by the open-source community.
Doubao (ByteDance) is recorded as China-friendly: Chinese platform by ByteDance, direct access, fast. Servers across China.. Useful when you need access without complex network setup.
Doubao (ByteDance) API has a recorded free trial: 50万 tokens; rate limit: 5 RPM.
Doubao (ByteDance) has a recorded free tier: No explicit limit. Good for testing before upgrading.
Doubao (ByteDance) is recorded as supporting OpenAI-compatible API access. Free/trial info: 50万 tokens. Useful for low-cost testing by swapping SDK base_url.
Easy-Dataset offers a free or open-source option: Open source and free to use, no quota limits. Useful for low-cost developer testing.
ElevenLabs API has a recorded free trial: 10K chars/month; rate limit: Varies.
ElevenLabs has a recorded free tier: 10,000 characters/month. Good for testing before upgrading.
ERNIE Bot (Baidu) is recorded as China-friendly: By Baidu, direct access in China, fast and enterprise-stable.. Useful when you need access without complex network setup.
ERNIE Bot (Baidu) API has a recorded free trial: Free tier; rate limit: 5 RPM.
ERNIE Bot (Baidu) has a recorded free tier: No explicit limit. Good for testing before upgrading.
fal.ai API has a recorded free trial: Promotional credits; rate limit: N/A.
fal.ai has a recorded free tier: Promotional credits on signup. Good for testing before upgrading.
Fireworks AI 提供每日 100 万 token 免费额度,支持 Llama 3、Mixtral、Gemma 等主流开源模型。API 兼容 OpenAI 格式,中国大陆可直连,适合原型开发和轻量应用。
提供高速推理 API,支持 Llama、Qwen 等开源模型。新用户有每日免费的 token 额度,适用于开发和测试。
Fireworks AI API has a recorded free trial: $1 free credits; rate limit: 600 RPM.
Fireworks AI has a recorded free tier: 600 RPM. Good for testing before upgrading.
Fireworks AI is recorded as supporting OpenAI-compatible API access. Free/trial info: $1 free credits. Useful for low-cost testing by swapping SDK base_url.
FLUX (Black Forest Labs) API has a recorded free trial: Free via platforms; rate limit: Varies.
FLUX (Black Forest Labs) has a recorded free tier: Free via third-party platforms. Good for testing before upgrading.
FLUX (Black Forest Labs) is recorded as open-source or providing open model resources. Useful for local deployment, customization, and low-cost evaluation.
FreeLLMAPI offers a free or open-source option: Fully open-source, aggregates free tiers from 14 providers, ~1.3B tokens/month. Useful for low-cost developer testing.
FreeModel is the lower-friction option in this GPT-5.5 free trial batch: no card, quick signup, and useful for light testing. The key unknowns are whether the model is truly native GPT-5.5, whether the weekly quota resets reliably, and long-term service stability.
The Gemini API free tier is suitable for developers, small projects, and prototypes. Actual free rate limits vary by model, project, and billing tier, so users should confirm current limits in AI Studio.
Gemini (Google) has a recorded free tier: No explicit limit. Good for testing before upgrading.
Local deployment resource for Gemma 4 31B: MLX and GGUF variants, Mac memory requirements, Ollama/LM Studio routes, and safety notes.
Gemma 4 offers a free or open-source option: Fully open-source, Apache 2.0 license, commercial use allowed. Useful for low-cost developer testing.
GitHub Copilot is recorded as China-friendly: Direct access from China. Supports VS Code and JetBrains.. Useful when you need access without complex network setup.
GitHub Education is one of the most reliable education-benefit paths for AI coding. Verified students and teachers can receive Copilot-related benefits for long-term learning and development.
GitHub Copilot offers a free or open-source option: Free plan with 2000 completions + 50 chats per month. Free for students and OSS maintainers. Useful for low-cost developer testing.
GitHub Copilot Free is the official free tier for AI coding tools: 2,000 completions and 50 agent/chat requests per month, with no credit card required according to GitHub’s pricing page.
GLHF.chat 提供 Llama、Mistral 等开源模型的免费 GPU 推理服务,注册即送每月 25 美元额度,无需绑定信用卡。支持中国大陆网络访问,适合低成本运行大模型。
Golutra offers a free or open-source option: 免费版. Useful for low-cost developer testing.
Google AI Edge Gallery has released v1.0.13 and v1.0.14 updates, adding support for Gemma 4 multi-token prediction models for more efficient on-device inference. The update also introduces Pixel TPU acceleration to boost model performance. Additionally, experimental MCP (Model Context Protocol) support, new skills, and chat history saving are now available, enhancing the practicality and user experience of edge AI.
Google Antigravity still has official free weekly limits and Pro/Ultra higher-limit signals, but the Ultra USD $100 bonus-credit sub-offer expired on 2026-05-25 and is no longer presented as claimable.
Google AI (Gemini) is recorded as China-friendly: Partially accessible, VPN recommended for stability. Useful when you need access without complex network setup.
Google Cloud's official $300 free credit offer for new customers can support AI API and cloud POC workflows. Eligibility and regional availability should be checked in the Google Cloud signup flow.
Google AI (Gemini) API has a recorded free trial: 免费 API 无需信用卡; rate limit: 15 RPM (Flash).
Google AI (Gemini) has a recorded free tier: Gemini free tier unlimited. Good for testing before upgrading.
Google 最新 Gemini 2.5 Pro 模型提供免费 API 层,每分钟最多2次请求,无需付费即可体验长上下文推理能力,适合开发测试和小型应用。
Google offers Gemini 2.5 Flash for free in AI Studio, with lower rate limits compared to the paid tier.
Google has updated the Gemini free tier quota. The Gemini 2.5 Flash model is now free on AI Studio with a rate limit of 30 requests per minute.
Google AI Studio free tier now includes Gemini 2.5 Flash, offering daily free quota for development and testing without any cost.
Google made Gemini 2.5 Flash GA with 1M token context, priced lower than Pro.
Google released Gemini 2.5 Flash with 1M token context, priced lower than Pro.
Google released Gemini 2.5 Flash with 1M token context, priced lower than Pro.
Google officially launched Gemini 2.5 Flash, supporting up to 1M token context window, priced lower than Pro, designed for developers with efficient reasoning capabilities.
Google released Gemini 2.5 Flash with 1M token context, priced lower than Pro.
Google officially launches the Gemini 2.5 Flash model, supporting up to 1M token context window with input pricing at just $0.15/M tokens, offering developers a cost-effective long-context AI capability.
Google officially released the Gemini 2.5 Flash model, supporting multimodal input and up to 1 million token context window, with strong performance and pricing lower than the Pro version, suitable for a wide range of applications.
Google launched Gemini 2.5 Flash with 1M token context, priced at $0.15/1M input tokens.
Google officially released Gemini 2.5 Flash, supporting up to 1M token context window, with significantly better performance than the previous 2.0 Flash at a lower price, ideal for large-scale inference and long-context processing.
Google releases Gemini 2.5 Flash preview with 1M token context, lower pricing than Pro.
Google has adjusted Gemini API pricing. The Gemini 2.5 Flash model now costs $0.15/M tokens for input and $0.60/M tokens for output, making it highly competitive.
Google announced the pricing for Gemini 2.5 Flash, with input at $0.15/M tokens and output at $0.60/M tokens, significantly cheaper than the Pro version.
Gemini 2.5 Flash input $0.15/M tokens, output $0.60/M tokens, very cost-effective.
Gemini 1.5 Flash 和 Gemini 1.5 Pro 模型免费层,每分钟 60 次请求,无需付费即可使用,中国大陆开发者可通过代理访问。
Google increased Gemini API free tier rate limit to 30 requests per minute, supporting Gemini 2.0 Flash model, ideal for developers and personal projects.
The official Gemini API / AI Studio no-card free tier now has an additional entry: beyond Gemini API Free Tier input/output tokens, Google's I/O 2026 Blog confirms that new AI Studio builders can deploy their first two apps to Google Cloud at no cost with no credit card required. Production use, higher limits, or projects with billing already enabled still follow Cloud Run / Paid Tier rules.
Google Gemini API 提供永久免费套餐,支持 Gemini 1.5 Flash 和 Gemini 1.5 Pro 模型,每分钟最多 60 次请求,无每日 token 上限,适合个人开发者和学习使用。中国大陆需科学上网。
Google significantly increased the Gemini Code Assist free tier from 2,000 to 180,000 code completions per month, supporting VS Code and JetBrains IDEs, providing developers with more powerful AI-assisted coding capabilities.
Google Gemini API 提供免费层,支持 Gemini 1.5 Pro 和 Flash 模型,每分钟最多 60 次请求,无需付费即可使用多模态能力,中国大陆需代理访问。
Google Gemini API 提供免费层级,每分钟最多60次请求,支持 Gemini 1.5 Flash 和 Gemini 1.5 Pro 模型,中国大陆开发者可通过代理或直接访问(部分地区可用)。无需绑定信用卡即可开始使用。
Google increased Gemini free tier context from 32k to 1M tokens and raised daily request limits, significantly enhancing the free user experience.
Google has announced the shutdown of its free search index, meaning AI applications and developers relying on web search can no longer access real-time search results for free. Traffic defense services like Cloudflare are also intensifying blocking of AI crawlers, further complicating web search. Users need to seek alternatives such as Bing API, DuckDuckGo, or self-built crawlers, though costs and technical barriers may increase.
Gorilla has recorded free compute or trial credits: unlimited. Useful for inference, deployment, or GPU experiments.
On May 11, 2026, OpenAI released GPT-5.5 and the cybersecurity-focused GPT-5.5-Cyber model. This model series enhances trusted access capabilities, suitable for security analysis, threat detection, and automated response scenarios. The new models offer improved reasoning accuracy and safety, providing enterprises and security teams with a more reliable AI assistant.
OpenAI has released the GPT-5.5 Instant model, the latest iteration of the GPT series. This model is optimized for low-latency responses, suitable for applications requiring real-time interaction. Users can access it directly via the OpenAI API without additional application. Specific pricing and free tier details have not been announced yet; please follow official documentation for updates.
OpenAI has released the GPT-5.5 system card, marking the arrival of a new generation model. The model features significant improvements in reasoning, coding, and multimodal capabilities. Specific pricing and free tier details have not been announced yet, but it is expected to follow the tiered pricing strategy of the GPT series. Users can experience the new model via OpenAI API or ChatGPT.
OpenAI officially releases GPT-5.5 and GPT-5.5-Cyber models, the latest upgrade in the GPT series. GPT-5.5-Cyber is specifically designed for cybersecurity, offering enhanced trusted access control features for threat detection, vulnerability analysis, and more. The model helps enterprises better protect sensitive data and systems through strengthened security mechanisms.
Grok (xAI) API has a recorded free trial: $25/月; rate limit: Varies.
Grok (xAI) has a recorded free tier: Limited requests/day. Good for testing before upgrading.
xAI's Grok gives $25 API credits monthly, auto-reset. Supports Grok-2 models with OpenAI compatible format. One of the highest monthly free API credits available.
Grok (xAI) is recorded as open-source or providing open model resources. Useful for local deployment, customization, and low-cost evaluation.
Grok (xAI) is recorded as supporting OpenAI-compatible API access. Free/trial info: $25/月. Useful for low-cost testing by swapping SDK base_url.
Groq launched DeepSeek R1 671B on its platform for high-speed inference, available via Groq API or interface.
Groq added DeepSeek R1 model with fast inference, available on free tier.
Groq 提供基于 LPU 推理引擎的免费 API,支持 Llama 3、Mixtral 等模型,每日 1440 次请求限制,速度极快。需海外邮箱注册,中国大陆可访问但需翻墙。
Groq 提供每日100万Token免费API调用额度,基于其自研LPU芯片实现极速推理(支持Llama 3、Mixtral等模型)。注册需海外邮箱,但API中国大陆可直连,适合低延迟场景。
Groq 提供基于 LPU 推理引擎的免费 API,支持 Llama 3、Mixtral 等模型,每天最多 1440 次请求,中国大陆可直连,适合低延迟推理测试。
Groq 提供完全免费的 API 访问,支持 Llama 3、Mixtral 等开源模型,速率限制为 30 次/分钟,无总量上限。中国大陆用户需自行解决网络访问问题,注册无需信用卡。
Groq API has a recorded free trial: Free tier(永久免费); rate limit: 30 RPM / 6000 TPM.
Groq is one of today's most useful free inference deals: the free tier lets developers test Llama, Mixtral, Gemma and other models through an OpenAI-compatible API. It is best for AI agents, RAG summarization, and low-latency chat prototypes. China access may require additional verification or a relay.
Groq 提供免费 API 额度,支持 Llama 3、Mixtral 等开源模型,推理速度极快,每日有限免费调用次数,注册即用,中国大陆需科学上网。
Groq uses custom LPU (Language Processing Unit) chips for the fastest AI inference in the industry. Free models: - Llama 3.3 70B Versatile — 6000 TPM / 30 RPM - Llama 4 Scout 17B — 6000 TPM / 30 RPM - Llama 4 Maverick 17B — 6000 TPM / 30 RPM - Mixtral 8x7B — 5000 TPM / 30 RPM - Gemma 2 9B — 15000 TPM / 30 RPM - DeepSeek R1 Distill Llama 70B — 6000 TPM / 30 RPM Highlights: - 10x+ faster than GPU solutions, Llama 3.3 70B reaches 300+ tokens/sec - API keys start with gsk_, OpenAI-compatible - No total cap, rate-limited only - Requires proxy from China (use openllmapi.com)
Groq 将免费套餐的每日 API 请求上限从 500 次提升至 1000 次,支持 Llama 3、Mixtral 等开源模型,中国大陆开发者可直接通过 API 调用,无需绑定信用卡。
Groq uses proprietary LPU (Language Processing Unit) chips for the world's fastest AI inference. Free tier requires no credit card. Free tier details: - Llama 3.3 70B: 30 RPM, 6000 tokens/min, 14400 requests/day - Llama 3.1 8B: 30 RPM, 20000 tokens/min - Gemma 2 9B: 30 RPM, 15000 tokens/min - Mixtral 8x7B: 30 RPM, 5000 tokens/min - Llama 4 Scout/Maverick (newly added) Why Groq is so fast: - Custom LPU chip designed specifically for LLM inference - Deterministic execution, no GPU memory bandwidth bottleneck - Llama 3.3 70B output at 300+ tokens/s (GPU typically 30-50 tokens/s) - Ultra-low time-to-first-token, ideal for real-time chat and streaming Best for: - Real-time AI chat (speed is the core experience) - Agent tool calls (low latency = faster multi-step reasoning) - Streaming output (buttery smooth typewriter effect) - Rapid prototyping China accessible. OpenAI-compatible API, base_url is https://api.groq.com/openai/v1.
Groq has a recorded free tier: 6000 tokens/min (Llama 3.3 70B). Good for testing before upgrading.
Groq free tier users can now access Llama 4 Scout and Maverick with rate limits.
Groq free tier rate limit reduced from 30 RPM to 20 RPM, but daily request cap increased, suitable for light usage.
Groq free tier rate limits adjusted, daily request caps reduced for some models. See official docs for details.
Groq increased free tier API rate limit from 30 to 60 requests per minute for more models.
Groq increased free tier API rate limits for more concurrent requests, ideal for developer testing and prototyping.
Groq increased free tier rate limit to 60 requests per minute, suitable for dev testing.
Groq increased API rate limits for free tier users, allowing more concurrent requests.
Groq increased free tier API rate limit to 60 requests per minute for models like Llama 3.
Groq free tier daily request limit increased to 1440, supporting more models including Llama 4 series, ideal for developers testing and lightweight applications.
Groq increased free tier rate limit from 30 to 60 requests per minute, supporting Llama 3 and Mixtral models for API calls.
Groq has added the Llama 3.3 70B model to its platform, available for inference on the free tier.
Groq deployed Meta's Llama 4 Scout and Llama 4 Maverick models on its platform for low-latency inference, with a free tier available.
Groq Cloud now offers Meta Llama 4 Scout and Llama 4 Maverick models, available via the free tier for high-speed inference.
Groq deployed Meta Llama 4 Scout and Llama 4 Maverick models on its platform for fast inference.
Groq launches Llama 4 Scout and Llama 4 Maverick models, offering free API credits for users to try the new models.
Groq deployed Meta's Llama 4 Scout and Llama 4 Maverick models with free API access.
Groq Cloud now hosts Meta Llama 4 Scout and Llama 4 Maverick for fast inference.
Groq 于2026年4月底上线Mixtral 8x7B免费推理服务,每日500次请求,无需信用卡,API兼容OpenAI格式,中国大陆开发者可直接调用。
Groq 提供 Mixtral 8x7B 等模型的免费 API 访问,速率限制为每分钟30次请求,适合快速原型开发。中国大陆需通过代理访问。
Groq 提供基于 LPU 的高速推理服务,Mixtral 8x7B 模型每日免费额度高达100万token,注册即用,中国大陆可直接访问 API。
Groq is recorded as supporting OpenAI-compatible API access. Free/trial info: Free tier(永久免费). Useful for low-cost testing by swapping SDK base_url.
Haystack offers a free or open-source option: Open-source project, self-hostable, no usage limits. Useful for low-cost developer testing.
HexStrike AI offers a free or open-source option: 免费版. Useful for low-cost developer testing.
HuggingFace AI Agents Course offers a free or open-source option: Completely free, certificate on completion. Useful for low-cost developer testing.
HuggingFace LLM Course offers a free or open-source option: Completely free, uses HuggingFace ecosystem tools. Useful for low-cost developer testing.
HuggingFace MCP Course offers a free or open-source option: Completely free, certificate on completion. Useful for low-cost developer testing.
HFViewer is recorded as China-friendly: The site is directly accessible; model metadata depends on Hugging Face and may require a proxy in some regions.. Useful when you need access without complex network setup.
HFViewer has a recorded free tier: Free web access. Good for testing before upgrading.
Hugging Face is recorded as China-friendly: Accessible from China (proxy may be needed in some regions). hf-mirror.com is a Chinese mirror.. Useful when you need access without complex network setup.
Hugging Face API has a recorded free trial: Free tier; rate limit: Varies.
Hugging Face released a free AI coding assistant on Spaces for code generation and debugging, helping developers boost coding productivity.
Hugging Face released a free AI coding assistant powered by open-source models, supporting code generation and debugging at no cost.
Hugging Face launched a free AI coding assistant based on StarCoder2, with a VS Code extension to help developers boost coding productivity.
Hugging Face launched a free inference API supporting multiple open-source models, no credit card required, with 30,000 free inference requests per month.
Hugging Face launched a free inference API supporting thousands of open-source models with daily limits, ideal for developers to test and integrate.
Hugging Face launches a free inference API supporting multiple models, available at no cost.
Hugging Face has a recorded free tier: Varies by model. Good for testing before upgrading.
Hugging Face 提供 Inference API 免费套餐,每月 3 万次调用,支持数千个开源模型(文本、图像、音频等),中国大陆可访问但速度较慢,适合学习和实验。
Hugging Face 提供免费推理 API,可调用数千个社区模型(包括文本、图像、音频等),中国大陆可直接访问,无需付费。
Free AI learning resource on Hugging Face: Hugging Face NLP Course. Good for structured learning.
Hugging Face is recorded as open-source or providing open model resources. Useful for local deployment, customization, and low-cost evaluation.
Hugging Face released SmolVLM, a lightweight vision-language model, completely free and open-source, suitable for local deployment and edge computing.
Hugging Face increased free GPU hours on Spaces from 10 to 20 per month, allowing users to run AI apps and demos for longer.
Tencent Hunyuan is recorded as China-friendly: By Tencent, direct access in China, enterprise-stable.. Useful when you need access without complex network setup.
Tencent Hunyuan API has a recorded free trial: 100万 tokens; rate limit: 5 RPM.
Tencent Hunyuan has a recorded free tier: No explicit limit. Good for testing before upgrading.
Tencent Hunyuan is recorded as open-source or providing open model resources. Useful for local deployment, customization, and low-cost evaluation.
Tencent Hunyuan is recorded as supporting OpenAI-compatible API access. Free/trial info: 100万 tokens. Useful for low-cost testing by swapping SDK base_url.
Ideogram has a recorded free tier: 10 images/day. Good for testing before upgrading.
Kimi (Moonshot AI) is recorded as China-friendly: Chinese service, direct access. Web version free and unlimited, one of the most popular AI assistants in China.. Useful when you need access without complex network setup.
Kimi (Moonshot AI) API has a recorded free trial: ¥15 + 充 $5 送 $5; rate limit: 3 RPM.
月之暗面(Moonshot AI)为 Kimi 大模型 API 新用户提供100万 token 免费额度,支持长上下文(128K),中国大陆直接访问,无需代理。注册即送,可用于对话、文档分析等场景。
Kimi (Moonshot AI) has a recorded free tier: No explicit limit. Good for testing before upgrading.
Kimi (Moonshot AI) is recorded as supporting OpenAI-compatible API access. Free/trial info: ¥15 + 充 $5 送 $5. Useful for low-cost testing by swapping SDK base_url.
Kiro offers a free or open-source option: Free plan with limited AI interactions per month. Useful for low-cost developer testing.
Official Kiro signup credit for new users: get up to $20 toward your first paid-plan upgrade. Kiro Pro includes premium models such as Claude Opus 4.7, Opus 4.6, and Sonnet 4.6. A card is required, so use a secondary/virtual card and confirm or cancel auto-renewal after activation.
Kling AI (Kuaishou) is recorded as China-friendly: By Kuaishou, direct access in China, fast. Leading video generation quality domestically.. Useful when you need access without complex network setup.
Kling AI (Kuaishou) has a recorded free tier: 66 credits/day. Good for testing before upgrading.
Lambda Cloud has recorded free compute or trial credits: 无免费额度,但价格有竞争力. Useful for inference, deployment, or GPU experiments.
Free AI learning resource on DeepLearning.AI: LangChain for LLM Application Development. Good for structured learning.
LangChain4j offers a free or open-source option: Open source and free under MIT license. Useful for low-cost developer testing.
DGX Cloud Lepton (formerly Lepton AI) is recorded as China-friendly: Founded by Chinese-American team, good China access. API is directly accessible.. Useful when you need access without complex network setup.
DGX Cloud Lepton (formerly Lepton AI) API has a recorded free trial: $10 free credits; rate limit: 10 RPM.
DGX Cloud Lepton (formerly Lepton AI) has a recorded free tier: 10M tokens/day. Good for testing before upgrading.
DGX Cloud Lepton (formerly Lepton AI) is recorded as supporting OpenAI-compatible API access. Free/trial info: $10 free credits. Useful for low-cost testing by swapping SDK base_url.
LM Studio is recorded as China-friendly: Runs locally, no network needed. May need proxy to download software and models.. Useful when you need access without complex network setup.
LM Studio API has a recorded free trial: Unlimited; rate limit: Local.
LM Studio has a recorded free tier: Unlimited (runs locally). Good for testing before upgrading.
LM Studio is recorded as supporting OpenAI-compatible API access. Free/trial info: Unlimited. Useful for low-cost testing by swapping SDK base_url.
Lovable offers a free or open-source option: Free plan with 5 projects + limited messages per month. Useful for low-cost developer testing.
Luma (Dream Machine) has a recorded free tier: Free trial credits. Good for testing before upgrading.
Free AI learning resource on yangmao.ai 原创: Master Midjourney Prompts — V7 Formula & Style Guide. Good for structured learning.
Free AI learning resource on yangmao.ai 原创: Master SD & FLUX Prompts — Free AI Image Generation Guide. Good for structured learning.
Mastra offers a free or open-source option: Open-source project, self-hostable, no usage limits. Useful for low-cost developer testing.
mcp-go offers a free or open-source option: Open source project, completely free to use. Useful for low-cost developer testing.
Midjourney has a recorded free tier: 25 images/day (limited time). Good for testing before upgrading.
Million Engine is recorded as China-friendly: Direct China access, optimized for Chinese developers, low latency. Useful when you need access without complex network setup.
Million Engine is recorded as supporting OpenAI-compatible API access. Free/trial info: 按量付费. Useful for low-cost testing by swapping SDK base_url.
MiniMax is recorded as China-friendly: Chinese platform, direct access, fast.. Useful when you need access without complex network setup.
MiniMax为新注册用户提供100万Token免费体验额度,支持abab系列模型,中国大陆用户可直接使用,注册无需海外信用卡。
MiniMax API has a recorded free trial: ¥15; rate limit: Varies.
MiniMax has a recorded free tier: No explicit limit. Good for testing before upgrading.
MiniMax is recorded as open-source or providing open model resources. Useful for local deployment, customization, and low-cost evaluation.
MiniMax is recorded as supporting OpenAI-compatible API access. Free/trial info: ¥15. Useful for low-cost testing by swapping SDK base_url.
Mistral AI 于2026年4月更新免费政策,Le Chat 平台每月提供100万token免费额度,支持Mistral Large 2模型,中国大陆可直连。
Mistral AI 的 Le Chat 聊天应用提供免费无限对话,支持 Mistral Large 等模型,中国大陆可直接访问网页版,无需注册即可使用基础功能。
Mistral offers free API trial credits for new users. After registration, you can check the specific amount in the console, ideal for trying Mistral's AI models.
Mistral AI API has a recorded free trial: Free tier; rate limit: 1 RPM.
Mistral AI 为新用户提供 500 万 token 免费 API 额度,支持 Mistral Large、Small 等模型,中国大陆可注册但需海外邮箱。
Mistral AI’s official free API entry point is the Experiment plan: free for evaluation and prototyping, with limited rate limits; production or higher usage requires the Scale plan.
Mistral AI 提供免费开发者计划,每月 50 万 token 的 API 调用额度,支持 Mistral Large、Mistral Small 等模型,中国大陆需科学上网。
Mistral Small 3.1 model has been added to the free tier, developers can use the API for free with a daily quota of 5 million tokens.
Mistral AI has a recorded free tier: No explicit limit. Good for testing before upgrading.
Mistral AI 为新注册用户提供 50 万 token 免费额度,可用于调用 Mistral Large、Mistral Small 等模型,支持文本生成和代码能力。中国大陆用户需自行解决网络访问,注册需邮箱验证。
Mistral AI’s official free API entry point is the Experiment plan: free for evaluation and prototyping, with limited rate limits; production or higher usage requires the Scale plan.
新注册用户赠送 €10 API 额度,可用于 Mistral Large 等模型,支持中国大陆邮箱注册,需绑定国际信用卡。
Mistral AI 的 Le Chat 平台提供免费层,支持无限次对话、文件上传(图像、PDF、Word、Excel)和网络搜索,无需付费。中国大陆可直接访问网页版。
Mistral AI 推出的 Le Chat 聊天助手提供每日100次免费对话额度,使用自家 Mistral Large 模型,支持中文。可通过网页或 API 使用,注册即享,无需付费。中国大陆可正常访问。
Mistral released Mistral Small 3.1 with optimized inference speed and efficiency, suitable for applications requiring fast responses.
Mistral AI is recorded as open-source or providing open model resources. Useful for local deployment, customization, and low-cost evaluation.
Mistral AI is recorded as supporting OpenAI-compatible API access. Free/trial info: Free tier. Useful for low-cost testing by swapping SDK base_url.
Mistral launched Small 3.1 with 128k context window and reduced API pricing, offering developers more cost-effective inference capabilities.
Modal has recorded free compute or trial credits: $30/month credits. Useful for inference, deployment, or GPU experiments.
注册月之暗面开放平台即送 1500 万 token,支持 Kimi 长上下文模型(128K),中国大陆直连,适合长文本处理任务。
新注册用户获赠 1500 万 token 免费额度,可用于 Kimi 大模型 API,支持长上下文(128K),中国大陆网络直接使用。
月之暗面(Moonshot AI)为新注册用户提供 100 万免费 tokens,支持长上下文模型,API 兼容 OpenAI 格式,中国大陆直接使用。
月之暗面 Moonshot 为新注册用户提供 150万 token 的免费 API 额度,支持 Moonshot-v1 模型,中国大陆可直接访问,适合长文本处理。
月之暗面 Kimi 大模型 API 新用户注册即送 1500万 token 免费额度(约 15元),支持长上下文模型,中国大陆直连,适合开发者和个人使用。
月之暗面 Kimi 为新注册开发者提供 100 万 Token 免费额度,支持长上下文模型,中国大陆直接使用,无需海外信用卡。
月之暗面 Kimi 大模型为新注册开发者提供 500 万 token 的免费 API 调用额度,支持长上下文模型,中国大陆网络可直接使用,适合构建对话和文本处理应用。
n8n offers a free or open-source option: Open-source version fully free to self-host; cloud version free with 5 workflows. Useful for low-cost developer testing.
Notion AI is recorded as China-friendly: Notion accessible from China (occasionally unstable in some regions).. Useful when you need access without complex network setup.
Notion AI has a recorded free tier: Limited requests. Good for testing before upgrading.
Novita AI API has a recorded free trial: $0.50 free credits; rate limit: 60 RPM.
Novita AI has a recorded free tier: credit-based. Good for testing before upgrading.
Novita AI is recorded as supporting OpenAI-compatible API access. Free/trial info: $0.50 free credits. Useful for low-cost testing by swapping SDK base_url.
NVIDIA Build (NIM API) is recorded as China-friendly: Direct access from China via integrate.api.nvidia.com, no proxy needed. Medium speed, may slow during peak hours.. Useful when you need access without complex network setup.
NVIDIA Build (NIM API) API has a recorded free trial: 无限制(已取消额度限制); rate limit: 40 RPM(可申请提升到 200 RPM).
NVIDIA Build (NIM API) has a recorded free tier: Unlimited (40 RPM rate limit). Good for testing before upgrading.
NVIDIA Build (NIM API) is recorded as supporting OpenAI-compatible API access. Free/trial info: 无限制(已取消额度限制). Useful for low-cost testing by swapping SDK base_url.
NVIDIA NIM has recorded free compute or trial credits: 40 RPM (upgradable to 200). Useful for inference, deployment, or GPU experiments.
NVIDIA offers free inference API for 100+ AI models spanning LLM, vision, speech, biology, and simulation. No credit card, no token limits, 40 RPM only. China accessible. 2026 highlight models: - Kimi K2.5 (Moonshot flagship, 1M context) - GLM-5.1 (Zhipu AI latest flagship, GLM-5 deprecated 04/20) - MiniMax M2.7 (230B params, coding/reasoning SOTA) - DeepSeek V3.2 / R1 (671B MoE, top reasoning) - Qwen 3.5 (397B/17B active, native multimodal) - Nemotron-3-Super-120B (NVIDIA's own, 1M context, Hybrid Mamba-Transformer) - Llama 4 Maverick, Gemma 4 31B, and more Why NVIDIA NIM is the most underrated free resource: - One account unlocks 100+ models, no need to register everywhere - 40 RPM is generous enough for daily dev and testing - Direct China access, no proxy needed - OpenAI-compatible API, base_url is https://integrate.api.nvidia.com/v1 - NVIDIA aggregates top models as infrastructure, not a model maker - New models added fast, often the first platform to offer free trials Best for: model comparison, prototyping, Agent tool calls, multi-model routing.
NVIDIA NIM platform has launched Meta Llama 4 models, offering free trial credits for users to experience the latest model inference capabilities at no cost.
NVIDIA NIM platform now hosts Meta Llama 4 series models, offering efficient deployment for rapid integration and inference.
OctoAI API has a recorded free trial: $10 free credits; rate limit: 60 RPM.
OctoAI has a recorded free tier: credit-based. Good for testing before upgrading.
OctoAI is recorded as supporting OpenAI-compatible API access. Free/trial info: $10 free credits. Useful for low-cost testing by swapping SDK base_url.
Ollama is recorded as China-friendly: Runs locally, no network needed. May need proxy to download models (or use Chinese mirrors).. Useful when you need access without complex network setup.
Ollama API has a recorded free trial: Unlimited; rate limit: Local.
Ollama has a recorded free tier: Unlimited (runs locally). Good for testing before upgrading.
Ollama is recorded as open-source or providing open model resources. Useful for local deployment, customization, and low-cost evaluation.
Ollama is recorded as supporting OpenAI-compatible API access. Free/trial info: Unlimited. Useful for low-cost testing by swapping SDK base_url.
OmniRoute offers a free or open-source option: 免费版. Useful for low-cost developer testing.
OnePass (Google AI Pro) has a recorded free tier: Google AI Pro free for one year. Good for testing before upgrading.
OpenAI released Codex CLI, an open-source command-line coding tool that enables AI-assisted coding directly in the terminal, completely free to use.
The OpenAI Codex Enterprise Promo is an official limited-time application entry for enterprises adding net-new Codex users. The official page confirms that new Codex users on eligible enterprise accounts can request two months of free Codex usage; eligibility, routing, and approval remain subject to OpenAI's review.
OpenAI Codex for Open Source is an official application program for OSS maintainers. The key confirmed benefits are six months of ChatGPT Pro with Codex, API credits, and conditional Codex Security access; all benefits remain subject to OpenAI review and the Program Terms.
OpenAI Codex for Students is an official OpenAI Developers student offer: verified U.S. and Canadian university students can claim $100 in ChatGPT credits (shown as about 2,500 credits) for Codex, expiring 12 months after the grant date. These are not API credits and the offer is not global student access.
OpenAI API has a recorded free trial: $5; rate limit: 3 RPM (free tier).
OpenAI has a recorded free tier: ChatGPT free tier unlimited. Good for testing before upgrading.
OpenAI launches new GPT-4.1 API features including controlled generation, improved structured outputs, enhanced image understanding, and code execution support, providing developers with more powerful model capabilities.
OpenAI announced a significant price reduction for GPT-4.1 API, with input price dropping to $2 per million tokens and output price to $8 per million tokens, approximately 26% cheaper than GPT-4o, greatly reducing developer costs.
OpenAI launches GPT-4.1 series API, approximately 26% cheaper than GPT-4o, with input at $2/M tokens and output at $8/M tokens. GPT-4.1 mini and nano are even more affordable for various use cases.
OpenAI announces significant price reduction for GPT-4.1 API, with input price dropping to $2 per million tokens and output to $8 per million tokens, approximately 50% lower than GPT-4o, greatly reducing developer costs.
OpenAI announced a significant price reduction for GPT-4.1 API, with input price dropping to $2 per million tokens and output price to $8 per million tokens, 26% cheaper than GPT-4o, offering developers more cost-effective AI capabilities.
OpenAI announces GPT-4.1 API price reduction, with input prices 26% lower and output prices 50% lower than GPT-4o; GPT-4.1 mini and nano are even cheaper.
OpenAI announces significant price reduction for GPT-4.1 API, with input price dropping to $2/M tokens and output to $8/M tokens, 26% cheaper than GPT-4o, greatly reducing developer costs.
OpenAI announces significant price reduction for GPT-4.1 API, with input price dropping to $2 per million tokens and output to $8 per million tokens, offering better value than GPT-4o for large-scale inference and generation tasks.
OpenAI announces price reduction for GPT-4.1 API series, with input price dropping to $2 per million tokens and output to $8 per million tokens, offering better value than GPT-4o.
OpenAI announces a significant price cut for GPT-4.1 API, with input price reduced to $2/M tokens and output to $8/M tokens, offering better value than GPT-4o for large-scale API usage.
OpenAI announced a significant price reduction for the GPT-4.1 API, with input prices dropping to $2 per million tokens and output prices to $8 per million tokens, about 50% cheaper than GPT-4o, greatly reducing developer costs.
OpenAI announces significant price reduction for GPT-4.1 API, with input at $2/M tokens and output at $8/M tokens, 26%-50% cheaper than GPT-4o, offering developers more cost-effective AI capabilities.
OpenAI launches GPT-4.1 API series with significant price reduction compared to GPT-4o. GPT-4.1 nano input is only $0.1/1M tokens, output $0.4/1M tokens, ideal for cost-effective AI applications.
GPT-4.1 input $2/M tokens, output $8/M tokens, ~26% cheaper than GPT-4o.
OpenAI announced a significant price drop for GPT-4.1 API, with input price reduced to $2/1M tokens and output to $8/1M tokens, offering better value than GPT-4o.
OpenAI announces significant price reduction for GPT-4.1 API, with input price dropping to $2 per million tokens and output to $8 per million tokens, representing a 26%-50% decrease compared to GPT-4o, offering developers more cost-effective AI capabilities.
OpenAI announces significant price reduction for GPT-4.1 API, with input price dropping to $2/M tokens and output to $8/M tokens, approximately 50% cheaper than GPT-4o, offering developers more cost-effective AI capabilities.
OpenAI announces significant price reduction for GPT-4.1 API, with input price dropping to $2 per million tokens and output to $8 per million tokens, 26% cheaper than GPT-4o, greatly reducing developer costs.
OpenAI announces significant price reduction for GPT-4.1 API, with input price dropping to $2 per million tokens and output to $8 per million tokens, approximately 26% cheaper than GPT-4o, offering developers more cost-effective AI capabilities.
OpenAI announces a significant price reduction for the GPT-4.1 API, with input dropping to $2 per million tokens and output to $8 per million tokens, offering a substantial cost saving compared to GPT-4o for AI application development.
OpenAI announces a significant price reduction for GPT-4.1 API, with input price reduced to $2/1M tokens and output to $8/1M tokens, about 50% cheaper than GPT-4o, greatly reducing developer costs.
OpenAI announces significant price reduction for GPT-4.1 API, with input price dropping to $2 per million tokens and output to $8 per million tokens, 26%-50% lower than GPT-4o, greatly reducing developer costs.
OpenAI announces a significant price reduction for GPT-4.1 API, with input price dropping to $2 per million tokens and output to $8 per million tokens, 26% cheaper than GPT-4o, greatly reducing developer costs.
GPT-4.1 adds code completion capability for seamless IDE integration.
GPT-4.1 supports invoking a code execution sandbox via API, enhancing coding and data analysis.
OpenAI announced that the GPT-4.1 series models now support calling the code interpreter via API, allowing developers to leverage code execution for programming assistance, data processing, and analysis directly within their applications, significantly enhancing the model's utility in coding and data analysis scenarios.
OpenAI launched GPT-4.1 series, including GPT-4.1, GPT-4.1 mini, and GPT-4.1 nano, with 1M token context and reduced API pricing.
OpenAI officially released the GPT-4.1 series, including GPT-4.1, GPT-4.1 mini, and GPT-4.1 nano. The new series supports up to 1 million token context windows, with significantly reduced API pricing compared to previous generations, offering developers more powerful and cost-effective AI capabilities.
OpenAI released the GPT-4.1 series, including GPT-4.1, GPT-4.1 mini, and GPT-4.1 nano, with 1M token context and reduced API pricing.
OpenAI officially released the GPT-4.1 series, including GPT-4.1, GPT-4.1 mini, and GPT-4.1 nano, all supporting up to 1M token context windows with significantly reduced API pricing compared to previous generations, offering developers more powerful and cost-effective AI capabilities.
OpenAI released the GPT-4.1 series, including GPT-4.1, GPT-4.1 mini, and GPT-4.1 nano, with 1M token context, significant performance improvements, and reduced API pricing starting at $2 per million input tokens.
OpenAI officially releases the GPT-4.1 series, including standard, mini, and nano versions, with significant performance improvements across benchmarks and substantially reduced inference costs, offering developers more efficient and cost-effective AI capabilities.
OpenAI officially releases the GPT-4.1 series, including GPT-4.1, GPT-4.1 mini, and GPT-4.1 nano. The new series offers significant performance improvements at lower prices, suitable for various AI applications.
OpenAI officially released the GPT-4.1 series, including GPT-4.1, GPT-4.1 mini, and GPT-4.1 nano. All models support a 1M token context window, with significant performance improvements in code generation, instruction following, and long-context understanding. API pricing is substantially reduced compared to GPT-4o series, with input prices starting at $2/M tokens and output at $8/M tokens, offering developers better cost-effectiveness.
OpenAI launched GPT-4.1 series, including GPT-4.1, GPT-4.1 mini, and GPT-4.1 nano, with 1M token context and reduced API pricing.
OpenAI released the GPT-4.1 series, including GPT-4.1, GPT-4.1 mini, and GPT-4.1 nano, supporting up to 1M token context window with improved performance and reduced pricing.
OpenAI released GPT-4.1 series models, including GPT-4.1, GPT-4.1 mini, and GPT-4.1 nano, with 1M token context and reduced API pricing.
OpenAI released GPT-4.1, GPT-4.1 mini, and GPT-4.1 nano with 1M token context window and lower API pricing compared to GPT-4o, ideal for long-context and high-throughput applications.
OpenAI 于2026年4月将GPT-4o免费层从每日10次提升至50次,无需绑定支付方式即可使用,支持文本和图像输入。
ChatGPT free users can now access GPT-4o mini with limits, experiencing more powerful AI conversation capabilities.
OpenAI 为 GPT-4o-mini 模型提供免费层,注册后每日可免费调用约100次,适合轻量级应用和测试。中国大陆需通过代理访问。
OpenAI announces a significant price reduction for GPT-4o mini API, with input price dropping to $0.15/M tokens and output to $0.60/M tokens, offering developers a more cost-effective AI service.
新注册用户可获 $5 API 额度,用于体验 o3-mini 模型,有效期30天,支持中国大陆信用卡注册。
OpenAI is recorded as supporting OpenAI-compatible API access. Free/trial info: $5. Useful for low-cost testing by swapping SDK base_url.
新注册用户可获得 $50 免费 API 额度,可用于 Realtime API 及 GPT-4o 等模型,有效期 90 天。
OpenAI has enhanced Structured Outputs for the GPT-4.1 series, improving JSON mode reliability and performance, enabling developers to obtain structured outputs more consistently.
OpenRouter API has a recorded free trial: Free models; rate limit: 20 RPM.
新注册用户可获得少量免费额度,用于体验其聚合的众多模型API(如 Claude、GPT、Llama 等)。额度有限,适合初步测试。
OpenRouter 为新用户提供 $1 免费额度,同时提供多个永久免费模型(如 Mistral 7B、Llama 3 8B 等),支持统一 API 调用多种模型,中国大陆需科学上网。
OpenRouter 聚合多模型 API,新注册用户赠送 $1 免费额度,可用于 GPT-4、Claude 3.5、Gemini 等模型,中国大陆可访问,无需信用卡。
OpenRouter has a recorded free tier: Varies by model. Good for testing before upgrading.
OpenRouter 为新注册用户提供 $1 免费额度,可用于调用多种开源和商业模型(如 GPT-4、Claude、Llama 等),中国大陆需代理访问。
OpenRouter is recorded as supporting OpenAI-compatible API access. Free/trial info: Free models. Useful for low-cost testing by swapping SDK base_url.
Paperspace has recorded free compute or trial credits: free GPU notebooks (6hr sessions). Useful for inference, deployment, or GPU experiments.
Perplexity AI has a recorded free tier: No limit (basic search). Good for testing before upgrading.
Perplexity AI is recorded as supporting OpenAI-compatible API access. Free/trial info: $0. Useful for low-cost testing by swapping SDK base_url.
Perplexity Pro 提供1个月免费试用,包含无限次搜索、高级模型(GPT-4、Claude 3等)和文件上传功能。需绑定支付方式,试用结束后自动续费(可取消)。中国大陆可访问,但需科学上网。
Pieces offers a free or open-source option: Free plan includes core features — snippet management, AI chat, local model support. Useful for low-cost developer testing.
PipesHub AI offers a free or open-source option: 免费版. Useful for low-cost developer testing.
Poe has a recorded free tier: Credit-based, daily refresh. Good for testing before upgrading.
Pydantic AI offers a free or open-source option: Open-source framework, completely free to use. Useful for low-cost developer testing.
Qwen (Alibaba) is recorded as China-friendly: Alibaba Cloud service, direct access in China, very fast. Top-tier Chinese understanding. Qwen3.6-Plus coding near Claude Sonnet level.. Useful when you need access without complex network setup.
Qwen (Alibaba) API has a recorded free trial: 7000 万 tokens(新用户一次性); rate limit: 按模型不同.
Qwen (Alibaba) has a recorded free tier: No explicit limit. Good for testing before upgrading.
Alibaba's Qwen3.6-Plus is the strongest Chinese coding model. New Bailian users get 70M free tokens (one-time). Coding ability close to Claude Sonnet 4.6, priced at only ¥2/M tokens.
Qwen (Alibaba) is recorded as open-source or providing open model resources. Useful for local deployment, customization, and low-cost evaluation.
Qwen (Alibaba) is recorded as supporting OpenAI-compatible API access. Free/trial info: 7000 万 tokens(新用户一次性). Useful for low-cost testing by swapping SDK base_url.
Replicate API has a recorded free trial: Free tier; rate limit: Varies.
Replicate 平台新用户注册即送$10免费额度,可用于运行多种开源模型(如Llama 3、Stable Diffusion),无需绑定信用卡,中国大陆可注册使用。
平台托管大量 AI 模型,新用户注册可获得少量免费 GPU 时间,用于运行各种开源模型。超出后需付费。
Replicate 提供每月 50 次免费推理额度,支持大量开源模型(如 Stable Diffusion、Llama、Whisper),中国大陆需代理访问,适合模型测试和小型项目。
Replicate has a recorded free tier: Credit-based. Good for testing before upgrading.
Replicate 为新用户提供 $5 免费额度,可运行多种 AI 模型(图像生成、文本、语音等),中国大陆可注册但需绑定支付方式。
Replit has launched a Free Day of Coding event, offering users one day of free access to its AI-assisted development platform. The platform integrates code generation, auto-completion, and intelligent debugging to help developers build projects faster. This event aims to let more people experience the productivity boost of AI-driven coding.
Replit offers a free or open-source option: Free plan with unlimited public projects + limited AI features. Useful for low-cost developer testing.
Repomix offers a free or open-source option: Fully open-source and free, install via npm. Useful for low-cost developer testing.
Roo Code offers a free or open-source option: Open-source and free, bring your own API key. Useful for low-cost developer testing.
RunPod has recorded free compute or trial credits: $1 credits. Useful for inference, deployment, or GPU experiments.
Runtime, a YC P26-backed project, introduces sandboxed coding agents for teams. The tool allows team members to safely run AI coding agents in isolated sandbox environments, supporting collaboration and code review. A free trial is currently available, making it ideal for development teams exploring AI-assisted coding.
Runway has a recorded free tier: 125 credits (one-time). Good for testing before upgrading.
SaladCloud has recorded free compute or trial credits: trial credits. Useful for inference, deployment, or GPU experiments.
SambaNova Cloud offers the world's only free LLaMA 3.1 405B API access. Core advantages: - LLaMA 3.1 405B (405 billion parameters) completely free — the largest free open-source model - The only platform globally offering free 405B access, bar none - Custom RDU (Reconfigurable Dataflow Unit) chip acceleration, ultra-fast inference - 30 RPM rate limit, no total cap — thousands of calls per day - API keys start with sn-, OpenAI-compatible format Supported models: - LLaMA 3.1 405B (flagship, best for complex reasoning) - Llama 3.3 70B (best value) - DeepSeek R1/V3 (671B MoE) - Qwen 2.5 72B - More models added regularly 405B vs 70B difference: - Significantly better complex reasoning (math, logic, multi-step) - Stronger long-text understanding (128K context) - Higher code generation quality - More precise instruction following Requires proxy from China (use openllmapi.com). Ideal for developers needing large model capabilities on a budget.
SambaNova API has a recorded free trial: Free tier(永久免费); rate limit: 30 RPM.
SambaNova has a recorded free tier: 30 RPM (no total cap). Good for testing before upgrading.
SambaNova is recorded as supporting OpenAI-compatible API access. Free/trial info: Free tier(永久免费). Useful for low-cost testing by swapping SDK base_url.
SenseNova Token Plan beta is a lead for free DeepSeek-V4-Flash API access. Developers in China can test it for low-cost document handling, summarization, and simple agent subtasks. Current details come from a public article and platform entry; quotas and limits need re-verification.
A low-cost telco AI token package lead worth testing for effective cost per million tokens. Current info comes from a 2026-05-16 CLS screenshot: ¥1 for about 250k quota points, mobile-bill payment, and multiple model access. Verify quota conversion, supported models, and rate limits first.
SiliconFlow offers a 14-day free API trial for new users, supporting a variety of mainstream models, ideal for developers to quickly experience and test.
SiliconFlow provides 2M free tokens for new users, supporting multiple models, ideal for developers to get started quickly.
SiliconFlow is recorded as China-friendly: Chinese service, direct access in China. Top choice for Chinese developers to access open-source model APIs.. Useful when you need access without complex network setup.
SiliconFlow added DeepSeek-R1 reasoning model for API access, supporting efficient inference tasks.
SiliconFlow 为新注册用户提供 2000 万 token 免费额度,支持 Llama、Qwen、DeepSeek 等多个开源模型,兼容 OpenAI API 格式,中国大陆可直连,注册即送。
SiliconFlow offers free API credits for new users, supporting multiple models upon registration.
SiliconFlow 是中国大陆领先的 AI 模型聚合平台,新用户注册即赠送 2000万 token 免费额度,支持 Llama、Qwen、DeepSeek 等多种开源模型,API 兼容 OpenAI 格式,中国大陆直接访问。
注册即送 14 元 API 额度,支持 Llama、Qwen、DeepSeek 等多种开源模型,中国大陆网络可直接访问,适合开发者快速测试。
SiliconFlow API has a recorded free trial: ¥14; rate limit: Varies.
SiliconFlow 提供长期免费API额度,每月200万Token调用量,另赠送15元体验金可用于更高性能模型。支持多种开源模型(如Qwen、Llama、ChatGLM等),中国大陆直连,注册即用。
SiliconFlow 提供每日200次免费API调用额度,支持Llama、Qwen、DeepSeek等主流开源模型,中国大陆用户可直接注册使用,无需海外信用卡。
SiliconCloud added multiple free models including DeepSeek-V3 and Qwen2.5 series, available for free API calls.
SiliconFlow offers 14 open-source model APIs completely free, including Qwen, DeepSeek, Llama. Direct China access, fast speed, OpenAI compatible. The most convenient free AI API for Chinese developers.
注册 SiliconFlow 平台即送 2000 万 token,支持 Llama、Qwen、DeepSeek 等多种开源模型,中国大陆直连,提供 OpenAI 兼容 API。
SiliconFlow has a recorded free tier: Varies by model. Good for testing before upgrading.
SiliconFlow gives new users $10 API credits, valid for 30 days.
SiliconFlow offers new users 14 yuan voucher for API usage.
SiliconFlow offers 14 RMB (~$2) API credits for new users, usable on DeepSeek and other models.
SiliconCloud offers 20 million free tokens for new users, supporting multiple models, ongoing promotion.
SiliconCloud gives new users 20 million free tokens for multi-model API calls, suitable for various AI application development.
SiliconFlow gives 20 million free tokens to new users, supporting multiple models.
SiliconCloud by SiliconFlow offers a 14-day free trial for new users, granting 20 million tokens for all models on the platform.
SiliconCloud provides 20 million free tokens for new users, supporting multiple models for AI application development.
SiliconCloud offers 20M free tokens for new users, supporting multiple mainstream models, ideal for developers to quickly start testing.
SiliconCloud offers 20 million free tokens for new users, supporting multiple models, ideal for developers to get started quickly.
New SiliconFlow users receive 2M free tokens for various models upon registration, no minimum usage required.
SiliconFlow offers $5 free API credits for new users, usable across multiple models.
New SiliconCloud users get a 14 RMB coupon for API calls on various models, covering popular open-source models.
SiliconFlow offers free API credits for new users, supporting multiple models, ideal for developers to get started quickly.
SiliconFlow 为新注册用户提供 14元 免费额度,可用于调用 Llama、Qwen、Yi、DeepSeek 等多种开源大模型 API,国内直连,支持 OpenAI 兼容接口,适合开发者测试和集成。
SiliconFlow is recorded as supporting OpenAI-compatible API access. Free/trial info: ¥14. Useful for low-cost testing by swapping SDK base_url.
iFlytek Spark is recorded as China-friendly: By iFlytek, direct access in China, industry-leading voice capabilities.. Useful when you need access without complex network setup.
iFlytek Spark API has a recorded free trial: 200万 tokens; rate limit: 5 RPM.
iFlytek Spark has a recorded free tier: No explicit limit. Good for testing before upgrading.
iFlytek Spark is recorded as supporting OpenAI-compatible API access. Free/trial info: 200万 tokens. Useful for low-cost testing by swapping SDK base_url.
StepFun is recorded as China-friendly: StepFun, direct access in China. Step 3.5 Flash is extremely fast.. Useful when you need access without complex network setup.
StepFun API has a recorded free trial: ¥10; rate limit: 5 RPM.
阶跃星辰为新注册用户提供 100万 token 免费 API 额度,支持 Step-2 万亿参数大模型,中国大陆直连,注册即用,无需复杂审核。
StepFun has a recorded free tier: No explicit limit. Good for testing before upgrading.
StepFun is recorded as open-source or providing open model resources. Useful for local deployment, customization, and low-cost evaluation.
StepFun is recorded as supporting OpenAI-compatible API access. Free/trial info: ¥10. Useful for low-cost testing by swapping SDK base_url.
阶跃星辰 Step-2 大模型为新注册用户提供 100 万 token 的免费 API 调用额度,支持多模态和文本生成,中国大陆直连,适合快速体验和开发测试。
Suno has a recorded free tier: 50 credits/day (~10 songs). Good for testing before upgrading.
Superset offers a free or open-source option: Open source and free to self-host locally. Useful for low-cost developer testing.
Superset is an integrated development environment (IDE) designed for the agent era, incubated by YC P26. It provides a complete toolchain to help developers build, debug, and deploy AI agent applications. The project is fully free and open source, with anyone able to access the GitHub repository for source code and contributions. As a newly launched product on its first day, Superset aims to lower the barrier to agent development, enabling more developers to get started quickly.
Tabnine offers a free or open-source option: Free plan offers basic completions using local small model, code stays on your machine. Useful for low-cost developer testing.
TEN Framework offers a free or open-source option: Open-source framework, completely free to use. Useful for low-cost developer testing.
腾讯混元大模型为开发者提供每月 100 万 token 的免费 API 调用额度,支持文本生成、对话等能力,中国大陆开发者可直接使用微信/QQ 登录,无需绑定信用卡。
textgen has recorded free compute or trial credits: 免费算力/额度. Useful for inference, deployment, or GPU experiments.
Tiangong AI is recorded as China-friendly: By Kunlun Tech, direct access in China.. Useful when you need access without complex network setup.
Tiangong AI API has a recorded free trial: Free tier; rate limit: Varies.
Tiangong AI has a recorded free tier: No explicit limit. Good for testing before upgrading.
Tiangong AI is recorded as open-source or providing open model resources. Useful for local deployment, customization, and low-cost evaluation.
Together AI 为新用户提供 $25 免费 API 额度,可用于调用 Llama、Mixtral、Stable Diffusion 等开源模型,支持 OpenAI 兼容接口,中国大陆需代理访问。
Together AI 为新用户提供每月 $25 免费额度,支持 Llama、Mistral、DeepSeek 等多种开源模型,中国大陆需代理,适合模型微调和推理测试。
Together AI has recorded free compute or trial credits: $5 credits. Useful for inference, deployment, or GPU experiments.
新注册用户获得 $25 免费 API 额度,支持 Llama 3、Mixtral、Falcon 等多种开源模型,兼容 OpenAI 格式,中国大陆需代理访问,注册无需信用卡。
Together AI gives new users $5 free credits for 200+ open-source model APIs. Highlights: - $5 free credits, enough for tens of thousands of API calls - FLUX image generation completely free, doesn't consume credits (hidden perk!) - Supports Llama 3.3 70B/405B, Mixtral 8x22B, Qwen 2.5, DeepSeek V3/R1 - Serverless and Dedicated deployment modes - OpenAI-compatible format - Fast inference, JSON Mode, Function Calling support FLUX free image generation is the biggest highlight: - FLUX.1 Schnell (fast, 1-4 step generation) - FLUX.1 Dev (high quality) - Completely free, unlimited, doesn't consume $5 credits - Quality comparable to Midjourney, great for batch product images and marketing assets Perfect for developers needing quality open-source model APIs plus free image generation.
Together AI offers $25 free API credits for new users, supporting 200+ open-source models. Key highlight: FLUX.1 Schnell Free image generation is completely free! - No credits consumed - Unlimited use - High-quality AI image generation - The only platform offering free high-quality AI image generation API LLM models: Llama 3.3 70B Turbo, Llama 4 Maverick, DeepSeek V3, Mixtral 8x22B, and 200+ more. API keys start with together-, OpenAI-compatible. base_url: https://api.together.xyz/v1 Requires proxy from China (use openllmapi.com).
Together AI API has a recorded free trial: $5(注册赠送); rate limit: Varies by model.
Together AI has a recorded free tier: Credit-based ($5 signup bonus). Good for testing before upgrading.
Together AI is recorded as supporting OpenAI-compatible API access. Free/trial info: $5(注册赠送). Useful for low-cost testing by swapping SDK base_url.
tradingview-mcp offers a free or open-source option: 免费版. Useful for low-cost developer testing.
unity-mcp offers a free or open-source option: Open-source and free to use, no usage limits. Useful for low-cost developer testing.
useknockout is an open-source project offering a free SOTA background removal and super-resolution API as an alternative to remove.bg and Topaz. It is MIT licensed and runs on the Modal platform, allowing users to utilize it within Modal's free tier. Suitable for developers and businesses needing image background removal or super-resolution processing.
UUSEC WAF is an industry-leading free, high-performance Web Application Firewall and API Security Gateway powered by AI and semantic technology. It supports SQL injection, XSS, DDoS protection, data masking, RASP, and ModSecurity rule compatibility for enterprise-grade application security.
v0 offers a free or open-source option: Free plan with 200 messages per month for generating and iterating UI components. Useful for low-cost developer testing.
Vast.ai has recorded free compute or trial credits: $1 credits. Useful for inference, deployment, or GPU experiments.
VideoCaptioner offers a free or open-source option: Fully open-source and free, no usage limits when running locally. Useful for low-cost developer testing.
Vidu is recorded as China-friendly: Chinese platform, direct access in China, fast speed.. Useful when you need access without complex network setup.
Vidu API has a recorded free trial: $1; rate limit: N/A.
Vidu has a recorded free tier: 80 credits/month. Good for testing before upgrading.
字节跳动火山引擎提供的豆包大模型 API,新用户通常有一定量的免费 tokens 额度,中国大陆可直接使用且稳定。
Warp announces an open-source model built on OpenAI's GPT-5.5, available for free to developers. The model supports various NLP tasks including text generation, code writing, and logical reasoning. Users can sign up for a Warp account to obtain an API key and start using it immediately. This initiative aims to advance the open-source AI ecosystem and lower the barrier for developers to access cutting-edge models.
Warpdrv is a newly released open-source Llama.cpp launcher designed for daily-driving Qwen 35b and 27b models on Strix Halo and RTX Pro hardware. The project is completely free, and users can obtain the code directly from Reddit or GitHub. It simplifies the local LLM deployment process, suitable for users with compatible hardware for local inference.
Windsurf offers a free or open-source option: Free plan with unlimited basic completions + limited Cascade premium requests per month. Useful for low-cost developer testing.
wx_channels_download packages local proxying, stream parsing, and WeChat WebView button injection into a usable open-source tool. Its real value is not just downloading videos, but turning WeChat Channels content into assets that can be archived, transcribed, analyzed, and repurposed.
01.AI (Yi) is recorded as China-friendly: By 01.AI, direct access in China. Yi-Lightning is very cost-effective.. Useful when you need access without complex network setup.
01.AI (Yi) API has a recorded free trial: ¥10; rate limit: 5 RPM.
01.AI (Yi) has a recorded free tier: No explicit limit. Good for testing before upgrading.
01.AI (Yi) is recorded as open-source or providing open model resources. Useful for local deployment, customization, and low-cost evaluation.
01.AI (Yi) is recorded as supporting OpenAI-compatible API access. Free/trial info: ¥10. Useful for low-cost testing by swapping SDK base_url.
yn offers a free or open-source option: 免费版. Useful for low-cost developer testing.
ChatGLM (Zhipu AI) is recorded as China-friendly: Chinese platform, direct access. GLM-4-Flash is free.. Useful when you need access without complex network setup.
注册智谱AI开放平台即送 100 万 token,可用于 GLM-4 系列模型,支持文本和图像生成,中国大陆开发者直接使用,无需翻墙。
新注册用户获赠 100 万 token 免费额度,可用于 GLM-4、GLM-4V 等模型 API 调用,中国大陆直连,支持联网搜索和图像理解。
ChatGLM (Zhipu AI) API has a recorded free trial: 500万 tokens; rate limit: 5 RPM.
智谱AI 为新注册用户提供 100万 token 的免费 API 额度,可用于 GLM-4、GLM-4V 等模型,中国大陆直连,支持 Python 和 HTTP 调用。
智谱 AI 为新注册用户提供 500 万免费 tokens,支持 GLM-4 系列模型,中国大陆直接使用,无需翻墙,注册即送。
ChatGLM (Zhipu AI) has a recorded free tier: No explicit limit. Good for testing before upgrading.
智谱AI为GLM-4系列模型提供注册即送18元免费API额度,支持对话、代码生成等,中国大陆开发者可直接使用,无需海外工具。
智谱 AI 为新注册开发者提供 500 万 token 免费额度,可用于 GLM-4、GLM-4V 等最新模型,中国大陆直接使用,支持手机号注册,无需海外支付方式。
智谱AI为新注册用户提供500万Token免费额度(含GLM-4、GLM-4V等多模态模型),额外赠送100元API体验金,可用于更高阶模型调用。中国大陆手机号直接注册,无需海外支付方式。
智谱AI为注册用户提供100万Token免费额度,支持GLM-4、GLM-4V等模型,国内直接访问,注册即用,无需海外环境。
Zhipu GLM is a strong free API option for China-based developers today: registration is local-friendly, access is stable, and the API can be used in an OpenAI-compatible style. It is useful for Chinese customer support, knowledge-base QA, content generation, and multimodal experiments.
智谱AI 为新注册用户提供 100 万 token 的免费调用额度,同时赠送 100 元体验金,可用于 GLM-4、GLM-4V 等模型,支持中国大陆直连,适合开发者和学生使用。
智谱 AI 为新用户提供 100 万 token 免费额度,可用于 GLM-4 系列模型(含 API 和 Web 端),中国大陆直接注册使用,无需海外支付方式,适合中文场景开发。
智谱 AI 为开发者提供 GLM-4、GLM-3-Turbo 等模型的免费 API 调用额度,每月 100 万 Token,注册即享,支持中国大陆网络直接使用,适合个人开发者和中小企业测试集成。
智谱 AI 为注册用户提供免费 100 万 token 额度,可用于 GLM-4、GLM-4-Flash 等模型 API 调用,中国大陆开发者可直接使用,支持 Python SDK 和 OpenAI 兼容接口。
ChatGLM (Zhipu AI) is recorded as open-source or providing open model resources. Useful for local deployment, customization, and low-cost evaluation.
ChatGLM (Zhipu AI) is recorded as supporting OpenAI-compatible API access. Free/trial info: 500万 tokens. Useful for low-cost testing by swapping SDK base_url.
智谱 AI 为新注册用户提供 500万 Token 免费额度,可用于 GLM-4、GLM-4V 等模型 API 调用,中国大陆直接访问,支持微信/支付宝实名认证。
Google has increased the Gemini 1.5 Flash free tier to 30 RPM and 1500 requests per day, significantly boosting the free usage quota.
Hugging Face launched a free inference API supporting multiple open-source models with rate-limited free access.
OpenAI released GPT-4o mini with pricing at $0.15/M input tokens and $0.60/M output tokens, 97% cheaper than GPT-4o, significantly reducing API usage costs.
🎁 Free Resource Pack
Free API credits list, AI business case studies, payment stack, risk checklist, and a monetization roadmap.