Vercel AI SDK with OpenAI-Compatible API: baseURL, Streaming, JSON

How do I use an OpenAI-compatible API with Vercel AI SDK?

Short answer

Configure the compatible provider explicitly: baseURL, apiKey, and model. Then test basic chat, streaming chunks, JSON/structured output, and error handling before migrating real traffic. For low-cost routes, benchmark DeepSeek, Qwen, GLM, Groq/OpenRouter, or OpenLLMAPI behind the same app interface.

Vercel AI SDK OpenAI compatible APIOpenAI compatible baseURL Next.jsVercel AI SDK custom providerOpenAI compatible streaming JSON mode

Conclusion

OpenAI-compatible does not guarantee identical streaming, JSON, or tool-call behavior.
Vercel apps should keep provider settings in env vars so route changes do not require code rewrites.
Always test the actual SDK path, not only curl, because adapters can change request shape.
Use fallback when the app has paying users, scheduled jobs, or agent loops.

What to do next

Choose a provider and copy its current compatible endpoint from official docs.
Set baseURL, apiKey, and model explicitly in your provider/client configuration.
Run a minimal chat request from the same Next.js route your app will use.
Test streaming, structured output, timeout, rate-limit, and provider error bodies.
Add OpenLLMAPI or another routing layer when you need fallback, logs, and budget caps.

Recommended paths

Provider	Free / credits	Best for
DeepSeek	Verify current pricing	Low-cost Vercel app backend tests
Qwen DashScope	Signup credits vary	Compatible-mode China-friendly apps
Zhipu GLM	Signup tokens vary	Domestic fallback and GLM experiments
OpenRouter/Groq	Free routes vary	Fast demos and broad model tests
OpenLLMAPI	Trial varies	One endpoint with fallback and spend logs

Global developer checklist

Confirm whether signup, billing, and API keys work from your country before writing production code.
Prefer OpenAI-compatible endpoints when you may need to switch models, regions, or providers later.
Test free credits with a real smoke prompt and record latency, error shape, streaming behavior, and quota burn.
Keep at least one fallback route for provider outages, model deprecations, and regional access changes.

Production handoff

Use one compatible endpoint for Vercel apps

Route low-cost models, fallback failures, and track spend without rewriting your Next.js AI routes. Signup is UTM-tagged for Vercel AI SDK intent.

Configure a routed endpoint →

FAQ

Is changing baseURL enough?

It is enough for some chat calls, but you still must test streaming, JSON, errors, model names, and rate limits.

Which provider is best for Vercel AI SDK?

For prototypes, pick the fastest no-card or low-cost route that passes your SDK smoke tests. For production, choose by accepted-task cost and fallback support.

Why does streaming break?

Some compatible endpoints emit chunks or finish reasons differently. Test with your exact SDK helper and UI parser.

How do I avoid lock-in?

Keep provider config in env vars, normalize errors, log route/model, and keep at least one fallback route.