Public endpoints
GET /healthGET /readyGET /v1/modelsGET /v1/lanesPOST /v1/responsesPOST /v1/chat/completions
Authentication
Create API keys in the Cloud app. Send a key with either:Gateway behavior
The gateway:- rejects missing or invalid keys
- rejects suspended workspaces or revoked keys
- enforces key-level rate limits
- requires sufficient prepaid balance before routing
- resolves stack policy before provider calls
- records route traces and model-call records
- returns Cloud request ids and routing headers
Compatibility
UsePOST /v1/responses for new integrations. Use POST /v1/chat/completions when an existing client only supports Chat Completions.
Both endpoints use the same stack, billing, and routing runtime.
