Gateway endpoints
Responses
Primary Astrolabe Cloud inference endpoint. Supports virtual models, concrete model IDs, routing stack selection, streaming, billing, and route traces.
POST
POST /v1/responses is the recommended Astrolabe Cloud inference endpoint for new integrations.
It accepts OpenAI-style Responses payloads and adds Cloud routing, billing, stack policy, fallback, verification, and traceability.
Recommended request
Streaming
Setstream: true for server-sent events. Streaming responses still include Cloud request, stack, model, and rate-limit headers.
Billing
The gateway reserves prepaid balance before provider execution and finalizes the debit from provider usage where available.Authorizations
Bearer authentication header of the form Bearer <token>, where <token> is your auth token.
Body
application/json

