Skip to main content
POST
/
v1
/
responses
curl --request POST \
  --url https://api.astrolabe.run/v1/responses \
  --header 'Authorization: Bearer <token>' \
  --header 'Content-Type: application/json' \
  --data '
{
  "model": "astrolabe/auto",
  "input": "Explain routing stacks in one sentence."
}
'
{
  "id": "<string>",
  "object": "<string>",
  "created": 123,
  "status": "<string>",
  "model": "<string>",
  "output": [
    {}
  ],
  "usage": {},
  "metadata": {}
}
POST /v1/responses is the recommended Astrolabe Cloud inference endpoint for new integrations. It accepts OpenAI-style Responses payloads and adds Cloud routing, billing, stack policy, fallback, verification, and traceability.
{
  "model": "astrolabe/auto",
  "input": "Summarize this support conversation.",
  "metadata": {
    "astrolabe": {
      "stack": "customer-data",
      "quality": "high",
      "verification": "auto"
    }
  }
}

Streaming

Set stream: true for server-sent events. Streaming responses still include Cloud request, stack, model, and rate-limit headers.

Billing

The gateway reserves prepaid balance before provider execution and finalizes the debit from provider usage where available.

Authorizations

Authorization
string
header
required

Bearer authentication header of the form Bearer <token>, where <token> is your auth token.

Body

application/json
input
any
required
model
string
default:astrolabe/auto
instructions
string
stream
boolean
default:false
tools
object[]
tool_choice
any
response_format
any
reasoning
any
text
any
max_output_tokens
integer
metadata
object

Response

Successful JSON or streaming response.

id
string
object
string
created
integer
status
string
model
string
output
object[]
usage
object
metadata
object