System map
Astrolabe Cloud has four production surfaces:- Cloud app at
https://app.astrolabe.runfor workspace setup, API keys, billing, routing stacks, usage, and traces. - Gateway at
https://api.astrolabe.run/v1for model requests. - Control plane for workspace state, API key mirrors, billing ledger, stack versions, and trace records.
- Provider layer through OpenRouter for concrete model execution.
Request lifecycle
ForPOST /v1/responses and POST /v1/chat/completions:
- The gateway extracts the API key from
Authorization: Bearer ...orx-api-key. - The key is resolved to a workspace and key prefix.
- Rate limits are checked for the key.
- Astrolabe estimates request size and reserves prepaid workspace balance.
- The requested model is normalized. Virtual aliases such as
autoresolve toastrolabe/auto. - The gateway resolves the active routing stack.
- The stack applies boundaries, objectives, fallback policy, verification settings, and request overrides.
- Astrolabe selects a concrete serving model and calls OpenRouter.
- Fallback, verification, schema recovery, or policy blocking may run depending on the stack and response.
- Usage is finalized from provider usage where available.
- Billing, model-call records, and route traces are written back to the control plane.
- The response returns with Cloud, stack, model, policy, rate-limit, and request-id headers.
Stack resolution
Astrolabe chooses a stack in this order:metadata.astrolabe.stackon the request.- Stack assigned to the API key.
- Stack assigned to the workspace.
- Managed default stack.
Billing model
Astrolabe Cloud uses prepaid credits.- A request must pass auth, workspace status, and balance checks before provider work starts.
- Astrolabe reserves enough balance for the selected route.
- Successful provider usage becomes the customer debit.
- Failed, rejected, duplicate, unpriced, or missing-usage requests are recorded as unbillable.
- Route traces and usage rows remain available for debugging.
What clients can observe
Logx-astrolabe-cloud-request-id for every request. It is the primary support and trace lookup key.
Response headers can expose:
- workspace and key prefix
- rate-limit state
- selected model and provider
- stack id, name, and version
- policy status and decision kind
- category, action class, complexity, and lane
- verification status and confidence
- estimated baseline cost and estimated savings

