Workspace Model

Observability is organized around three tabs:
  • Metrics: aggregate trends and KPIs.
  • Requests: request-level evidence.
  • Rules: active governance controls.

Key Entities

Entity             Description                       Typical Questions
Inference Request  One end-user or service call      Why did this request fail or slow down?
Deployment         Runtime endpoint for a model      Which deployment is degrading?
Model              Inference model variant/provider  Is this model causing a latency increase?
Rule               Traffic control policy            Are blocks intentional and effective?
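
One way to picture how these entities relate is a small data model. The sketch below is illustrative only: every class and field name (for example `deployment_id`, `latency_ms`, `blocked_by`) is an assumption made for the example, not the product's actual schema.

```python
from dataclasses import dataclass
from datetime import datetime
from typing import Optional


@dataclass(frozen=True)
class Model:
    name: str       # hypothetical: model variant name
    provider: str   # hypothetical: who serves the model


@dataclass(frozen=True)
class Deployment:
    deployment_id: str
    model: Model    # the model this runtime endpoint serves


@dataclass(frozen=True)
class Rule:
    rule_id: str
    description: str
    active: bool    # an inactive rule can stay on record for audit history


@dataclass(frozen=True)
class InferenceRequest:
    request_id: str
    deployment: Deployment
    timestamp: datetime
    status_code: int
    latency_ms: float
    blocked_by: Optional[Rule] = None  # set only when a rule blocked the call


# Example records (all values are made up for illustration).
model = Model(name="example-model", provider="example-provider")
deployment = Deployment(deployment_id="dep-1", model=model)
request = InferenceRequest(
    request_id="req-1",
    deployment=deployment,
    timestamp=datetime(2024, 1, 1, 12, 0),
    status_code=200,
    latency_ms=840.0,
)
```

Linking each request to its deployment, and each deployment to its model, is what makes the questions in the table answerable by pivoting from one entity to the next.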

Signal Types

  • Volume: Request rate and traffic shape.
  • Performance: Latency, time to first token (TTFT), processing time.
  • Quality: Success/failure rates and status codes.
  • Cost/Usage: Input/output token totals and trends.
  • Security/Governance: Block events and active rules.
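
To make the five signal types concrete, here is a minimal sketch that derives one number per signal from a list of request records. The record fields (`status_code`, `latency_ms`, `ttft_ms`, `input_tokens`, `output_tokens`, `blocked`) are hypothetical, and treating only 2xx status codes as success is an assumption of the example.

```python
from statistics import median


def summarize_signals(requests):
    """Return one illustrative value per signal type for a batch of requests."""
    total = len(requests)
    if total == 0:
        return {"volume": 0}
    successes = sum(1 for r in requests if 200 <= r["status_code"] < 300)
    return {
        # Volume
        "volume": total,
        # Performance
        "median_latency_ms": median([r["latency_ms"] for r in requests]),
        "median_ttft_ms": median([r["ttft_ms"] for r in requests]),
        # Quality
        "success_rate": successes / total,
        # Cost/Usage
        "input_tokens": sum(r["input_tokens"] for r in requests),
        "output_tokens": sum(r["output_tokens"] for r in requests),
        # Security/Governance
        "blocked": sum(1 for r in requests if r["blocked"]),
    }


# Made-up sample data: two successes, one server error, one blocked call.
sample = [
    {"status_code": 200, "latency_ms": 800.0, "ttft_ms": 120.0,
     "input_tokens": 50, "output_tokens": 200, "blocked": False},
    {"status_code": 200, "latency_ms": 1200.0, "ttft_ms": 180.0,
     "input_tokens": 70, "output_tokens": 300, "blocked": False},
    {"status_code": 500, "latency_ms": 4000.0, "ttft_ms": 900.0,
     "input_tokens": 60, "output_tokens": 0, "blocked": False},
    {"status_code": 403, "latency_ms": 5.0, "ttft_ms": 0.0,
     "input_tokens": 0, "output_tokens": 0, "blocked": True},
]
summary = summarize_signals(sample)
```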

Investigation Loop

  1. Detect an anomaly in Metrics.
  2. Narrow Requests with a time range and filters.
  3. Inspect the request detail for metadata, payload, and IDs.
  4. Correlate with Rules and tune policies if needed.
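
The narrowing and correlation steps above can be sketched in code. This is not the product's API: `narrow_requests`, `blocks_per_rule`, and the record fields (`timestamp`, `deployment_id`, `rule_id`) are hypothetical names used only to show the shape of the workflow on exported request records.

```python
from datetime import datetime, timedelta


def narrow_requests(requests, start, end, **filters):
    """Step 2: keep requests in [start, end) that match every given filter."""
    return [
        r for r in requests
        if start <= r["timestamp"] < end
        and all(r.get(key) == value for key, value in filters.items())
    ]


def blocks_per_rule(requests):
    """Step 4: count block events per rule ID to compare against Rules."""
    counts = {}
    for r in requests:
        rule_id = r.get("rule_id")
        if rule_id is not None:
            counts[rule_id] = counts.get(rule_id, 0) + 1
    return counts


# Made-up records; rule_id is None when no rule blocked the request.
t0 = datetime(2024, 1, 1, 12, 0)
requests = [
    {"request_id": "req-1", "timestamp": t0,
     "deployment_id": "dep-a", "status_code": 200, "rule_id": None},
    {"request_id": "req-2", "timestamp": t0 + timedelta(minutes=5),
     "deployment_id": "dep-a", "status_code": 403, "rule_id": "rule-7"},
    {"request_id": "req-3", "timestamp": t0 + timedelta(minutes=6),
     "deployment_id": "dep-b", "status_code": 500, "rule_id": None},
    {"request_id": "req-4", "timestamp": t0 + timedelta(hours=2),
     "deployment_id": "dep-a", "status_code": 403, "rule_id": "rule-7"},
]

# Narrow to a 30-minute incident window on one deployment, then correlate.
incident = narrow_requests(
    requests, t0, t0 + timedelta(minutes=30), deployment_id="dep-a"
)
rule_hits = blocks_per_rule(incident)
```

The request IDs left in `incident` are the ones worth opening in the request detail view (step 3), and `rule_hits` points to the rules to review in step 4.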

Best Practices

  • Start broad (24h) and then narrow to the exact incident window.
  • Use deployment-level pivots before deep diving into single requests.
  • Track both latency and success rate; one without the other is incomplete.
  • Keep inactive rules for audit history instead of deleting by default.
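
As an illustration of pivoting by deployment and reading latency together with success rate, the sketch below groups request records per deployment and reports both. The function name `pivot_by_deployment` and the record fields are assumptions for the example, as is the choice to count only 2xx status codes as success.

```python
from collections import defaultdict
from statistics import median


def pivot_by_deployment(requests):
    """Group requests per deployment and report latency and success together."""
    groups = defaultdict(list)
    for r in requests:
        groups[r["deployment_id"]].append(r)

    report = {}
    for deployment_id, rows in groups.items():
        ok = sum(1 for r in rows if 200 <= r["status_code"] < 300)
        report[deployment_id] = {
            "requests": len(rows),
            "success_rate": ok / len(rows),
            "median_latency_ms": median([r["latency_ms"] for r in rows]),
        }
    return report


# Made-up data: dep-b is slightly faster than dep-a but fails half its calls.
sample = [
    {"deployment_id": "dep-a", "status_code": 200, "latency_ms": 100.0},
    {"deployment_id": "dep-a", "status_code": 200, "latency_ms": 110.0},
    {"deployment_id": "dep-a", "status_code": 200, "latency_ms": 120.0},
    {"deployment_id": "dep-b", "status_code": 200, "latency_ms": 90.0},
    {"deployment_id": "dep-b", "status_code": 500, "latency_ms": 95.0},
]
report = pivot_by_deployment(sample)
```

In this sample, a latency-only view would rank dep-b as the healthier deployment, while its success rate shows that half of its requests fail.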