Observability Concepts - Bud Stack Documentation

Metrics: aggregate trends and KPIs.
Requests: request-level evidence.
Rules: active governance controls.

Workspace Model

Observability is organized around three tabs:

Entity	Description	Typical Questions
Inference Request	One end-user or service call	Why did this request fail or slow down?
Deployment	Runtime endpoint for a model	Which deployment is degrading?
Model	Inference model variant/provider	Is this model causing latency increase?
Rule	Traffic control policy	Are blocks intentional and effective?

Start broad (24h) and then narrow to the exact incident window.

Use deployment-level pivots before deep diving into single requests.

Track both latency and success rate; one without the other is incomplete.

Keep inactive rules for audit history instead of deleting by default.