1. Prerequisites
- Bud AI Foundry account with access to the platform.
- A project created in your workspace.
- Deploy model in the project.
- API key with permission to call endpoint.
2. Step 1: Create an account
- Sign in or create a Bud AI Foundry account.
- Complete workspace setup and verify your organization details.
- Invite teammates if needed and assign roles.
3. Step 2: Create your first project
- Navigate to Projects and click Create Project.
- Name the project, add tags and description.
- Generate an API key for the project.
4. Step 3: Add a model
- Go to Models and select +Model.
- Choose a model source (Cloud, Hugging Face, URL, or Disk).
- Provide metadata, tags, and approval details, then save the model.
5. Step 4: Deploy the model
- Open your project detail page and select Deploy Model.
- Choose a model from the catalog and confirm the model source.
- Select a cluster and hardware profile.
- Configure scaling and safety settings, then launch the deployment.
6. Step 5: Test your endpoint
Use your project API key to call the endpoint with an OpenAI-compatible request.7. Step 6: Monitor and iterate
- Open Observability to review latency, token usage, and error trends.
- Run evaluations or benchmarks to compare models and configurations.
- Update routing or scaling policies based on usage patterns.