> ## Documentation Index
> Fetch the complete documentation index at: https://docs.budecosystem.com/llms.txt
> Use this file to discover all available pages before exploring further.

# Creating Your First Playground Workflow

> Build a practical prompt evaluation workflow from start to finish

## Goal

Set up a repeatable experiment to choose the best model for a customer-support summarization use case.

```mermaid theme={null}
flowchart LR
    A[Define Use Case] --> B[Create Baseline Prompt]
    B --> C[Run on Model A]
    B --> D[Run on Model B]
    C --> E[Score Quality + Latency]
    D --> E
    E --> F[Refine Prompt/Settings]
    F --> G[Finalize Winning Setup]
```

## Step 1: Define Evaluation Criteria

Use simple scoring dimensions:

* Accuracy (0-5)
* Instruction adherence (0-5)
* Clarity/formatting (0-5)
* Response time (fast/medium/slow)

## Step 2: Create Baseline Prompt

Example prompt:

```text theme={null}
Summarize the following support ticket in exactly 3 bullets:
- Problem
- Business impact
- Recommended next action
```

## Step 3: Run Across Two Models

1. Open two chat panes.
2. Bind each pane to a different model.
3. Send the same baseline prompt.
4. Capture outputs and latency observations.

<img src="https://mintcdn.com/budecosystem-b7b14df4/2tcX_R_P-2h-Xs-7/images/image-43.png?fit=max&auto=format&n=2tcX_R_P-2h-Xs-7&q=85&s=c3141aaf6bc916cfb48353520100c86d" alt="Image" width="1920" height="871" data-path="images/image-43.png" />

## Step 4: Tune Parameters

Adjust one variable at a time:

* Temperature
* Max response length
* Stop conditions

Re-run and compare with previous outputs.

## Step 5: Save the Winner

1. Keep the best conversation in history.
2. Note final prompt and parameter values.
3. Share results with deployment owners before production rollout.

## Expected Outcome

At the end of this workflow, you should have:

* A validated prompt template
* A preferred model choice for the task
* Reproducible settings for follow-up testing
