Docs/Test Automation/Run and Track

Run and Track

After scenario design, execution tells you whether your agent actually performs as expected. This page explains how to run simulations and monitor execution status confidently.

Step-by-Step: Run a Scenario

1) Open an existing conversation simulation

Go to Simulation -> Conversation.
Open a scenario from the list.

2) Start execution

Click Run on the scenario form page.
Confirm when prompted.
You will be redirected to the simulation result detail page.

3) Track run state

Watch the execution status while it processes:

running: simulation is still processing
completed: all steps finished
failed: one or more steps failed
stopped: execution was manually stopped

4) Use controls during and after execution

Refresh while running to fetch latest status
Stop if you need to cancel the run
Run Again after completion for regression checks
Delete old runs you no longer need

Use Results List Efficiently

Open Simulation -> Results to manage many executions.

Use filters and sorting to:

Find runs by date range.
Search by execution code.
Filter by status.
Sort by latest or oldest.

This helps you quickly inspect relevant runs after major changes.

Execution Best Practices

Run critical scenarios first after every major update.
Keep a consistent run cadence (daily or weekly depending on traffic risk).
Re-run the same scenario set before and after model changes.
Use execution codes in team discussions for traceability.
Stop and investigate immediately when high-priority scenarios fail.

Model-Switch Regression Mini-Process

Use this lightweight process whenever you change model:

Run baseline scenarios on current model.
Switch model in draft settings.
Run the exact same scenarios.
Compare criteria outcomes.
Publish only after critical scenarios pass.

Next Step

Now interpret criteria quality and convert findings into improvements:

Analyze and Improve

PreviousDesign Scenarios

NextAnalyze and Improve