Docs/Test Automation/Analyze and Improve

Analyze and Improve

Execution is only half the process. The real value of simulation comes from analysis and targeted improvements.

Understand Criteria Evaluation

In simulation result detail, each conversation step is evaluated against your criteria.

You will see outcomes such as:

Use these outcomes to identify exactly where quality breaks.

Prioritize by user risk:

Based on failure type, update:

Use this checklist before publishing a new model:

Check	Pass condition
Baseline run exists	Same scenario set already executed on old model
Critical scenarios stable	No critical regression in key categories
Criteria quality preserved	Match quality remains acceptable across high-risk steps
Failure review completed	Every contradiction/unmentioned on critical steps is reviewed
Action plan documented	Any remaining gaps have owner and timeline

If one of these is not met, keep the new model in draft and continue iteration.

Treat simulation as an ongoing loop:

This loop keeps conversation quality strong even as your model, resources, and automation grow over time.