Lesson: Testing with Simulations · Topic 2 of 2
Lesson summary
Running a single simulation is a good start, but real-world QA requires broader coverage. In this topic you will learn how to pick the right simulation type for your agent, design a test plan that covers the scenarios that matter most, and organize your Digital Humans into reusable Communities so you can run tests at scale.Objectives
- Choose the correct simulation type for your agent’s connection method
- Design a test plan that covers happy paths, edge cases, and failure modes
- Use Customer Traits to add realistic variability to Digital Humans
- Organize Digital Humans into Communities for reuse and batch testing
Video walkthrough
Video coming soon. Follow the written walkthrough below in the meantime.
Walkthrough
Pick the right simulation type
Bluejay supports several simulation types. Each one matches a different way your agent communicates:
Pick the type that matches your production setup. If your agent handles phone calls, use Voice. If it is a text-based chatbot, use HTTP Webhook. For the full reference, see Simulation Types.
| Type | Best for | Example |
|---|---|---|
| Voice (Telephony / SIP) | Phone-based agents | A support line that customers call |
| LiveKit | Real-time WebRTC voice agents | A browser-based voice assistant |
| WebSocket | Streaming text or audio agents | A chat agent using persistent connections |
| HTTP Webhook | Request/response chat agents | A chatbot that replies to POST requests |
Build a test plan
A strong test plan goes beyond happy paths. Cover these four areas:
- Happy paths — The most common interactions your agent handles every day. Example: a customer calling to check an order status and getting a clear answer.
- Edge cases — Unusual inputs like long pauses, interruptions, background noise, or requests in a different language. Example: the caller goes silent for 30 seconds mid-sentence.
- Failure modes — What happens when the agent cannot help? Example: the caller asks for something outside the agent’s scope and the agent should escalate.
- Regression checks — Re-running the same simulation after you change a prompt, model, or knowledge base. This catches quality dips before they reach production.
Craft effective Digital Humans
A great Digital Human has three parts:
- Persona — Demographics, personality, and communication style. Is the caller patient or frustrated? Formal or casual?
- Scenario script — What the caller wants to accomplish, any constraints, and what to do if the agent cannot help. Write it in second person: “You are calling to cancel your subscription. You are upset because you were charged twice.”
- Customer Traits — Structured attributes that add realism. Traits are typed values like account type, preferred language, urgency level, or simulated PII (name, order number, account ID).
Organize with Communities
When you have built a library of Digital Humans, group them into Communities. A Community lets you:
- Attach a batch of personas to a Simulation with one click
- Keep test coverage consistent across simulation runs
- Share personas across teams and simulations
- Run batch tests — start a Community-based simulation run and Bluejay will call your agent once for every Digital Human in the group
Activity
Knowledge check
When should you use an HTTP Webhook simulation instead of a Voice simulation?
When should you use an HTTP Webhook simulation instead of a Voice simulation?
Use HTTP Webhook when your agent communicates through synchronous HTTP request/response (like a chatbot API) rather than a persistent voice or streaming connection.
Why use Communities instead of creating Digital Humans directly inside each simulation?
Why use Communities instead of creating Digital Humans directly inside each simulation?
Communities let you reuse the same set of personas across multiple simulations. This keeps test coverage consistent, saves time, and makes it easy to grow your test library as you discover new scenarios.
What are Customer Traits?
What are Customer Traits?
Customer Traits are structured attributes you attach to a Digital Human — things like account type, language preference, urgency level, or simulated personal information. They add realistic variability so your tests cover a wider range of real-world situations.