Get from account creation to running Simulations, evaluating production calls, and optimizing your Agent's quality in under 15 minutes.

Prerequisites

Basic

Everything you need to run your first Simulation and connect production monitoring.
1. Add an Agent

An Agent represents the conversational AI system you want to test and monitor. Navigate to the Agents page in the Bluejay dashboard and click Add Agent. Fill in the Agent name, system prompt, knowledge base, and goals. Learn more in the Agents overview or jump to the Add Agent API reference.
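The four fields on the Add Agent form can be pictured as a small record. The sketch below is illustrative only: `AgentConfig`, its field names, and `missing_fields` are hypothetical and are not taken from the Bluejay API, so check the Add Agent API reference for the real schema.

```python
from dataclasses import dataclass, field


@dataclass
class AgentConfig:
    """Hypothetical stand-in for the fields on the Add Agent form."""

    name: str
    system_prompt: str
    knowledge_base: list[str] = field(default_factory=list)
    goals: list[str] = field(default_factory=list)

    def missing_fields(self) -> list[str]:
        """Return the names of form fields that are still empty."""
        values = {
            "name": self.name,
            "system_prompt": self.system_prompt,
            "knowledge_base": self.knowledge_base,
            "goals": self.goals,
        }
        return [key for key, value in values.items() if not value]


draft = AgentConfig(name="Support line", system_prompt="")
print(draft.missing_fields())
```

A check like this is a convenient way to catch an incomplete Agent definition before you start building Simulations on top of it.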
2. Create a Simulation

A Simulation groups Digital Humans together so you can run them in parallel against your Agent; think of it as a test suite. Open your Agent and click Create Simulation. Give it a name and optionally configure settings like max call duration or sequential calling. Learn more in the Simulations overview or explore Simulation Types.
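To make the "test suite" analogy concrete, here is a minimal sketch of how parallel versus sequential calling and a max call duration might interact. It places no real calls and does not use the Bluejay API; `place_call`, `run_simulation`, and `CallResult` are hypothetical names, and each caller's planned duration is simply supplied as input.

```python
from concurrent.futures import ThreadPoolExecutor
from dataclasses import dataclass


@dataclass
class CallResult:
    caller: str
    duration_s: float
    timed_out: bool


def place_call(caller: str, planned_s: float, max_duration_s: float) -> CallResult:
    """Stand-in for one Digital Human call; no real call is placed."""
    timed_out = planned_s > max_duration_s
    return CallResult(caller, min(planned_s, max_duration_s), timed_out)


def run_simulation(
    callers: dict[str, float], max_duration_s: float, sequential: bool = False
) -> list[CallResult]:
    """Run every caller, one at a time or in parallel, keeping input order."""
    workers = 1 if sequential else max(1, len(callers))
    with ThreadPoolExecutor(max_workers=workers) as pool:
        futures = [
            pool.submit(place_call, name, planned_s, max_duration_s)
            for name, planned_s in callers.items()
        ]
        return [future.result() for future in futures]


for result in run_simulation({"billing": 45.0, "angry": 400.0}, max_duration_s=300.0):
    print(result)
```

Sequential calling is modeled here as a pool with a single worker, so the same code path covers both settings and the results come back in a stable order either way.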
3. Create or Generate Digital Humans

Digital Humans are the synthetic callers that interact with your Agent during Simulations. Inside your Simulation, click Generate Digital Humans. Choose from goal adherence, red teaming, load testing, or Customer Persona modes. You can also manually create individual Digital Humans for specific scenarios. Learn more in the Digital Humans overview.
4. Run the Simulation

Click the Run button on your Simulation. Bluejay’s Digital Humans will call your Agent, and results are evaluated automatically. You can watch conversations happen in real time and review results as they complete. Learn more about Simulation Runs.
5. Add Custom Metrics

Custom Metrics define the exact quality signals you care about, such as compliance checks, empathy scoring, task completion, or anything specific to your domain. Navigate to the Custom Metrics section of your Agent, click Create Metric, and define the name, description, response type, and scoring guidance. Learn more in the Custom Metrics overview.
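The four parts of a metric definition map naturally onto a small data structure. The sketch below assumes two response types, `"boolean"` and `"score_1_to_5"`, purely for illustration; the response types Bluejay actually offers are listed in the Custom Metrics overview, and `CustomMetric` is a hypothetical class, not an SDK type.

```python
from dataclasses import dataclass

# Hypothetical response types, chosen only for this example.
RESPONSE_TYPES = {"boolean", "score_1_to_5"}


@dataclass(frozen=True)
class CustomMetric:
    name: str
    description: str
    response_type: str
    scoring_guidance: str

    def __post_init__(self) -> None:
        if self.response_type not in RESPONSE_TYPES:
            raise ValueError(f"unknown response type: {self.response_type!r}")

    def accepts(self, value: object) -> bool:
        """Check that an evaluation result fits the declared response type."""
        if self.response_type == "boolean":
            return isinstance(value, bool)
        # bool is a subclass of int in Python, so exclude it explicitly.
        is_plain_int = isinstance(value, int) and not isinstance(value, bool)
        return is_plain_int and 1 <= value <= 5


empathy = CustomMetric(
    name="empathy",
    description="Did the Agent acknowledge the caller's frustration?",
    response_type="score_1_to_5",
    scoring_guidance="5 = explicit acknowledgement, 1 = ignored entirely",
)
print(empathy.accepts(4), empathy.accepts(True))
```

Writing the scoring guidance as concrete anchors for the top and bottom of the scale, as in the example, tends to make scores easier to interpret later.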
6. Hook Up Observability

Observability evaluates your production conversations against the same Custom Metrics you use in Simulations. Go to the Observability tab for your Agent. You can connect a native integration (Retell, Vapi, Bland, ElevenLabs) or configure webhook-based ingestion directly from the Dashboard. Learn more in the Observability overview or follow the API integration tutorial.

Advanced

Level up with Alerts, Dashboards, optimization, automated testing, and Prompt Versioning.
1. Create Alerts

Alerts notify your team when a Custom Metric crosses a threshold so issues get caught before they compound. Navigate to the Alerts configuration for your Agent. Set the Custom Metric, threshold, and delivery channel (Slack, email). Learn more in the Alerts overview.
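The threshold logic behind an Alert can be sketched in a few lines. This is an illustration of the idea rather than Bluejay's implementation: `AlertRule`, its `direction` field, and `evaluate_alerts` are hypothetical, and the function only builds message strings instead of posting to Slack or sending email.

```python
from dataclasses import dataclass


@dataclass
class AlertRule:
    metric: str
    threshold: float
    channel: str  # e.g. "slack" or "email"
    direction: str = "below"  # hypothetical: fire when the value drops below

    def should_fire(self, value: float) -> bool:
        if self.direction == "below":
            return value < self.threshold
        return value > self.threshold


def evaluate_alerts(rules: list[AlertRule], latest: dict[str, float]) -> list[str]:
    """Return one message per rule whose metric crossed its threshold."""
    messages = []
    for rule in rules:
        value = latest.get(rule.metric)
        if value is not None and rule.should_fire(value):
            messages.append(
                f"[{rule.channel}] {rule.metric}={value} crossed {rule.threshold}"
            )
    return messages


rules = [
    AlertRule("task_completion", 0.8, "slack"),
    AlertRule("latency_s", 2.0, "email", direction="above"),
]
print(evaluate_alerts(rules, {"task_completion": 0.75, "latency_s": 1.5}))
```

Metrics with no recent value are skipped rather than treated as zero, which avoids false alarms when an Agent simply has not received calls yet.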
2. Create Custom Dashboards

Dashboards aggregate Simulation and production data into a single view. Open the Dashboards section to see at-a-glance health scores, trend sparklines, and Alert badges across all your Agents. Customize views to focus on the Custom Metrics that matter most. Learn more in the Dashboards overview.
3. Optimize Metrics via Metrics Lab

Metrics Lab lets you prototype, test, and refine Custom Metrics before deploying them. Open Metrics Lab from the sidebar. Import or draft Custom Metrics, annotate sample conversations, and run side-by-side comparisons to find the most reliable scoring approach. Promote validated Custom Metrics to production with one click. Learn more in the Metrics Lab overview.
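One simple way to think about a side-by-side comparison is agreement with your annotations: the variant whose scores match the human labels most often is the more reliable one. The sketch below assumes pass/fail metrics and uses plain agreement rate; Metrics Lab may compare variants differently, and `agreement_rate` and `best_variant` are hypothetical helper names.

```python
def agreement_rate(predicted: list[bool], annotated: list[bool]) -> float:
    """Fraction of conversations where the metric matches the human label."""
    if len(predicted) != len(annotated):
        raise ValueError("predicted and annotated must have the same length")
    if not annotated:
        raise ValueError("at least one annotated conversation is required")
    matches = sum(p == a for p, a in zip(predicted, annotated))
    return matches / len(annotated)


def best_variant(variants: dict[str, list[bool]], annotated: list[bool]) -> str:
    """Name of the metric variant that agrees most with the annotations."""
    return max(variants, key=lambda name: agreement_rate(variants[name], annotated))


annotated = [True, False, True, True]
variants = {
    "strict": [True, False, False, False],
    "lenient": [True, False, True, False],
}
print(best_variant(variants, annotated))
```

With only a handful of annotated conversations the agreement rate is noisy, so it is worth annotating a broader sample before promoting a variant.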
4. Test Agents via Workflows

Workflows let you chain Simulation Runs, evaluations, and notifications into automated pipelines. Navigate to Workflows and create a new Workflow. Use the visual builder to add steps (Simulation Runs, evaluations, and notification nodes) and connect them into a pipeline. Set a schedule to run automatically. Learn more in the Workflows overview.
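Conceptually, a Workflow is a list of steps where each one receives the output of the step before it. The sketch below models that with plain functions passing a context dictionary along; the step functions and their placeholder data are hypothetical and do not reflect how the visual builder or the Bluejay API represents nodes.

```python
from typing import Callable, Optional

Step = Callable[[dict], dict]


def run_simulation_step(ctx: dict) -> dict:
    """Placeholder for a Simulation Run node; records made-up transcripts."""
    return {**ctx, "transcripts": ["caller asked for refund", "caller asked for hours"]}


def evaluate_step(ctx: dict) -> dict:
    """Placeholder evaluation node: passes if any transcript was produced."""
    return {**ctx, "passed": len(ctx.get("transcripts", [])) > 0}


def notify_step(ctx: dict) -> dict:
    """Placeholder notification node: builds a message instead of sending it."""
    status = "passed" if ctx.get("passed") else "failed"
    return {**ctx, "notification": f"Workflow {status}"}


def run_workflow(steps: list[Step], ctx: Optional[dict] = None) -> dict:
    """Run each step in order, feeding every step the previous step's output."""
    current = dict(ctx or {})
    for step in steps:
        current = step(current)
    return current


result = run_workflow([run_simulation_step, evaluate_step, notify_step])
print(result["notification"])
```

Because every step has the same shape, reordering or dropping nodes only changes the list passed to `run_workflow`, which mirrors how nodes are rearranged in a visual builder.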
5. Version Prompts via Prompt Versioning

Prompt Versioning tracks changes to your Agent’s system prompt over time. Open the Prompts section of your Agent. Create a new version with an updated prompt, add a commit message, and optionally tag it with labels like production or staging. Compare performance across Simulation Runs and roll back when needed. See the Create Prompt Version API reference.
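The version, commit message, and label model can be sketched as an append-only history in which a label such as production points at exactly one version. In this model, rolling back is just moving the label to an earlier version. `PromptHistory` and its methods are hypothetical and are not the Create Prompt Version API.

```python
from dataclasses import dataclass, field


@dataclass
class PromptVersion:
    number: int
    prompt: str
    message: str
    labels: set[str] = field(default_factory=set)


class PromptHistory:
    """Append-only list of prompt versions with movable labels."""

    def __init__(self) -> None:
        self.versions: list[PromptVersion] = []

    def commit(self, prompt: str, message: str) -> PromptVersion:
        version = PromptVersion(len(self.versions) + 1, prompt, message)
        self.versions.append(version)
        return version

    def tag(self, number: int, label: str) -> None:
        """Move a label onto one version, removing it from any other."""
        if not 1 <= number <= len(self.versions):
            raise IndexError(f"no version {number}")
        for version in self.versions:
            version.labels.discard(label)
        self.versions[number - 1].labels.add(label)

    def current(self, label: str) -> PromptVersion:
        for version in self.versions:
            if label in version.labels:
                return version
        raise KeyError(label)


history = PromptHistory()
history.commit("You are a support agent.", "initial prompt")
history.commit("You are a concise support agent.", "tighten tone")
history.tag(2, "production")
history.tag(1, "production")  # roll back by moving the label
print(history.current("production").message)
```

Keeping old versions intact and only moving labels means a rollback never destroys the prompt you rolled back from, so you can still compare its Simulation Runs later.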

Next Steps

Key Concepts

Deep dive into Agents, Digital Humans, Custom Metrics, and more.

Simulation Types

Explore goal adherence, red teaming, load testing, and replay strategies.

Observability Guide

Step-by-step guide to connecting your production pipeline.

API Reference

Full endpoint documentation for every Bluejay API.