Setting Up Observability

Lesson: Monitoring, Metrics & Dashboards · Topic 1 of 2

Lesson summary

Testing with simulations catches problems before launch, but you also need to watch what happens once real customers start talking to your agent. Bluejay’s observability pipeline takes your production calls, runs your Custom Metrics against them, and surfaces the results on dashboards — automatically. This topic shows you how to connect that pipeline.

Objectives

Understand the observability data flow from production call to scored result
Send your first production call to Bluejay via the API or a provider integration
Verify that evaluation results appear in the dashboard

Video walkthrough

Video coming soon. Follow the written walkthrough below in the meantime.

Walkthrough

Understand the data flow

Here is what happens when a production call gets evaluated:

Your Agent (production)
  │
  ├── call completes → transcript + metadata
  │
  └── sent to Bluejay (API or provider integration)
       │
       ├── runs Custom Metrics against the transcript
       ├── detects hallucinations and redundancy
       └── stores results → Dashboard, Call Logs, Alerts

The key idea: Bluejay does not sit in the middle of your calls. Instead, it receives completed conversations after they happen and evaluates them.

Choose your integration method

There are two ways to send production calls to Bluejay:Option A: API integration — Send transcripts directly to the /evaluate endpoint. This works with any agent regardless of provider. See the Observability API Integration Tutorial for step-by-step instructions.Option B: Provider integration — If your agent runs on Retell, Vapi, Bland, or ElevenLabs, you can connect the provider directly. Bluejay will automatically receive call data without extra code on your side. See the integration guides:

Either way, the result is the same: your production calls appear in Bluejay with metric scores attached.

Send a test call

Before turning on the full pipeline, send a single call to make sure everything is connected:

API path: Make a POST request to /evaluate with a transcript and your agent identifier.
Provider path: Trigger a test call through your provider and confirm it appears in Bluejay.

Once the call is processed, navigate to the Observability page in the dashboard.

Review evaluation results

Click into the call log entry to see:

Transcript — The full conversation, turn by turn
Custom Metric scores — Pass/fail, ratings, or qualitative assessments
Built-in checks — Hallucination detection (with reasoning) and redundancy detection
Metadata — Call duration, latency, turn counts, agent speaking percentage

If a metric score looks wrong, you can click Re-evaluate to re-run the evaluation with updated or additional metrics.

Activity

Hands-on exercise: Using the API or your provider integration, send one production call transcript to Bluejay. Navigate to the Observability page and click into the call log entry. Verify that you can see the transcript, metric scores, and metadata. If a metric score surprises you, try re-evaluating the call.

Knowledge check

What data does Bluejay need at minimum for observability?

At minimum, Bluejay needs a transcript and the agent identifier. Additional metadata like call duration, caller information, and timestamps enriches the evaluation but is not required.

Can you re-evaluate a call with different metrics?

Yes. Use the Re-evaluate action on any call log entry to re-run it against updated or additional Custom Metrics. This is useful when you add new metrics or refine existing ones.

What is the difference between the API integration and a provider integration?

The API integration requires you to send transcripts to Bluejay yourself (via the /evaluate endpoint). A provider integration connects your voice provider (Retell, Vapi, Bland, ElevenLabs) directly so Bluejay receives call data automatically. Both produce the same evaluation results.

Next topic

Continue to Topic 2: Metrics, Dashboards & Alerts.

Bluejay University

Lesson 1: Foundations

Lesson 2: Testing with Simulations

Lesson 3: Monitoring, Metrics & Dashboards

Lesson 4: API & Automation

Setting Up Observability

Lesson summary

Objectives

Video walkthrough

Walkthrough

Understand the data flow

Choose your integration method

Send a test call

Review evaluation results

Activity

Knowledge check

Next topic

Bluejay University

Lesson 1: Foundations

Lesson 2: Testing with Simulations

Lesson 3: Monitoring, Metrics & Dashboards

Lesson 4: API & Automation

​Lesson summary

​Objectives

​Video walkthrough

​Walkthrough

Understand the data flow

Choose your integration method

Send a test call

Review evaluation results

​Activity

​Knowledge check

​Next topic

Lesson summary

Objectives

Video walkthrough

Walkthrough

Activity

Knowledge check

Next topic