Lesson: Monitoring, Metrics & Dashboards · Topic 1 of 2
Lesson summary
Testing with simulations catches problems before launch, but you also need to watch what happens once real customers start talking to your agent. Bluejay’s observability pipeline takes your production calls, runs your Custom Metrics against them, and surfaces the results on dashboards — automatically. This topic shows you how to connect that pipeline.Objectives
- Understand the observability data flow from production call to scored result
- Send your first production call to Bluejay via the API or a provider integration
- Verify that evaluation results appear in the dashboard
Video walkthrough
Video coming soon. Follow the written walkthrough below in the meantime.
Walkthrough
Understand the data flow
Here is what happens when a production call gets evaluated:The key idea: Bluejay does not sit in the middle of your calls. Instead, it receives completed conversations after they happen and evaluates them.
Choose your integration method
There are two ways to send production calls to Bluejay:Option A: API integration — Send transcripts directly to the
/evaluate endpoint. This works with any agent regardless of provider. See the Observability API Integration Tutorial for step-by-step instructions.Option B: Provider integration — If your agent runs on Retell, Vapi, Bland, or ElevenLabs, you can connect the provider directly. Bluejay will automatically receive call data without extra code on your side. See the integration guides:Either way, the result is the same: your production calls appear in Bluejay with metric scores attached.Send a test call
Before turning on the full pipeline, send a single call to make sure everything is connected:
- API path: Make a POST request to
/evaluatewith a transcript and your agent identifier. - Provider path: Trigger a test call through your provider and confirm it appears in Bluejay.
Review evaluation results
Click into the call log entry to see:
- Transcript — The full conversation, turn by turn
- Custom Metric scores — Pass/fail, ratings, or qualitative assessments
- Built-in checks — Hallucination detection (with reasoning) and redundancy detection
- Metadata — Call duration, latency, turn counts, agent speaking percentage
Activity
Knowledge check
What data does Bluejay need at minimum for observability?
What data does Bluejay need at minimum for observability?
At minimum, Bluejay needs a transcript and the agent identifier. Additional metadata like call duration, caller information, and timestamps enriches the evaluation but is not required.
Can you re-evaluate a call with different metrics?
Can you re-evaluate a call with different metrics?
Yes. Use the Re-evaluate action on any call log entry to re-run it against updated or additional Custom Metrics. This is useful when you add new metrics or refine existing ones.
What is the difference between the API integration and a provider integration?
What is the difference between the API integration and a provider integration?
The API integration requires you to send transcripts to Bluejay yourself (via the
/evaluate endpoint). A provider integration connects your voice provider (Retell, Vapi, Bland, ElevenLabs) directly so Bluejay receives call data automatically. Both produce the same evaluation results.