Dev Tools · Advanced · Course

Monitor AI agents with tracing and observability

See what your agents actually did across prompts, tool calls, latency, failures, and handoffs instead of debugging by guesswork.

90 min · LangSmith, OpenTelemetry, Grafana · 10xCareer Team

Choose your training style

Pick the format that matches the level of support you want.

Self-paced (Available)

Start immediately and work through the training on your own schedule.

Free

Human trainer (Coming soon)

Join a guided cohort or workshop format when live delivery is available.

$99 · Guided by an instructor

AI trainer (Coming soon)

Practice with an AI-guided trainer experience tailored to the course topic.

$9 · Personalized guidance

Overview

Once an agent is in production, the hardest problems are often invisible: slow tool calls, brittle prompts, looping behavior, and confusing handoffs. This course teaches you how to instrument agents so you can trace execution and diagnose failures quickly.

Who it's for

  • Developers running agent workflows in staging or production
  • Platform teams responsible for AI reliability
  • Product teams that need better visibility into agent behavior

What you'll build

  • A tracing setup that records prompts, tool calls, outputs, and timing
  • A dashboard for identifying slow paths, failure clusters, and retry loops
  • A debugging workflow for reproducing and fixing agent incidents

Prerequisites

  • An agent workflow with multiple steps or tool calls
  • Access to logs or a staging environment
  • Basic familiarity with monitoring concepts

Tools and setup

  1. Decide which spans, events, and metadata matter most
  2. Instrument prompts, tool invocations, and final outputs
  3. Capture enough context to reproduce failures without leaking sensitive data
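The three steps above can be sketched with a minimal, stdlib-only tracer. In practice you would use LangSmith or the OpenTelemetry SDK; the `Span` and `Tracer` names below are illustrative assumptions, not those libraries' APIs.

```python
import time
import uuid
from dataclasses import dataclass, field

@dataclass
class Span:
    """One traced step: a prompt, a tool invocation, or a final output."""
    name: str
    trace_id: str
    attributes: dict = field(default_factory=dict)
    start: float = field(default_factory=time.monotonic)
    end: float = None

    def finish(self):
        self.end = time.monotonic()

    @property
    def duration_ms(self):
        return (self.end - self.start) * 1000

class Tracer:
    """Collects spans for one agent run under a shared trace id."""
    def __init__(self):
        self.trace_id = uuid.uuid4().hex
        self.spans = []

    def span(self, name, **attributes):
        s = Span(name=name, trace_id=self.trace_id, attributes=attributes)
        self.spans.append(s)
        return s

# Instrument one tool invocation: record metadata, not raw user content.
tracer = Tracer()
s = tracer.span("tool_call", tool="search", query_len=42)
# ... call the tool here ...
s.finish()
```

Note that the span records a `query_len` attribute rather than the query itself; that is step 3 in miniature, capturing enough context to debug without storing sensitive text.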

Modules

Module 1: Trace the execution path

You will record the agent's steps from user input to final action so you can reconstruct exactly what happened when something goes wrong.

Module 2: Build useful dashboards

You will track failure rates, tool latency, retry frequency, and cost so the team can spot unhealthy behavior early.
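Before wiring up Grafana panels, it helps to define the aggregates each panel should show. A hedged sketch of three of them, assuming traces are flat records with `status`, `latency_ms`, and `retries` fields (the field names are illustrative):

```python
import math

# Hypothetical trace records; real ones would come from your trace store.
traces = [
    {"tool": "search",   "status": "ok",    "latency_ms": 120, "retries": 0},
    {"tool": "search",   "status": "error", "latency_ms": 900, "retries": 2},
    {"tool": "calendar", "status": "ok",    "latency_ms": 60,  "retries": 0},
    {"tool": "search",   "status": "ok",    "latency_ms": 300, "retries": 1},
]

def failure_rate(traces):
    """Fraction of runs that ended in an error."""
    return sum(t["status"] == "error" for t in traces) / len(traces)

def p95_latency(traces):
    """Nearest-rank 95th-percentile latency, in milliseconds."""
    lat = sorted(t["latency_ms"] for t in traces)
    return lat[max(0, math.ceil(0.95 * len(lat)) - 1)]

def retry_frequency(traces):
    """Fraction of runs that needed at least one retry."""
    return sum(t["retries"] > 0 for t in traces) / len(traces)

print(failure_rate(traces))    # 0.25
print(p95_latency(traces))     # 900
print(retry_frequency(traces)) # 0.5
```

The design point is that each dashboard panel answers one question ("are we failing more?", "which tool got slow?"), which is the inverse of the common mistake of collecting data first and deciding what to ask later.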

Module 3: Turn traces into fixes

You will use the collected traces to isolate brittle prompts, bad tool contracts, and ambiguous routing logic.
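One way to turn traces into candidate fixes is to cluster failures by tool and flag repeated identical failing calls, the signature of a looping agent. A sketch, assuming per-step records with `tool`, `status`, and `input` fields (names are illustrative):

```python
from collections import Counter

# Hypothetical per-step records from one agent run.
steps = [
    {"tool": "search",   "status": "error", "input": "q1"},
    {"tool": "search",   "status": "error", "input": "q1"},
    {"tool": "search",   "status": "error", "input": "q1"},
    {"tool": "calendar", "status": "ok",    "input": "tomorrow"},
]

def failures_by_tool(steps):
    """Cluster failures so brittle tool contracts stand out."""
    return Counter(s["tool"] for s in steps if s["status"] == "error")

def looks_like_retry_loop(steps, threshold=3):
    """Flag N consecutive identical failing calls: a classic stuck loop."""
    run, prev = 0, None
    for s in steps:
        key = (s["tool"], s["input"], s["status"])
        if s["status"] == "error" and key == prev:
            run += 1
        else:
            run = 1 if s["status"] == "error" else 0
        if run >= threshold:
            return True
        prev = key
    return False

print(failures_by_tool(steps))       # Counter({'search': 3})
print(looks_like_retry_loop(steps))  # True
```

A tool whose failures dominate the counter usually has a contract problem (ambiguous arguments, undocumented error shape) rather than a model problem, which narrows the fix considerably.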

Deliverable

An observability setup that makes agent behavior measurable, debuggable, and easier to improve.

Common mistakes

  • Logging final answers but not intermediate tool behavior
  • Collecting too much raw data without defining the questions it should answer
  • Ignoring privacy and retention concerns when storing traces
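The privacy point is the easiest to get wrong: traces should be scrubbed before they leave the process, not after they land in storage. A minimal redaction pass; the patterns and field names are illustrative assumptions, not a complete PII filter.

```python
import re

# Illustrative patterns only; a real deployment needs a vetted PII filter.
PATTERNS = {
    "email": re.compile(r"[\w.+-]+@[\w-]+\.[\w.]+"),
    "card": re.compile(r"\b(?:\d[ -]?){13,16}\b"),
}

def redact(text):
    """Replace each matched pattern with a labeled placeholder."""
    for label, pattern in PATTERNS.items():
        text = pattern.sub(f"<{label}>", text)
    return text

def redact_span(span):
    """Scrub free-text fields before a span is exported or stored."""
    return {k: redact(v) if isinstance(v, str) else v for k, v in span.items()}

span = {"tool": "email_send", "input": "reach me at ada@example.com", "latency_ms": 80}
print(redact_span(span)["input"])  # reach me at <email>
```

Running redaction at export time also simplifies retention, since nothing sensitive was persisted in the first place.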

Next steps

Connect observability to incident reviews, eval pipelines, release gates, and model-routing decisions.