Nielsen is a global leader in audience measurement, data, and analytics, powering decision-making for media companies, advertisers, and brands worldwide. As part of its AI transformation strategy, Nielsen launched ‘Ask Nielsen’; a multi-agent AI analytics copilot that enables users to query complex datasets conversationally and receive business-ready insights in real time.
‘Ask Nielsen’ is not a simple chatbot. It is a multi-agent orchestration system that includes a master routing agent, domain-specific agents for Campaign Analysis, Competitive Intelligence, Audience, and Content, and sub-agents for natural language processing, planning, SQL generation, code execution, and insight narration.
A single user request may trigger multi-step reasoning, SQL query generation, Python code execution, data visualization, narrative insight generation, and real-time guardrail validation. In a representative trace, that means over 1,000 spans, 100+ LLM calls, and 50+ guardrail calls.
As ‘Ask Nielsen’ scaled, the team identified two categories of practical risk that had to be addressed before the system could serve enterprise clients reliably.
The objective was clear: ship fast without compromising safety, consistency, or compliance.
Traditional observability tools could surface traces, but not answer the harder question: which step in the multi-agent flow caused degradation, and why. Without that visibility, both risks remained unresolved until the point of damage. Nielsen needed an alternative: a runtime trust layer that could operate at the agent level, not just report on what had already gone wrong.
Rather than treating governance as a periodic review, Nielsen built for continuous oversight. Their framework emphasized three outcomes: keeping the system within intended operational boundaries in the face of adversarial behavior, maintaining brand safety across key toxicity categories, and improving quality and reliability through continuous evaluation and production observability, supported by actionable diagnostics.
Nielsen partnered with Fiddler AI to serve as the runtime trust and governance layer for ‘Ask Nielsen’, integrating directly into the multi-agent architecture alongside existing orchestration and telemetry systems.
The firm uses Fiddler as a core component of their agentic solution by integrating Fiddler Trust Models for in-environment evaluation, and Guardrails to support safe and compliant LLM deployments. The team tracks 40+ metrics across reliability, performance, and business impact, each paired with diagnostics to support root cause analysis.
Nielsen operates a rigorous evaluation lifecycle built around golden datasets per primary agent, offline benchmarking, online monitoring using user feedback and baseline comparisons, and drift detection using clustering techniques. Fiddler bridges offline and online evaluation by continuously monitoring runtime performance and surfacing regressions when production metrics fall below validated baselines.
To make observability actionable, the team uses Fiddler's query clustering to identify patterns in user queries, diagnose behavior across similar query types, and proactively address clusters where the model may be underperforming.
‘Ask Nielsen’ enforces Fiddler Guardrails for Safety, PII, and Faithfulness at critical checkpoints within the LLM gateway layer, ensuring policy compliance before responses reach users. Guardrails operate with synchronous enforcement paired with asynchronous monitoring, delivering governance without degrading the speed of customer interactions.
‘Ask Nielsen’ exports OpenTelemetry traces for agentic workflows. Fiddler transforms these traces into actionable intelligence, enabling the team to isolate degradation to specific agent steps, understand whether issues originate in natural language processing, SQL generation, code execution, or insight narration, monitor guardrail enforcement across complex flows, and alert on completion rate and faithfulness drops. This step-level visibility allows teams to resolve issues faster and deploy improvements with confidence.
Nielsen continues to expand ‘Ask Nielsen’'s capabilities across additional datasets, client segments, and agent types. By pairing continuous evaluation with production observability, and enforcing safety in the runtime path, Nielsen built a system that is both usable and trustworthy. The result is a more interactive way for customers to access insights, backed by measurable governance and brand-safety performance.
With Fiddler as the runtime trust layer, Nielsen can innovate confidently, ensuring that as AI systems grow more powerful, they remain safe, governed, and reliable.
To learn more about the Fiddler Unified AI Observability and Security Platform, book a demo or read additional case studies