May 21-22, 2026
Learn more and Register to Attend

The Sched app allows you to build your schedule, but is not a substitute for your event registration. You must be registered for Observability Summit North America 2026.

Please note: This schedule is automatically displayed in Central Daylight Time (UTC -5). To see the schedule in your preferred timezone, select from the drop-down menu located at the bottom of the menu to the right.

The schedule is subject to change.
Thursday, May 21
 

10:20am CDT

[CANCELLATION] Scaling a Proprietary-to-OpenTelemetry Migration With AI-Assisted, Spec-Driven Workflows - Ying Mo & Paras Kampasi, IBM
Thursday May 21, 2026 10:20am - 10:45am CDT
This talk presents a practical methodology for migrating a large proprietary observability platform to an OpenTelemetry-native architecture, using a GenAI-assisted workflow paired with a robust spec-driven strategy. Faced with hundreds of custom Java-based sensors, the engineering team designed a spec-driven conversion process that leverages GenAI to extract specifications, generate unit tests, and assist in implementing Go-based OpenTelemetry receivers. Each stage incorporates human review and test feedback loops to address the reliability limitations of GenAI and ensure functional correctness.

Additionally, a data-driven feasibility evaluation was conducted prior to large-scale conversion, where defined task types were benchmarked with and without GenAI to quantify effort savings and highlight where GenAI provides the greatest value.

Attendees will learn a reproducible workflow for large-scale migrations from proprietary to OpenTelemetry, how to pair GenAI with automated testing to manage risk, and insights on where GenAI accelerates real-world engineering tasks without compromising quality.
Speakers

Ying Mo

Senior Software Engineer, IBM
Ying Mo is a Senior Software Engineer at IBM. He has recently been working on IBM Instana, an observability platform, leading the engineering team that is transforming the product to be OpenTelemetry-native. He is always enthusiastic about bringing innovative ideas into the product by leveraging open source technology...

Paras Kampasi

Technical Product Manager, IBM
I work at the intersection of OpenTelemetry, observability, and modern cloud-native practices, helping teams make complex systems understandable and reliable. I speak and write about practical ways to apply open standards, close feedback loops between SREs and product teams, and turn...
Level One | Ballroom B
  End-User Case Studies

11:20am CDT

AI-Powered Root Cause Analysis at Scale: From Theory To Production Lessons From Nubank's 120M+ Cus - Letícia Mota & Yevgeny Gladun, Nubank
Thursday May 21, 2026 11:20am - 11:45am CDT
This session presents an AI-powered SRE Agent designed to autonomously orchestrate complex, multi-source investigations by querying internal observability providers and knowledge bases.
A primary focus is the "Data Volume Problem." Modern observability systems generate terabytes of metrics and logs daily; at Nubank’s scale, the Prometheus MCP alone has more than 23,000 metrics available, while log queries can span billions of rows. The team overcame LLM context limits through on-premises data filtering, intelligent summarization, and selective context assembly. This architecture utilizes "Expert Guides" to reduce 23,000 raw metrics to approximately 14 relevant data points before LLM processing.
The talk covers multi-source orchestration using the Model Context Protocol (MCP) for pluggable tool discovery, allowing the AI to progressively load and correlate only the relevant observability sources.
The platform enables the delivery of expert instructions for any specific scenario through targeted, versioned prompts. This transformation allows the platform to scale across the enterprise, performing virtually any investigative task beyond its original root cause analysis mission.
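The metric-reduction step described in the abstract can be pictured with a minimal Python sketch. It is not Nubank's implementation: the `ExpertGuide` structure, the glob-pattern matching, and the 14-item cap are assumptions chosen only to illustrate filtering a large metric catalog down to a small, scenario-specific set before any text reaches an LLM.

```python
from dataclasses import dataclass
from fnmatch import fnmatchcase


@dataclass(frozen=True)
class ExpertGuide:
    """Hypothetical guide: a scenario name plus metric-name glob patterns."""
    scenario: str
    metric_patterns: tuple[str, ...]


def select_metrics(all_metrics: list[str], guide: ExpertGuide) -> list[str]:
    """Keep only the metric names that match any of the guide's patterns."""
    return [
        name for name in all_metrics
        if any(fnmatchcase(name, pattern) for pattern in guide.metric_patterns)
    ]


def build_context(selected: list[str], limit: int = 14) -> str:
    """Assemble a compact, size-capped context string for the LLM prompt."""
    return "\n".join(sorted(selected)[:limit])
```

With a guide such as `ExpertGuide("http-latency", ("http_server_*",))`, only metrics whose names start with `http_server_` survive the filter, so the prompt size depends on the guide rather than on the size of the full catalog.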
Speakers

Letícia Mota

Nubank
Letícia is a Product Manager at Nubank with 8+ years of experience. After working with data & image recognition products, she now works with Resilience and Troubleshooting products, including a DR Test Platform and an SRE Agent for Nubank. ...

Yevgeny Gladun

Staff Runtime Platforms Engineer, Nubank
Yevgeny Gladun is a Staff Engineer at Nubank with nearly 20 years of software development experience. Over his four-year tenure at Nubank, he transitioned from scaling Data ETL pipelines to deep architectural analysis of microservice interactions. As part of the Runtime Platforms...
Level One | Ballroom B
  AI and MCP in Observability

1:25pm CDT

Unified End-to-End Observability: How Comcast Generates SpanMetrics at Enterprise Scale - Raghu Vamshi Challa, Comcast
Thursday May 21, 2026 1:25pm - 1:50pm CDT
Enterprises often struggle with the "black box" nature of proprietary APM tools and the high cost of distributed tracing at scale. In this session, we will demonstrate how Comcast tackled this challenge by migrating 350 critical applications from AppDynamics to a cloud-native OpenTelemetry (OTel) stack, achieving a truly unified end-to-end observability experience.

We will pull back the curtain on the architecture that powers this migration. Specifically, we will show how we leveraged the OpenTelemetry Collector to generate Request, Error, and Duration (R.E.D.) metrics from trace data using the SpanMetrics connector. A key highlight will be our unique deployment of Conduit, which serves as a resilient transport layer to ensure data integrity and effective load balancing in a high-volume environment.

Attendees will leave with a blueprint for breaking free from APM vendor lock-in. To help the community fast-track this transition, we will also be sharing and walking through our reusable, battle-tested Grafana dashboards that can be leveraged by any enterprise.
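For readers unfamiliar with R.E.D. metrics, the idea of deriving them from trace data can be sketched in a few lines of Python. This is an illustration only, not the SpanMetrics connector's code or Comcast's pipeline: the `Span` record, the grouping key, and the bucket boundaries are assumptions made for the example.

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class Span:
    """Hypothetical, simplified span record (not the OTLP data model)."""
    service: str
    name: str
    duration_ms: float
    is_error: bool


def red_metrics(spans, buckets_ms=(100.0, 500.0, 1000.0)):
    """Aggregate spans into per-(service, span name) RED metrics."""
    out = defaultdict(lambda: {
        "requests": 0,
        "errors": 0,
        "duration_sum_ms": 0.0,
        # Cumulative histogram: count of spans with duration <= bound.
        "duration_buckets": {bound: 0 for bound in buckets_ms},
    })
    for span in spans:
        m = out[(span.service, span.name)]
        m["requests"] += 1
        m["errors"] += int(span.is_error)
        m["duration_sum_ms"] += span.duration_ms
        for bound in buckets_ms:
            if span.duration_ms <= bound:
                m["duration_buckets"][bound] += 1
    return dict(out)
```

Each span contributes one request, at most one error, and one duration observation, which is why request rate, error rate, and latency distributions can all be computed from traces without separate instrumentation.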
Speakers

Raghu Challa

Comcast Engineer 6, Software Development & Engineering - Backend Engineering, Comcast
Raghu is an Observability Lead at Comcast, driving the enterprise-wide migration from legacy APM tools to OpenTelemetry. He specializes in designing high-scale telemetry pipelines that process massive volumes of trace data. Raghu is passionate about democratizing observability and...
Level One | Ballroom B
  End-User Case Studies
 