Introduction to Observability

2. Intro and Setup

Learn the fundamentals of observability—metrics, logs, and traces—and how they help you monitor and debug distributed systems. Then set up local Ray observability by installing Ray, launching Prometheus and Grafana, starting a two-node Ray cluster, and using the Ray Dashboard to verify and explore collected metrics.

Observability Introduction
Observability Overview
Setting Up Local Ray Observability

3. Ray Anyscale Introduction

Learn the core observability concepts in Ray—logs, metrics, and events—and how to use the Ray Dashboard to monitor application and cluster behavior. Then compare Ray’s native tooling with Anyscale’s managed, contextualized observability (persistent metrics, workload context, and post-failure visibility) through an example job that triggers memory pressure and OOM failures.

Ray and Anyscale Observability Introduction
Ray Observability
Anyscale Observability
Example

4. Ray Anyscale Observability in Detail

Learn how to monitor and debug Ray workloads using Ray and Anyscale observability dashboards, with hands-on examples that show when to use each view. You’ll explore Ray Data pipeline execution status, logs, and metrics (and Anyscale-specific workload dashboards) to identify bottlenecks and operational issues.

Ray and Anyscale Observability in Detail
Data Pipeline Observability (Ray Data)
Web Application Observability (Ray Serve)
