An example of batch inference implemented with Ray Data.
Learn how to run scalable batch inference with Ray Data by loading a Hugging Face dataset, converting it to a Ray Dataset, and applying a SentenceTransformer embedding model in parallel using `map_batches`. You’ll configure the batch size, device placement (CPU, CUDA GPU, or Apple MPS), and concurrency, then materialize embeddings for the entire dataset.
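
The workflow described above can be sketched roughly as follows. This is a minimal illustration, not the example's actual code: the dataset name (`imdb`), its `text` column, the model name (`sentence-transformers/all-MiniLM-L6-v2`), and the helper names `pick_device`, `Embedder`, and `embed_dataset` are all assumptions chosen for illustration. It also assumes a Ray version whose `map_batches` accepts the `concurrency` argument.

```python
def pick_device() -> str:
    """Return "cuda", "mps", or "cpu" depending on what PyTorch can use."""
    try:
        import torch
    except ImportError:
        return "cpu"
    if torch.cuda.is_available():
        return "cuda"
    mps = getattr(torch.backends, "mps", None)
    if mps is not None and mps.is_available():
        return "mps"
    return "cpu"


class Embedder:
    """Callable class: Ray Data builds one instance per worker, so the
    model is loaded once per worker rather than once per batch."""

    def __init__(self, model_name="sentence-transformers/all-MiniLM-L6-v2",
                 device="cpu"):
        # Imported lazily so the import happens inside the Ray worker.
        from sentence_transformers import SentenceTransformer
        self.model = SentenceTransformer(model_name, device=device)

    def __call__(self, batch: dict) -> dict:
        # Assumes the dataset has a "text" column (hypothetical).
        texts = list(batch["text"])
        batch["embedding"] = self.model.encode(texts, batch_size=32)
        return batch


def embed_dataset(dataset_name="imdb", split="train[:1000]",
                  batch_size=64, concurrency=2):
    """Load a Hugging Face dataset, embed it in parallel, and materialize."""
    import ray
    from datasets import load_dataset

    device = pick_device()
    hf_ds = load_dataset(dataset_name, split=split)
    ds = ray.data.from_huggingface(hf_ds)

    # Reserve one GPU per worker only when CUDA is the chosen device.
    resources = {"num_gpus": 1} if device == "cuda" else {}
    embedded = ds.map_batches(
        Embedder,
        fn_constructor_kwargs={"device": device},
        batch_size=batch_size,    # rows handed to each Embedder call
        concurrency=concurrency,  # number of parallel Embedder workers
        **resources,
    )
    # map_batches is lazy; materialize() triggers execution of the pipeline.
    return embedded.materialize()
```

Calling `embed_dataset()` would download the dataset and model on first use. On a single MPS or CPU machine, a small `concurrency` value is probably a safer starting point, since each worker holds its own copy of the model.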