Two Sigma uses third-party advertising and advertising analytics cookies that allow us and our partners to serve your more relevant advertisements across platforms. You may accept or decline our use of these kinds of cookies by selecting “accept” or “decline” below. For more information about our privacy practices, please visit our Cookies Policy here.

Learn More

Data Science Engineering

A 24x Speedup for Reinforcement Learning with RLlib + Ray

Author: Raoul Khouri (Two Sigma)

Presented at: Ray Summit 2021

Abstract: Training a reinforcement learning (RL) agent is compute intensive. Under classical deep learning assumptions bigger and better GPUs reduce training time. However, for RL, bigger and better GPUs do not always lead to reduced training time. In practice, RL can require millions of samples from a relatively slow and CPU-only environment leading to a bottleneck in training that GPUs do not solve. Empirically, we find that training agents with RLlib removes this bottleneck because its Ray integration allows scaling to many CPUs across a cluster of commodity machines. This talk details how such scaling can cut training wall-time down by orders of magnitude.