Suggested Searches

AI/ML STIG Lecture Series

Artificial Intelligence and Machine Learning Science and Technology Interest Group (AI/ML STIG)

Module 7: Reinforcement Learning

AI/ML STIG about AI/ML STIG Lecture Series

Location

Virtual

Dates

18 May 2026
4:00pm ET

Community

AI/ML STIG

Type

Seminar

Reinforcement Learning Applications

Speaker

Carol Cuesta-Lazaro, IAS/Flatiron

A hands-on reinforcement learning tutorial that builds policy-gradient methods from scratch with LunarLander as the running environment, progressing from vanilla REINFORCE to variance-reduced policy gradients and actor-critic learning.

Topics Covered

  • Using LunarLander to connect the agent-environment loop to code
  • Implementing a policy network and sampling actions with PyTorch
  • Training vanilla REINFORCE from trajectory-level returns
  • Reducing variance with reward-to-go, discounting, and normalized advantages
  • Building actor-critic methods with a learned value-function baseline
  • Comparing learning curves across REINFORCE, improved REINFORCE, and actor-critic
Session Recording

Meeting Connection

Join the Meeting

News Straight to Your Inbox

Subscribe to your community email news list

We will never share your email address.

Sign Up
Angled from the upper left corner to the lower right corner is a cone-shaped orange-red cloud known as Herbig-Haro 49/50. This feature takes up about three-fourths of the length of this angle. The upper left end of this feature has a translucent, rounded end. The conical feature widens slightly from the rounded end at the upper right down to the lower right. Along the cone there are additional rounded edges, like edges of a wave, and intricate foamy-like details, as well as a clearer view of the black background of space. In the upper left, overlapping with the rounded end of Herbig-Haro 49/50, is a background spiral galaxy with a concentrated blue center that fades outward to blend with red spiral arms. The background of space is speckled with some white stars and smaller, more numerous, fainter white galaxies throughout.