May 10, 2025
An exploration of policy gradient methods in reinforcement learning, focusing on the transition from REINFORCE to Proximal Policy Optimization (PPO).
March 14, 2025
A reflection on my time at Berkeley EECS and some lessons I've learned.