Deep RL agent for Pong

Aug 15, 2024 · 1 min read
projects

During my internship at DISCOVERY LAB GLOBAL, I trained an autonomous agent to play and win Pong using deep reinforcement learning and convolutional neural networks in Python.

What I built

  • A custom simulation to log rewards, frame stacks, and policy rollouts so the team could compare runs quickly.
  • Training loops with tuned discount factors, learning rates, and exploration schedules to stabilize policy improvement.
  • A concise technical paper capturing architecture choices, experiments, and lessons learned for stakeholders.

Outcomes

The project sharpened my intuition for credit assignment, sample efficiency, and how engineering choices in the simulator change what the policy can learn—skills I still apply when thinking about data quality and evaluation for ML systems.

Aaryan Sharma
Authors
CSE & Math @ OSU · Robot Operator @ Diligent Robotics
Computer Science and Engineering student at The Ohio State University focused on machine learning and systems programming in Python and C++. I care about shipping reliable software—from deep reinforcement learning experiments to real-world robotics in hospitals—and about learning from every layer of the stack.