Deep RL agent for Pong

During my internship at DISCOVERY LAB GLOBAL, I trained an autonomous agent to play and win Pong using deep reinforcement learning and convolutional neural networks in Python.

What I built

A custom simulation to log rewards, frame stacks, and policy rollouts so the team could compare runs quickly.
Training loops with tuned discount factors, learning rates, and exploration schedules to stabilize policy improvement.
A concise technical paper capturing architecture choices, experiments, and lessons learned for stakeholders.

Outcomes

The project sharpened my intuition for credit assignment, sample efficiency, and how engineering choices in the simulator change what the policy can learn—skills I still apply when thinking about data quality and evaluation for ML systems.

Machine Learning Internships Python

Authors

Aaryan Sharma

CSE & Math @ OSU · Robot Operator @ Diligent Robotics

Computer Science and Engineering student at The Ohio State University focused on machine learning and systems programming in Python and C++. I care about shipping reliable software—from deep reinforcement learning experiments to real-world robotics in hospitals—and about learning from every layer of the stack.

← LLM training data & evaluation May 1, 2025

No results found

What I built

Outcomes