Python

LLM training data & evaluation

Generative-AI workflows for producing precise, reviewable datasets that improve model behavior, drawn from experience with Outlier on OpenAI-facing programs.

May 1, 2025 • 1 min read

Machine Learning

Deep RL agent for Pong

Convolutional policy trained with deep reinforcement learning in a custom Python simulator. Built as team lead during a summer DISCOVERY LAB GLOBAL internship.

Aug 15, 2024 • 1 min read
