People
Faculty
Mingfei Sun
Lab Director, Assistant Professor - Google Scholar
Research: large-scale reinforcement learning, optimization, and dependable agent systems.
Personal website: mingfeisun.github.io
Doctoral Researchers
Adrian Mircea Nenu
PhD Researcher - Google Scholar
Research: policy gradients, reinforcement learning theory, and algorithmic improvements.
Personal website: nenuadrian.com
Brendan Bennett
PhD Researcher - Google Scholar
Research: Reinforcement learning for LLMs.
Personal website: brendanbennett.github.io
Hossein Abdi
PhD Researcher - Google Scholar
Research: Second-Order Optimization, Policy Gradient Reinforcement Learning, Control Theory, Robotics.
Personal website: hossein-abdi.github.io
Satya Prakash Dash
PhD Researcher - Google Scholar
Research: Convex and Non-Convex Analysis, Natural Gradient Descent, Policy Optimization, Transformer Learning Theory.
Personal website: sprakashdash.github.io
Maytus Piriyajitakonkij
PhD Researcher - Google Scholar
Research: Cognitively-inspired AI, Machine Theory of Mind, Emergent Communication.
Personal website: maytusp.com
Yihe Zhou
PhD Researcher - Google Scholar
Research: Policy Gradients, Multi-agent Reinforcement Learning, LLM post-training.
Yingxiao Huo
PhD Researcher - Google Scholar
Research: Policy Gradients, Optimization, Second-Order Optimization.
Personal website: huoyingxiao.github.io
Beining Zhang
PhD Researcher - Google Scholar
Research: Policy Gradients, Decentralized Optimization.
Personal website: zbn111.github.io
Victor Yeom Song
PhD Researcher - Google Scholar
Research: Diffusion Generative Models, Policy Gradients.
Personal website: vyeoms.github.io/
Jacob Mitchell Cummins
PhD Researcher - Google Scholar
Research: Masked Autoencoders, 3D Model Feature Extraction.
Personal website: jacobcummins.github.io/
MSc Researchers
Daniel Shi
MSc Thesis Student
Project: Roguelike games using deep reinforcement learning.
Collaborators
We actively collaborate with academic and industry partners across reinforcement learning, robotics, and multi-agent systems.