
Jonathan Ouyang
Incoming Google DeepMind engineer | CS student at UCLA
jonathan-ouyang-4185
Los Angeles
Joined May 2026
Network
342 connections
Summary
Practical multimodal AI systems builder: builds vision-language and desktop-operating agents that bridge perception and action (e.g., JAYU, CLOVIS), with engineering aimed at production-capable prototypes.
Robotics and shared-autonomy researcher: contributes to robotics research through UCLA collaborations, including authored and co-authored work on gaze-guided manipulation and other lab publications.
Full-stack software engineer with production experience: internships and roles (Amazon Prime Video Studios, Daily Bruin) involved building scalable systems and tooling and integrating services across platforms.
Competitive hackathon and developer contest winner: active in hackathons and developer competitions (Google Gemini API Developer Competition winner), using them to prototype ambitious multimodal AI projects.
Work
Education
Projects
Writing
Intent at a Glance: Gaze-Guided Robotic Manipulation via Foundation Models
January 1, 2026
Paper presenting GAMMA, a system combining ego-centric gaze tracking with a vision-language model to infer user intent and autonomously execute tabletop manipulation tasks without task-specific training.
Optimization of Swim Pose Estimation and Recognition with Data Augmentation
January 1, 2024
Conference paper describing a methodology to augment raw video data and adapt a YOLOv7-based model plus ensemble classifiers to improve swim pose estimation and stroke recognition, reporting improved pose confidence and ~80% recognition accuracy.