Reinforcement Learning Engineer, Locomotion

VinDynamics

Hanoi, Vietnam

Fresher

Save

Posted 7 hours ago
Be among the first 10 applicants

Early Applicant

Job Description

Description:

At VinDynamics, we build home assistive robots that will help billions of people and families across the world. We are seeking talented and self-driven Reinforcement Learning engineers to join our stellar team. You will work on cutting-edge policy learning algorithms that allow the robot to walk, run, balance, and recover from disturbances in real-world environments.

Key Responsibilities

Develop and implement reinforcement learning algorithms specialized for locomotion tasks (e.g., walking, running, climbing, balancing).
Design, integrate, and optimize high-fidelity simulation environments for safe and efficient policy training.
Conduct sim-to-real transfer by addressing robustness, domain randomization, and system identification challenges.
Incorporate perception, sensor feedback, and proprioception into RL agents to enable adaptive and reactive motion.
Evaluate and benchmark locomotion policies under diverse real-world conditions (e.g., terrain variation, disturbances, slopes, payloads, and friction).
Work on reward design, stability, sample efficiency, and safety-constrained learning.
Write clean, maintainable, and well-documented code, ensuring reproducibility and version control for experiments and policies.

Requirements

Solid background in Reinforcement Learning (Deep RL, Policy Gradient, Model-based RL, Imitation Learning, etc.).
Hands-on experience with simulation platforms such as MuJoCo, PyBullet, Isaac Gym, or Gazebo.

Preferred Qualifications

Experience with locomotion, motion control, or physical control systems (e.g., legged robots, drones, exoskeletons, robotic arms).
Experience in sim-to-real transfer, domain randomization, or system identification in robotics.
Proficiency in Python and/or C++, and familiarity with ML frameworks such as PyTorch, TensorFlow, or JAX.
Strong analytical and debugging skills for physical systems; ability to identify stability and performance bottlenecks.
Familiarity with sensor fusion, feedback control, and proprioceptive sensing.

Location:

Hanoi Vietnam
Reno, Nevada, US (coming soon)

Compensation:

VinDynamics offers a competitive base, full benefits and other incentives. Flexible time-off policy. Focus on output and ability to work with an interdisciplinary team.