I am a Staff Research Scientist at Boston Dynamics working on reinforcement learning for the Atlas robot. I completed my Ph.D. in Computer Science from University of Washington, advised by Byron Boots. My thesis research focused on developing practical machine learning algorithms and systems that enable robots to operate in dynamic real-world environments with minimal human supervision. Specific topics include integrating model-predictive control with reinforcement learning, learning-based methods for accelerated motion planning, and offline reinforcement learning.

Before transferring to University of Washington, I spent a year as a Ph.D. Robotics student at Georgia Institute of Technology. Prior to that I was a Robotics Engineer at Near Earth Autonomy, Inc. I received my Master’s in Robotic Systems Development from the Robotics Institute at Carnegie Mellon University, where I conducted research with The Air Lab under the guidance of Sanjiban Choudhury and Dr. Sebastian Scherer, and my B.Tech in Mechanical Engineering from the Indian Institute of Technology (BHU), Varanasi.

During my Ph.D study, I had the amazing opportunities to intern at various industry research labs. In Summer 2022, I was a Research Scientist Intern at Google DeepMind, London in the Controls Team lead by Martin Reidmiller. I spent Fall 2020 and Summer 2019 as an intern at NVIDIA Seattle Robotics Lab working with Dieter Fox, Fabio Ramos, Balakumar Sundaralingam and Ankur Handa.

When not busy with research, you'll catch me doing stand-up and improv comedy or practicing Capoeira.

Research

Pre-prints

  • Dynamic Non-Prehensile Object Transport via Model-Predictive Reinforcement Learning N. Jawale, B. Boots, B. Sundaralingam*, M. Bhardwaj [Arxiv]

Journal Publications

  • Lazy SP AuRo

    Leveraging Experience in Lazy Search

    M. Bhardwaj, S. Choudhury, B. Boots, S. Srinivasa

    Autonomous Robots, 2021

    [Paper] [BibTex]
  • Data-Driven Planning IJRR

    Data-driven Planning via Imitation Learning

    S. Choudhury, M. Bhardwaj, S. Arora, A. Kapoor, G. Ranade, S. Scherer and D. Dey

    International Journal on Robotics Research, 2017

    (Finalist for Paper of the Year)

    [Paper] [BibTex]

Conference Publications

    • Armor

      Adversarial Model for Offline Reinforcement Learning

      M. Bhardwaj*, T. Xie*, B. Boots, N. Jiang, C. Cheng

      Conference on Neural Information Processing Systems, 2023

      [Arxiv] [Proceedings]
    • STORM

      STORM: An Integrated Framework for Fast Joint-Space Model-Predictive Control for Reactive Manipulation

      M. Bhardwaj, B. Sundaralingam, A. Mousavian, N. Ratliff, D. Fox, F. Ramos, B. Boots

      Conference on Robot Learning, 2021 (Selected for Oral Talk - 6% Acceptance)

      [Paper] [BibTex] [Website] [Code]
    • MPQLambda

      Blending MPC & Value Function Approximation for Efficient Reinforcement Learning

      M. Bhardwaj, S. Choudhury, B.Boots

      International Conference on Learning Representations, 2021

      [Paper] [BibTex]
    • MPQ

      Information Theoretic Model Predictive Q-Learning

      M. Bhardwaj, A. Handa, D.Fox, B.Boots

      Learning for Dynamics and Control, 2020

      [Paper] [BibTex] [Website]
    • DGPMP2

      Differentiable Gaussian Process Motion Planning

      M. Bhardwaj, B. Boots and M. Mukadam

      International Conference on Robotics and Automation, 2020

      [Paper] [BibTex] [Website]
    • LazySP

      Leveraging Experience in Lazy Search

      M. Bhardwaj, S. Choudhury, B. Boots and S. Srinivasa

      Robotics:Science and Systems, 2019

      [Paper] [BibTex] [Talk]
    • SaIL

      Learning Heuristic Search via Imitation

      M. Bhardwaj, S. Choudhury and S. Scherer

      Conference on Robot Learning, 2017

      [Paper] [BibTex] [Website] [Talk]
    • VisServo

      Real-time dynamic singularity avoidance while visual servoing of a dual-arm space robot

      P. Mithun, V.V. Anurag, M. Bhardwaj, S.V. Shah

      Advances in Robotics, 2015

      [Paper]