Waleed Bin Khalid — Robotics Engineer

About

I'm a Robotics Engineer who is excited about a world where robots and humans coexist and cooperate across a variety of tasks, boosting productivity, safety, and efficiency. My research interests center on robot autonomy, with a particular focus on embodied AI, robot learning, perception, and control theory. I also hold a keen interest in AI methods and applying machine learning to real-world problems across different domains.

I completed my M.S. in Robotics from the Georgia Institute of Technology, Atlanta, GA, USA, and my B.S. in Electrical Engineering with double minors in Physics and Mathematics from Habib University, Karachi, Pakistan.

Beyond technology, I love to travel solo ✈️, capture moments through photography 📸, and try out new sports 🏓🏊. I'm also a passionate cricket fan 🏏, enjoy playing cards ♠️, and have recently developed an interest in chess ♟️.

➡️ Scroll down to learn more about my past research work, academics, teaching, and professional work experience.

Research & Publications

Legged Robot Navigation in Uncertain Terrains using Deep Reinforcement Learning

Georgia Institute of Technology · Research Project

Developed a dynamic, vectorized RaiSim environment and integrated deep reinforcement learning (PyTorch) to train a navigation policy on top of a low-level locomotion controller. Designed custom rewards to optimize point-to-goal navigation. Created diverse terrains and explored motion planning (RRT, A*) and dataset generation for imitation learning. Gained expertise in robot kinematics and frame transformations for legged robots.

Video 1

Video 2

Project Report

An AI Model for Instance Segmentation and Tooth Numbering on Orthopantomograms

International Journal of Computerized Dentistry (2023) · Niha Adnan, Waleed Bin Khalid, Fahad Umer

Worked with a team of dentists at one of the leading hospitals in Pakistan to develop a deep-learning pipeline for teeth segmentation and labeling using U-NET and FRCNN. An Orthopantomograms (OPGs) dataset was self-annotated & processed for the project using PIL, numpy, JSON & OpenCV.

Ground-truth tooth segmentation mask — Ground-truth Mask

Visualization of instance segmentation of teeth — Instance Segmentation

Teeth detection and numeric labeling visualization — Detection & Labels

Publication

Contactless Vitals Measurement Robot

ICARA (2022) · Waleed Bin Khalid, Amna Anwar, Owais Talaat Waheed

Designed and developed a contactless vitals measurement robot equipped with an Ackermann drive for my undergraduate final year project. The robot utilised a camera, machine learning, and signal processing to measure vitals and decrease medical staff contact time with contagious patients. The system reported around 70% accuracy for Blood Pressure, and 90% for Heart Rate, Oxygen Saturation, and Temperature. I presented the work as the first author at ICARA 2022 and received the Best Capstone Project award for this work.

Video

Thesis Publication

Projects

TAMP for Mobile Manipulation using Perception, LLMs and Motion Planning.

LLMs, Task and Motion Planning (TAMP), Kinematics, Perception, PyBullet Simulation

Building a bottom-up simulation framework for mobile manipulation in PyBullet, to study and understand Task and Motion Planning (TAMP) for robotic systems like a mobile manipulator. A Clearpath Husky base with a Franka Panda arm executes multi-step language instructions by combining LLM-based task planning with end-effector camera perception and IK motion execution. The system successfully achieves reliable single-cube picks and two-cube stacking, while multi-cube stacking reveals challenges such as motion smoothness, base drift, and grasp retries. This ongoing project not only demonstrates the integration of symbolic task planning with continuous control but also serves as a platform to study richer perception, collision-aware motion planning, feedback-driven correction, and vision–language grounding to push toward general-purpose mobile manipulation.

This project is currently ongoing and further updates will be made as I progress.

Demo videos of picking single cube and stacking 2 cubes

mobile 2 — Husky + Franka Emika Arm platform

Initial Report

LLM-Based Path Planning in Grid Worlds: Local and Global Policies vs A* Baseline

LLMs, A* Search, Hybrid Planning, Grid Navigation

Working on this project to understand and study how LLMs can be used for path planning. Current findings are that vanilla local policies achieve around 60% success on A*-solvable cases, while vanilla global one-shot planning reaches only about 25% and degrades more quickly as obstacle density increases. These initial results highlight the relative resilience of local prompting, but the work remains ongoing. I am systematically exploring improvements through richer local contexts (e.g., 5×5 windows, clearance features), prompt refinements (few-shots, ranking), and hybridization with classical priors (subgoal induction, A* cost-to-go). The broader goal is to examine how far prompt engineering and minimal feature additions can push LLM-based planning toward reliable performance, and to assess the feasibility of such methods for robotic path planning in more realistic settings.

This project is currently ongoing and further updates will be made as I progress.

Comparison of Global, Local and A* Policy with 21 random obstacles

Initial Report

Motion Planning for 3-Link Robot Arm

RRT, Forward and Inverse Kinematics, PID, PyGame

Implemented RRT from scratch with custom collision detection conditions to move a 3 link arm in the presence of obstacles and walls. Simulated the environment and robot using PyGame and tuned RRT and PID parameters for optimal performance in varying configerations. Implemented the forward and inverse kinematics from scratch to compute path variables. Created multiple demos, to show different aspects such as wrist limitations, reachable space considerations, and RRT performance.

Video 1

Video 2

Project Report

Robot Maze Navigation for Variable Maze Structures

CNNs, ROS 2, Path Planning

Tackled the challenge of navigating an unknown maze by implementing a combination of the Bug algorithm, collision avoidance, and CNN-based image classification to create a solution invariant to the maze structure, starting point, and end point. Optimized a CNN to detect signs with 99% accuracy by augmenting for invariance and extensively preprocessing the data. The motion plan integrated CNN-generated maneuvering commands, lidar-based wall detection for collision avoidance, the Bug algorithm as a fallback, and PID control to ensure accurate straight-line movement and sharp turns, achieving 100% navigation success.

Video

Learning Robot Tasks From Videos

RL and IL, Open AI Gym, Transformers, ManiSkill2 dataset

Studied manipulator control using reinforcement and imitation learning via MLPs, CNNs, and transformers on the ManiSkill2 dataset. Mitigated covariate shift in the Franka Emika Panda Arm’s pick-and-place task through an encoder transformer for behavior cloning, achieving a 20% success rate with imitation learning, with 80% showing either correct picking or placing. Implemented PPO-based reinforcement learning control, achieving 100% task success for a selected task.

Video

Report

Road Image Segmentation with Supervised & Unsupervised Methods

Supervised & Unsupervised ML, CARLA dataset

Deployed supervised and unsupervised methods like K-Means, DB Scan, Gaussian Mixture Models, and U-NET for segmenting road images from the CARLA dataset. Achieved a 95% Dice Score using U-NET for instance segmentation and a Silhouette Coefficient of 0.5 with K-Means and Gaussian Mixture Models.

Example of a road image from the CARLA dataset — Actual Image

Ground-truth mask for the road image — Ground-truth Mask

Predicted segmentation mask — Predicted Mask

Report

SLAM using TurtleBot3 and ROS 2

Python, ROS 2, SLAM

Developed a 2D map by teleoperating the robot in both physical and Gazebo environments, utilizing the Cartographer algorithm. Tuned ROS2 SLAM parameters to ensure the robot could smoothly traverse the map and autonomously navigate to three randomly provided locations within a 7x6 feet arena with barriers.

Hardware

Simulation

CartPole Control using Deep Reinforcement Learning

DQN, DDQN, Dueling DDQN, CNN-DQN, PPO

Conducted a comparative study of Deep Q-learning, its variations (DQN, DDQN, Dueling DDQN, CNN-DQN), and Proximal Policy Optimization (PPO) on the classical CartPole control problem using OpenAI Gym. Results showed that PPO and DDQN achieved perfect scores of 500 across 100 episodes, outperforming DQN variations. CNN-DQN struggled with convergence, while PPO demonstrated superior stability by preventing drastic policy changes. The study provided valuable insights into the performance differences of policy-based and Q-learning methods for reinforcement learning tasks.

Video

Report

Legged and Humanoid Robot Control in MuJoCo

Reinforcement & Imitation Learning

Implemented legged robot control with RL, curriculum learning, and custom reward functions to make a quadruped robot move over obstacles with increasing height. Implemented two walking modes (normal and stealthy) for a humanoid robot using imitation learning and reference state initialization.

Video

Diffusion Model as Robot Action Policy

IL, Robot Task Planning, Diffusion

Implemented a diffusion model to imitate demonstrations and model action distribution given states, obtaining 100% accuracy in guiding a point agent to push a T-shaped block onto a T-shaped outline.

GIF 1

GIF 2

GIF 3

Pedestrian Behaviour Classification

Detection + Transfer Learning (VGG16)

Prepared a pedestrian image dataset by combining data from various sources and annotating pedestrian state as distracted or non-distracted. Detected pedestrians with 100% accuracy via FRCNN and identified distraction with 90% accuracy via transfer learning with VGG16.

Pedestrian detection bounding boxes — Detection

Pedestrian using mobile phone — Use of Mobile

Detecting distraction state of pedestrians — Distraction Classification

Report

Autonomous Line Following Ball Throwing Robot

Arduino, IR sensors, encoders, differential drive, PID

Developed a robot using Arduino, IR sensors, self-fabricated motor drivers, and regulators with discrete components. Implemented differential drive for mobility and a flywheel mechanism for shooting. Calibrated motor encoders to ensure accurate odometry readings, enabling the robot to autonomously traverse a 4x8 feet arena and drop balls into four hoops within three minutes, guided by line-following and PID control.

Video

HAWAI FIRING: Shooting Game (C++)

SDL 2.0 · OOP design · UML

Developed a bubble shooting game using C++ and the SDL 2.0 library to practice OOP design principles, clean code practices, and maintainability through UML diagrams. Created the game from scratch, including multiple levels, menu screens, saving mechanisms, and object graphics. Implemented game physics using kinematic principles and collision detection strategies.

Video

Research Paper Reflections

I have had opportunities to read and reflect on a number of state-of-the-art papers recently. I’m documenting my reviews and reflections as concise write-ups and presentation slides. I also plan to convert this section into a blog and implement selected findings in code where feasible.

Work Experience

Software Engineer (April 2025 - Present)

Borderless, Remote (Freelance)

Led ideation to translate client needs into a scalable web solution, developed with Python (Flask), HTML, CSS, JavaScript, and deployed on AWS Lightsail. Engineered a Mapbox-based interactive globe enabling filterable data visualization to drive thematic storytelling. Working on the backend to support content automation and transition from manual Excel-based workflows to a database-driven system.

Summer Intern at EPIC Lab (May 2023 - August 2023)

Georgia Institute of Technology, Atlanta, GA, USA

Worked with a Ph.D. student to enhance the circuit and mechanical design quality of a hip exoskeleton, resolving electronic and mechanical design issues by cleaning up circuits, fabricating new PCBs, 3D-printing worn-out clippers, and reinforcing components to reduce vibrations and eliminate circuit failures. Concurrently conducted data collection, amassing 60 hours of respiratory and EMG data from human subject trials, required by lab researchers to assess metabolic costs and muscle activity.

Hip exoskeleton device — Hip Exoskeleton

Ankle exoskeleton device — Ankle Exoskeleton

Video

Internship Report

Robotics Lab Engineer (July 2021 - December 2021)

Habib University, Karachi, Pakistan

Spearheaded the establishment of a new robotics lab by creating an experiment bench with PhantomX manipulators and Intel RealSense D435i. Enabled students to test their robots for a robo soccer project by engineering a 6x8 feet robot tracking arena using Python, C, and camera vision.

Teaching Experience

Graduate Teaching Assistant (January 2023 - May 2024)

Georgia Institute of Technology Atlanta, GA, USA

Course Assisted: Physics 2211 - Matter and Interactions (Spring 2023, Fall 2023 & Spring 2024)

Conducted bi-weekly lab experiment and recitation sessions for an average class size of 60 students. Consistently rated 4–5/5 in student evaluations across all metrics, reflecting strong teaching impact.

Lab Instructor (July 2021 - December 2021)

Habib University, Karachi, Pakistan

Designed and instructed labs in Physics 101 and Basic Electronics, strengthening students’ grasp of fundamental physical principles and electronic circuits.

Undergraduate Teaching Assistant/Peer Tutor (January 2019 - May 2021)

Habib University, Karachi, Pakistan

Courses Assisted: Calculus II (Spring 2019), Engineering Mathematics (Fall 2019), Electrical Machines (Spring & Fall 2020), Mechanics & Thermodynamics (Spring 2020), Probability & Statistics (Spring & Fall 2020), Electric Network Analysis (Spring 2021)

Managed teaching responsibilities for classes of 30–60 students, encompassing grading, assignment development, recitation and revision sessions, and individual tutoring support.

Education

Georgia Institute of Technology

M.S. in Robotics | Janunary 2023 – May 2024 | CGPA: 4.00/4.00

Coursework: Introduction to Robotics Research, Robotics, Machine Learning, Deep Learning, Deep Reinforcement Learning for Intelligent Control, Machine Vision, Linear Control Systems, Artificial Intelligence, Robotics Professional Prep Seminar 1–3, Robotics Capstone Project 1 & 2

MS Degree MS Transcript

Habib University, Karachi, Pakistan

B.S. in Electrical Engineering | Minors: Physics and Mathematics | August 2017 – June 2021 | CGPA: 3.82/4.00 | Major Rank: 03/37

Selected Coursework: Feedback and Control Theory, Computer Vision, Mobile Robotics, Instrumentation, Linear Algebra, Probability and Statistics, Calculus, Engineering Mathematics, Engineering Design

Best capstone project certificate — Best Capstone Project

Outstanding Teaching Assistant certificate — Outstanding TA

Double minor completion certificate — Double Minor

BS Degree BS Transcript