🤖 Robotics & Autonomous Systems

Robot Localization with Kalman and Particle Filters

View on GitHub

Implemented and compared Extended Kalman Filter (EKF) and Particle Filter algorithms for robot localization on the PR2 robot. Tested both filters in five challenging scenarios with complex paths to study the advantages and limitations of each approach. Analyzed performance in terms of accuracy, computational efficiency, and robustness to sensor noise.

Python ROS Kalman Filters Particle Filters State Estimation
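
For readers unfamiliar with the predict/correct cycle that Kalman-style filters share, here is a minimal sketch. It is a linear, one-dimensional Kalman filter, not the EKF or particle filter used in the project, and every name in it (`Gaussian1D`, `predict`, `update`) and every number is a hypothetical chosen for illustration.

```python
from dataclasses import dataclass


@dataclass
class Gaussian1D:
    """Belief over a scalar position (illustrative type, not from the project)."""
    mean: float
    var: float


def predict(belief, motion, motion_var):
    """Prediction step: shift the mean by the commanded motion and grow the variance."""
    return Gaussian1D(belief.mean + motion, belief.var + motion_var)


def update(belief, measurement, meas_var):
    """Correction step: blend the prediction and the measurement via the Kalman gain."""
    gain = belief.var / (belief.var + meas_var)
    mean = belief.mean + gain * (measurement - belief.mean)
    var = (1.0 - gain) * belief.var
    return Gaussian1D(mean, var)


if __name__ == "__main__":
    # Made-up motions and measurements, purely for demonstration.
    belief = Gaussian1D(mean=0.0, var=1.0)
    for motion, z in [(1.0, 1.2), (1.0, 1.9), (1.0, 3.1)]:
        belief = predict(belief, motion, motion_var=0.1)
        belief = update(belief, z, meas_var=0.5)
    print(belief)
```

An EKF follows the same two steps but linearizes nonlinear motion and measurement models around the current estimate, while a particle filter replaces the single Gaussian with a weighted set of samples.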

Autonomous Mobile Robot Navigation System

View on GitHub

Developed a complete autonomous navigation system for a differential-drive mobile robot using ROS2 and the Nav2 navigation stack. Implemented real-time obstacle detection and collision avoidance in C++ by processing lidar data, enabling safe navigation in dynamic environments. Integrated SLAM for mapping and localization.

ROS2 Nav2 C++ Lidar SLAM Path Planning

👁️ Computer Vision & 3D Reconstruction

Challenges in Multi-view 3D Scene Reconstruction

Read Paper

Conducted a comprehensive analysis of failure cases in MASt3R, a state-of-the-art multi-view 3D reconstruction method, under challenging capture conditions. Collected and annotated a custom scene dataset with varying lighting, occlusions, and viewpoint distributions. Documented systematic limitations and proposed potential improvements for future work.

3D Reconstruction Multi-view Geometry PyTorch Dataset Creation

Barcode Detection Using OpenCV

View on GitHub

Built a classical computer vision pipeline to detect barcodes on grocery items and classify each detected bounding box by whether the barcode is readable. Used traditional CV techniques including edge detection, morphological operations, and contour analysis. Achieved robust detection across various lighting conditions and barcode orientations.

OpenCV Python Image Processing Object Detection

🧠 Machine Learning & AI

Content-Agnostic Deepfake Audio Detection

Research Project

Created a benchmark audio dataset of 1M+ samples, pairing real recordings with fake audio generated by multiple TTS models. Engineered end-to-end pipelines for generation, labeling, and evaluation. Trained an audio-language detection model that outperforms human listeners and achieves state-of-the-art performance on deepfake detection.

PyTorch Audio ML TTS Systems CLAP Dataset Engineering

Video-LLM Activity Hallucination Mitigation

Research Project

Addressed activity hallucinations in Video-LLMs by creating an annotated video-activity dataset with fine-grained evaluation metrics for actor, action, object, and environment understanding. Designed targeted question types and evaluation methods for robust activity recognition.

Video-LLMs Multimodal Learning Dataset Annotation Evaluation Metrics

📚 Coursework Projects

Algorithmic Motion Planning

Implemented RRT, PRM, and A* algorithms for robot motion planning in complex environments

C++ Path Planning
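
As a rough illustration of one of the planners named above, the following is a minimal A* sketch on a 4-connected occupancy grid. It is written in Python rather than the C++ used in the coursework, and the grid encoding (0 for free, 1 for obstacle), unit step cost, and function name `astar` are assumptions made for this example.

```python
import heapq


def astar(grid, start, goal):
    """Return a shortest 4-connected path from start to goal, or None if unreachable.

    grid is a list of lists where 0 is free and 1 is an obstacle;
    start and goal are (row, col) tuples.
    """
    def heuristic(cell):
        # Manhattan distance is admissible for 4-connected unit-cost moves.
        return abs(cell[0] - goal[0]) + abs(cell[1] - goal[1])

    rows, cols = len(grid), len(grid[0])
    open_heap = [(heuristic(start), 0, start)]  # entries are (f, g, cell)
    came_from = {}
    best_cost = {start: 0}

    while open_heap:
        _, cost, current = heapq.heappop(open_heap)
        if current == goal:
            # Walk the parent links back to the start.
            path = [current]
            while current in came_from:
                current = came_from[current]
                path.append(current)
            return path[::-1]
        if cost > best_cost[current]:
            continue  # stale heap entry; a cheaper route was already found
        r, c = current
        for nr, nc in ((r + 1, c), (r - 1, c), (r, c + 1), (r, c - 1)):
            if 0 <= nr < rows and 0 <= nc < cols and grid[nr][nc] == 0:
                new_cost = cost + 1
                if new_cost < best_cost.get((nr, nc), float("inf")):
                    best_cost[(nr, nc)] = new_cost
                    came_from[(nr, nc)] = current
                    heapq.heappush(
                        open_heap,
                        (new_cost + heuristic((nr, nc)), new_cost, (nr, nc)),
                    )
    return None
```

RRT and PRM differ in that they sample a continuous configuration space instead of searching a fixed grid, which tends to scale better to high-dimensional robots.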

Probabilistic State Estimation

Developed Bayesian filtering techniques for robot state estimation with noisy sensors

MATLAB Probability

Sensor Fusion for Localization

Combined IMU, GPS, and wheel odometry data for accurate robot localization

Python Sensor Fusion
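
One simple way to combine independent position estimates is inverse-variance weighting, sketched below. This is not necessarily the method used in the coursework; the function name `fuse` and the sample readings and variances are hypothetical.

```python
def fuse(estimates):
    """Fuse independent (value, variance) estimates by inverse-variance weighting.

    Returns the fused value and its variance. Assumes the estimates are
    unbiased, independent, and have strictly positive variances.
    """
    if not estimates:
        raise ValueError("need at least one estimate")
    weights = [1.0 / var for _, var in estimates]
    total = sum(weights)
    value = sum(w * v for w, (v, _) in zip(weights, estimates)) / total
    return value, 1.0 / total


if __name__ == "__main__":
    # Made-up x-position readings in meters: GPS, wheel odometry, IMU integration.
    readings = [(10.4, 4.0), (10.1, 0.5), (9.8, 1.0)]
    print(fuse(readings))
```

The fused variance is never larger than the smallest input variance, which is why adding even a noisy sensor can tighten the estimate as long as its noise is independent of the others.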

Object Detection Pipeline

Built end-to-end object detection system using YOLO for warehouse applications

PyTorch YOLO

🏆 Achievements & Competitions

🥈

Top-25 Team

Smart India Hackathon 2024

Disaster notification webpage with NLP

🥉

54th All-India Rank

Amazon ML Challenge 2023

FAISS-based product dimension prediction

View Solution →