About me

I am a Master’s student in Mechatronics and Information Technology at Karlsruhe Institute of Technology (KIT), working at the intersection of embodied AI, robotics, and perception.

Research

My research interests lie in embodied AI and intelligent robotic systems.

I am interested in:

  • Vision-Language Navigation (VLN) and Vision-Language-Action (VLA)
  • Multimodal large language models (MLLMs) and agent-based systems
  • Robot perception and 3D scene understanding
  • Real-world robotic system development and deployment

Selected Projects

ROS2-Based Autonomous Exploration and Object Retrieval

Developed a ROS2-based robotic system for autonomous exploration and object retrieval on Boston Dynamics Spot. My primary contribution focused on the perception pipeline, including object detection and 3D localization.

  • Trained and deployed a YOLOv5 model on a custom dataset (mAP@0.5 = 0.962)
  • Built RGB-D perception modules for real-time object detection and 3D localization
  • Implemented 2D-to-3D projection and TF-based coordinate transformation
  • Designed multi-view fusion and temporal filtering
  • Supported both global detection and close-range grasping

GitHub Repository

LLM-Based Heuristic Generation for Robotic Palletization

Developed an LLM-driven system for heuristic generation in robotic palletization (3D bin packing), integrating human-in-the-loop feedback and physical constraints.

  • Designed an LLM interface for translating natural language into executable heuristics
  • Built structured prompt templates incorporating constraints and runtime context
  • Developed an API-based interaction pipeline
  • Applied LoRA fine-tuning for efficient local model deployment

GitHub Repository

Technical Skills

Programming: Python, MATLAB, C++
Robotics: ROS2, TF, RViz, sensor integration
Perception: OpenCV, YOLO, RGB-D perception, multi-view fusion
Machine Learning: PyTorch, deep learning pipelines
Generative AI: LLM integration, prompt engineering, LoRA
Tools: Git, Linux, Docker