Blog | Prashant Rawat

Open Source Jul 15, 2025

AlchemyCV: A Desktop App for Multi-Stage Image Processing

Why I built a 5-stage image processing pipeline as an open-source desktop tool — blur, enhance, filter, mask, and detect edges with real-time parameter tuning.

6 min read Read more

Open Source Oct 10, 2025

AlchemyAnnotate: A Lightweight Image Annotation Tool

I built a desktop annotation tool that exports to YOLO, Pascal VOC, and COCO — because existing tools were either too heavy or too limited.

7 min read Read more

Open Source Jan 20, 2026

AlchemyDetect: Train Detectron2 Models Without Writing Code

A desktop GUI for training Faster R-CNN, RetinaNet, and Mask R-CNN models — with real-time loss plots and one-click inference on images or folders.

8 min read Read more

Project Deep-Dive Feb 28, 2026

Building a Surveillance Quadruped with Face Recognition

Turning a Unitree Go2 into an autonomous patrol robot — random-walk navigation in a mapped area, person detection, face-DB verification, and intrusion alerts.

7 min read Read more

Project Deep-Dive Jan 25, 2026

Indoor Navigation for a Humanoid Robot

A walkthrough of integrating VLM, LiDAR, and RealSense D435i on the Unitree G1 to give a humanoid robot the ability to navigate, see, and speak.

8 min read Read more

Project Deep-Dive Mar 15, 2026

Building a Browser-Based Control Panel for the Unitree G1 (Part 1: Foundation)

How I built a React + Three.js operator console for a Unitree G1 humanoid — live costmap, point cloud overlay, camera feed, and click-to-go navigation, all from scratch.

9 min read Coming soon

Project Deep-Dive Apr 7, 2026

From Click-to-Go to Programmable Missions: Routes, Actions & Detection (Part 2)

Programmable waypoint routes, an action library on the robot, recovery flows for failed goals, and a Detectron2 object detection pipeline wired straight into the operator UI.

9 min read Coming soon

Project Deep-Dive Apr 7, 2026

Inside the Robot: Localization, Navigation, and Action Control on the Unitree G1 (Part 3)

The robot-side deep dive — FAST-LIO + Open3D ICP localization, move_base tuning, the cmd_vel bridge, and the asyncio executor that runs everything on the Jetson.

10 min read Coming soon

Technology Mar 20, 2026

VLM vs VLA: Understanding Vision-Language Models in Robotics

Breaking down the difference between Vision-Language Models and Vision-Language-Action models, when to use each, and real-world trade-offs I've encountered.

6 min read Coming soon

Technology Feb 15, 2026

Anomaly Detection in Manufacturing: PaDiM, Anomalib, and Beyond

A practical comparison of anomaly detection approaches for manufacturing QA — what worked, what didn't, and how we got to 90%+ accuracy across 8 modules.

9 min read Coming soon

Course Brief Mar 10, 2026

CNN to Transformer: A Visual Guide to Modern Architectures

A concise walkthrough of the evolution from convolutional networks to transformers — what changed, why it matters, and how it connects to current work in vision and NLP.

10 min read Coming soon

Course Brief Feb 1, 2026

ROS2 for AI Engineers: What You Actually Need to Know

A condensed guide to ROS2 for ML/AI engineers who need to ship robot software — nodes, topics, actions, and the parts that matter for real integration.

12 min read Coming soon