13. Mediapipe recognizes and tracks palm-controlled robotic arms

13.1. Introduction

MediaPipe is a data stream processing machine learning application development framework developed and open sourced by Google. It is a graph-based data processing pipeline for building data sources that use many forms, such as video, audio, sensor data, and any time series data.

MediaPipe is cross-platform and can run on embedded platforms (Raspberry Pi, etc.), mobile devices (iOS and Android), workstations and servers, and supports mobile GPU acceleration.

MediaPipe provides cross-platform, customizable ML solutions for real-time and streaming.

13.2. Using

After the program starts, after the camera captures the image, the robotic arm will follow the movement of the palm in the screen. The movement speed of the palm movement here should not be too fast, otherwise the image processing will not be able to keep up, which will cause stuttering.

After startup, press R2 of the handle to turn on the function.

After the function is turned on, the robotic arm will move with the palm, and at the same time, the end gripper will also change with the opening and closing of the palm.

13.3. Core code parsing ArmCtrl.py