Mediapipe development1. Introduction2. Use2.1. Hand detection2.2, Posture detection2.3. Overall detection2.4, Face Detection2.5, Face Recognition2.6, Face effects2.7, 3D object recognition2.8, Brush2.9, Finger control2.10, Gesture Recognition
MediaPipe is a data stream processing machine learning application development framework developed and open sourced by Google. It is a graph-based data processing pipeline for building applications that use a variety of data sources, such as video, audio, sensor data, and any time series data. MediaPipe is cross-platform and can run on embedded platforms (Raspberry Pi, etc.), mobile devices (iOS and Android), workstations, and servers, and supports mobile GPU acceleration. MediaPipe provides cross-platform, customizable ML solutions for real-time and streaming media.
The core framework of MediaPipe is implemented in C++ and provides support for languages such as Java and Objective C. The main concepts of MediaPipe include packets, streams, calculators, graphs, and subgraphs.
Features of MediaPipe:
Deep learning solution in MediaPipe
Face Detection | Face Mesh | Iris | Hands | Pose | Holistic |
---|---|---|---|---|---|
![]() | ![]() | ![]() | ![]() | ![]() | ![]() |
Hair Segmentation | Object Detection | Box Tracking | Instant Motion Tracking | Objectron | KNIFT |
![]() | ![]() | ![]() | ![]() | ![]() | ![]() |
Android | iOS | C++ | Python | JS | Coral | |
---|---|---|---|---|---|---|
Face Detection | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ |
Face Mesh | ✅ | ✅ | ✅ | ✅ | ✅ | |
Iris | ✅ | ✅ | ✅ | |||
Hands | ✅ | ✅ | ✅ | ✅ | ✅ | |
Pose | ✅ | ✅ | ✅ | ✅ | ✅ | |
Holistic | ✅ | ✅ | ✅ | ✅ | ✅ | |
Selfie Segmentation | ✅ | ✅ | ✅ | ✅ | ✅ | |
Hair Segmentation | ✅ | ✅ | ||||
Object Detection | ✅ | ✅ | ✅ | ✅ | ||
Box Tracking | ✅ | ✅ | ✅ | |||
Instant Motion Tracking | ✅ | |||||
Objectron | ✅ | ✅ | ✅ | ✅ | ||
KNIFT | ✅ | |||||
AutoFlip | ✅ | |||||
MediaSequence | ✅ | |||||
YouTube 8M | ✅ |
You need to enter Docker to run the case. The ROS1 environment is located in the Docker image.
sh ~/ros_melodic.sh
Source code location: ~/yahboomcar_ws/src/yahboomcar_mediapipe/scripts
xxxxxxxxxx
roscore
rosrun yahboomcar_mediapipe 01_HandDetector.py
Source code location: ~/yahboomcar_ws/src/yahboomcar_mediapipe/scripts
xxxxxxxxxx
roscore
rosrun yahboomcar_mediapipe 02_PoseDetector.py
Source code location: ~/yahboomcar_ws/src/yahboomcar_mediapipe/scripts
xxxxxxxxxx
roscore
rosrun yahboomcar_mediapipe 03_Holistic.py
Source code location: ~/yahboomcar_ws/src/yahboomcar_mediapipe/scripts
xxxxxxxxxx
roscore
rosrun yahboomcar_mediapipe 04_FaceMesh.py
Source code location: ~/yahboomcar_ws/src/yahboomcar_mediapipe/scripts
xxxxxxxxxx
roscore
rosrun yahboomcar_mediapipe 05_FaceEyeDetection.py
Source code location: ~/yahboomcar_ws/src/yahboomcar_mediapipe/scripts
xxxxxxxxxx
cd ~/yahboomcar_ws/src/yahboomcar_mediapipe/scripts
python3 06_FaceLandmarks.py
Source code location: ~/yahboomcar_ws/src/yahboomcar_mediapipe/scripts
xxxxxxxxxx
cd ~/yahboomcar_ws/src/yahboomcar_mediapipe/scripts
python3 08_Objectron.py
Click the camera preview screen and press the f
key to switch the recognized model.
Source code location: ~/yahboomcar_ws/src/yahboomcar_mediapipe/scripts
xxxxxxxxxx
cd ~/yahboomcar_ws/src/yahboomcar_mediapipe/scripts
python3 09_VirtualPaint.py
Source code location: ~/yahboomcar_ws/src/yahboomcar_mediapipe/scripts
xxxxxxxxxx
cd ~/yahboomcar_ws/src/yahboomcar_mediapipe/scripts
python3 10_HandCtrl.py
Source code location: ~/yahboomcar_ws/src/yahboomcar_mediapipe/scripts
xxxxxxxxxx
cd ~/yahboomcar_ws/src/yahboomcar_mediapipe/scripts
python3 11_GestureRecognition.py