TU Darmstadt / ULB / TUprints

Accuracy Evaluation of 3D Pose Reconstruction Algorithms Through Stereo Camera Information Fusion for Physical Exercises with MediaPipe Pose

Dill, Sebastian ; Ahmadi, Arjang ; Grimmer, Martin ; Haufe, Dennis ; Rohr, Maurice ; Zhao, Yanhua ; Sharbafi, Maziar ; Hoog Antink, Christoph (2025)
Accuracy Evaluation of 3D Pose Reconstruction Algorithms Through Stereo Camera Information Fusion for Physical Exercises with MediaPipe Pose.
In: Sensors, 2024, 24 (23)
doi: 10.26083/tuprints-00028980
Article, Secondary publication, Publisher's Version

[img] Text
sensors-24-07772-v2.pdf
Copyright Information: CC BY 4.0 International - Creative Commons, Attribution.

Download (2MB)
Item Type: Article
Type of entry: Secondary publication
Title: Accuracy Evaluation of 3D Pose Reconstruction Algorithms Through Stereo Camera Information Fusion for Physical Exercises with MediaPipe Pose
Language: English
Date: 15 January 2025
Place of Publication: Darmstadt
Year of primary publication: December 2024
Place of primary publication: Basel
Publisher: MDPI
Journal or Publication Title: Sensors
Volume of the journal: 24
Issue Number: 23
Collation: 18 Seiten
DOI: 10.26083/tuprints-00028980
Corresponding Links:
Origin: Secondary publication DeepGreen
Abstract:

In recent years, significant research has been conducted on video-based human pose estimation (HPE). While monocular two-dimensional (2D) HPE has been shown to achieve high performance, monocular three-dimensional (3D) HPE poses a more challenging problem. However, since human motion happens in a 3D space, 3D HPE offers a more accurate representation of the human, granting increased usability for complex tasks like analysis of physical exercise. We propose a method based on MediaPipe Pose, 2D HPE on stereo cameras and a fusion algorithm without prior stereo calibration to reconstruct 3D poses, combining the advantages of high accuracy in 2D HPE with the increased usability of 3D coordinates. We evaluate this method on a self-recorded database focused on physical exercise to research what accuracy can be achieved and whether this accuracy is sufficient to recognize errors in exercise performance. We find that our method achieves significantly improved performance compared to monocular 3D HPE (median RMSE of 30.1 compared to 56.3, p-value below 10−6) and can show that the performance is sufficient for error recognition.

Uncontrolled Keywords: computer vision, human pose estimation, information fusion, MediaPipe Pose
Identification Number: Artikel-ID: 7772
Status: Publisher's Version
URN: urn:nbn:de:tuda-tuprints-289807
Additional Information:

This article belongs to the Special Issue: Deep Learning Applications for Pose Estimation and Human Action Recognition

Classification DDC: 600 Technology, medicine, applied sciences > 610 Medicine and health
600 Technology, medicine, applied sciences > 621.3 Electrical engineering, electronics
Divisions: 18 Department of Electrical Engineering and Information Technology > Artificial Intelligent Systems in Medicine (KISMED)
03 Department of Human Sciences > Institut für Sportwissenschaft > Sportbiomechanik
Zentrale Einrichtungen > Centre for Cognitive Science (CCS)
Date Deposited: 15 Jan 2025 12:13
Last Modified: 15 Jan 2025 12:14
SWORD Depositor: Deep Green
URI: https://tuprints.ulb.tu-darmstadt.de/id/eprint/28980
PPN:
Export:
Actions (login required)
View Item View Item