Preprint / Version 1

Mediapipe Fingering Classification for Violin

##article.authors##

  • Spencer Wang Student

DOI:

https://doi.org/10.58445/rars.1670

Keywords:

Mediapipe, Violin, AI, Classification, SVM

Abstract

The extent of the potential of Mediapipe in analyzing and classifying complex hand patterns, like those found in violin hand movements, remains elusive. This research explores the feasibility of using Mediapipe, an open-source framework that allows the extraction of finger position key points from images/videos, to detect violin fingering positions for playing a virtual instrument. Further, we explore the limitations and potential of Mediapipe for recognizing complex hand movements involved in playing the violin by using a combination of Google Colab and support vector classification models. The repeated training on misclassified images was able to take the F1 score from an initial value of 0.82 to a final value of 0.95. This initial research shows the potential for a video based virtual violin instrument. Further areas of exploration could include expanding the total image training data and testing viability of other classifiers.

References

Westermann, Dirk, and Johann M. Marz. “Understanding Wind Turbine Condition Monitoring Systems.” Condition Monitoring, 2014, pp. 13–44. Springer, doi:10.1007/978-3-319-12093-5_2.

Stern, Frederick. “Evaluation of System Identification for Operational Modal Analysis of Naval Structures.” MIT DSpace, Massachusetts Institute of Technology, 2023, dspace.mit.edu/bitstream/handle/1721.1/150654/3544549.3585838.pdf?sequence=1&isAllowed=y.

Gil, Manuel, et al. “Applications of UAVs in Forest Fire Management: A Systematic Review.” Journal of Imaging, vol. 6, no. 8, 2020, p. 73, doi:10.3390/jimaging6080073.

Goodfellow, Ian, et al. Deep Learning. MIT Press, 2016, www.deeplearningbook.org.

Verbeke, Mathias, et al. “The Electronic Tabla Controller.” ResearchGate, 2011, www.researchgate.net/publication/221164928_The_Electronic_Tabla_Controller.

Lugmayr, Artur, et al. “MediaPipe: A Framework for Building Perception Pipelines.” arXiv, 19 June 2019, arxiv.org/abs/1906.08172.

Lugaresi, C et al. “Mediapipe: A framework for building perception pipelines.” arXiv preprint, 2019, https://arxiv.org/abs/1906.08172

Cortes, C., & Vapnik, V. Support-vector networks. Machine Learning, 1995, 20(3), 273-297. doi:10.1007/BF00994018.

The AIGuysCode. “Colab Webcam.” GitHub, 2020, github.com/theAIGuysCode/colab-webcam/blob/main/colab_webcam.ipynb.

Evidently AI. “Confusion Matrix: Classification Metrics.” Evidently AI, 2020, www.evidentlyai.com/classification-metrics/confusion-matrix.

Downloads

Posted

2024-09-24