Controlling Computer Device Cursors Using Hand Gestures by Utilizing OpenCV, MediaPipe, and PyAutoGui

Authors

  • Muhammad Juzairi Safitli Universitas Nurdin Hamzah
  • Fattachul huda Aminuddin Universitas Nurdin Hamzah
  • Gustina Universitas Nurdin Hamzah

Keywords:

OpenCV, Python, MediaPipe, Pyautogui, controlling the cursor

Abstract

This research presents an innovative interactive solution for controlling computer device cursors by utilizing hand movements using OpenCV, MediaPipe, and PyAutoGui. By combining these three libraries, this program is able to detect and analyze hand movements in real-time, allowing users to control the cursor and click with hand movements. Through image processing methods and hand landmark analysis, this program succeeded in achieving its designed goals. The main goal of the research is to create interactive solutions that are responsive and efficient in controlling computer device cursors. Test results on computer devices with certain specifications show success in detecting hand movements and controlling the cursor well. However, this research still has the potential for further development, such as adding complex hand gesture recognition features and integration with certain applications to improve program functionality and performance.

References

Jubilee Enterprise. (2019). Python for Beginner Programmers. Jakarta: PT Elex Media Komputindo.

Lubis, M. S. (2021). Implementation of Artificial Intelligence in Integrated Manufacturing Systems.Posing of the UISU National Engineering Seminar, 1-7. Retrieved January 26, 2024, from https://jurnal.uisu.ac.id/index.php/semnastek/article/download/4134/2966

PyAutoGui. (2019).Installation. Retrieved January 25, 2024, from PyAutoGUI: https://pyautogui.readthedocs.io/en/latest/install.html

Wali, M., Nengsih, T. A., Hts, D. I. G., Choirina, P., Awaludin, A. A. R., Yusuf, M., ... & Baradja, A. (2023). INTRODUCTION TO THE 15 BEST PROGRAMMING LANGUAGES OF THE FUTURE (Reference & Coding For Beginners). PT. Sonpedia Publishing Indonesia.

Dea, G. (2020). January 2).Get to know the Code Editor, and the Three Code Editors. https://vantura.id/blog/mengenal-code-editor-dan-tiga-code-editor-terpopular

Dr. Aneu Yulianeu, S., & Oktamala, R. (2022). Web-Based Public Transport Route Information System in Tasikmalaya City.Journal of Information Engineering (pp. 125–134). http://jurnal.stmik-dci.ac.id/index.php/jutekin/

Enterprise, J. (2019). Python for Beginner Programmers. PT Elex Media Komputindo.

fattachulhudacom. (2023). Image Processing Using OpenCV Integrated with Python. https://www.fattachulhuda.com/2023/11/pengolahan-citra-using-opencv.html

Han, J.-S., Lee, C.-I., Youn, Y.-H., & Kim, S.-J. (2022). A Study on Real-time Hand Gesture Recognition Technology by Machine Learning-based MediaPipe. Journal of System and Management Sciences, 12(2), 468–482. https://doi.org/10.33168/JSMS.2022.0225

Harahap, H. R. (2023). Concepts and Functions of Programming Algorithms.Journal of Mathematics and Natural Science (pp. 254–257). https://doi.org/10.59581/konsanta-widyakarya.v1i4.1875

Jawas, N. (2017). Hand Movement Tracking for Gesture Recognition.Informatic Technique Journal (pp. 13–23). https://www.e-journal.potensi-utama.ac.id/ojs/index.php/ITJournal/article/download/364/337

Kim, J.-W., Choi, J.-Y., Ha, E.-J., & Choi, J.-H. (2023). Human Pose Estimation Using MediaPipe Pose and Optimization Method Based on a Humanoid Model. Applied Sciences (Switzerland), 13(4). https://doi.org/10.3390/app13042700

Lemonaki, D. (2022). Loops in Python - While True Loop Statement Example.

Ma’Arif, A. (2020). Programming Language Advanced Programming Textbook. https://eprints.uad.ac.id/32743/1/buku%20python.pdf

MediaPipe. (2023). September 13).Project Mediapipe. https://pypi.org/project/mediapipe/

Miftah, S. (2021). Mei 17).Python Library Get to know the differences between modules, packages and libraries in Python (A. W. Davita, Ed.). https://dqlab.id/library-python-kenali-perbedaan-module-package-dan-library-pada-python

Parashar, D., Mishra, O., Sharma, K., & Kukker, A. (2023). Improved Yoga Pose Detection Using MediaPipe and MoveNet in a Deep Learning Model. Revue d’Intelligence Artificielle, 37(5), 1197–1202. https://doi.org/10.18280/ria.370511

Patria, R. (2023). Studio Code Complete Definition, Features. Advantages! https://www.domainesia.com/berita/visual-studio-code/#Visual_Studio_Code_dengan_Code_Editor

Pradono, K. A., Safitri, Y. D., Adhiatma, B. S., Hestrio, Y. F., Soleh, M., Gunawan, H., & Sunarmodo, W. (2020). Development of Automatic Landsat Data Download Engine.IOP Conf. Materials Science and Engineering. https://doi.org/10.1088/1757-899X/1007/1/012109

Prihatiningsih, S., M, N. S., Andriani, F., & Nugraha, N. (2019). Performance Analysis of Handwritten Number Recognition Based on the Number of Iterations Using the Convolutional Neural Network. Method.Scientific Journal of Technology and Engineering, 58–66. https://ejournal.gunadarma.ac.id/index.php/tekno/article/download/1934/1644

Rahman, M. F., & Bambang. (2021). Trash Detection in Real-time Video using the Faster R-CNN. Method.Applied Technology and Computing Science Journal, 117–125. https://doi.org/10.33086/atcsj.v3i2.1846

Rasyid, M. F., Mustafa, M. S., & Suradi, A. A. (2022). Eye Detection in Smartphone Videos Using Mediapipe Python.JOINTECS (pp. 49–56). Journal of Information Technology and Computer Sciense. https://www.researchgate.net/publication/371968761_Eye

Romzi, M., & Kurniawan, B. (2020a). Implementation of Python programming using Visual Studio Code.Journal of Informatics and Computers, 1-9. https://jurnal.unmaha.ac.id/index.php/jik/article/view/11

Romzi, M., & Kurniawan, B. (2020b). Learning Python Programming with an Algorithmic Logic Approach.JTIM. Mahakarya Information Engineering Journal, 37–44. https://journal.unmaha.ac.id/index.php/jtim/article/view/6

Samaan, G. H., Wadie, A. R., Attia, A. K., Asaad, A. M., Kamel, A. E., Slim, S. O., Abdallah, M. S., & Cho, Y.-I. (2022). MediaPipe’s Landmarks with RNN for Dynamic Sign Language Recognition. Electronics (Switzerland), 11(19). https://doi.org/10.3390/electronics11193228

Subramanian, B., Olimov, B., Naik, S. M., Kim, S., Park, K.-H., & Kim, J. (2022). An integrated mediapipe-optimized GRU model for Indian sign language recognition. Scientific Reports, 12(1). https://doi.org/10.1038/s41598-022-15998-7

Suyudil, I., Sudadio, S., & Suherman, S. (2022). Introduction to Indonesian Sign Language using Mediapipe with Random Forest and Multinomial Logistic Regression Models.Jurnal Ilmu Siber dan Teknologi Digital (JISTED (pp. 65–80). https://doi.org/10.35912/jisted.v1i1.1899

Zulkhadi, T. C.-S., Maria, E., & Yulianto. (2019). Facial Shape Pattern Recognition with OpenCV.JURTI, 181-186. https://e-journals.unmul.ac.id/index.php/INF/article/view/4033

Downloads

Published

2024-07-24

How to Cite

Muhammad Juzairi Safitli, Aminuddin, F. huda, & Gustina. (2024). Controlling Computer Device Cursors Using Hand Gestures by Utilizing OpenCV, MediaPipe, and PyAutoGui. Intellecta : Journal of Artificial Intelligence, 1(1), 1–16. Retrieved from https://journal.idscipub.com/intellecta/article/view/173

Issue

Section

Articles