Actuator prototype system by voice commands using free software
DOI:
https://doi.org/10.29019/enfoqueute.v7n2.94Keywords:
voice recognition, voice commands, spectral differences, python, free software applicationsAbstract
This prototype system is a software application that through the use of techniques of digital signal processing, extracts information from the user's speech, which is then used to manage the on/off actuator on a peripheral computer when vowels are pronounced. The method applies spectral differences. The application uses the parallel port as actuator, with the information recorded in the memory address 378H. This prototype was developed using free software tools for its versatility and dynamism, and to allow other researchers to base on it for further studies.
Downloads
References
Audacity. (29 de 04 de 2016). Audacity Español. Obtenido de Audacity Español: http://audacity.es/
Bernal, J. (2000). Reconocimiento de Voz y Fonética Acústica. Madrid: Ra-Ma.
Chen, Y. L.-M. (2015). “Locallyconnected d and convolutional neural networks for smalL footprint speaker recognition,”. Interspeech.
García, C., & Tapia, D. (2000). Estudio de la Frecuencia Fundamental de la Voz y de sus Efectos en el. Proyecto de Fin de Carrera E.T.S.I. Madrid, España: U. Politécnica de Madrid.
Grayson, J. (2000). Phyton and Tkinter Programming‖, . Manning Publications Co.
H. Aronowitz, R. H. (2011). “New developments in voice biometrics for user. n Interspeech, Florence, 17-20.
Hans, P. (2011). A Primer on Scientific Programming with Python. New York: Springer.
Phyton. (29 de 04 de 2016). Idle Phyton. Obtenido de Idle Phyton: https://docs.python.org/2/library/idle.html
Phyton. (29 de 04 de 2016). Python GUI Programming (Tkinter). Obtenido de Python GUI Programming (Tkinter): http://www.tutorialspoint.com/python/python_gui_programming.htm
Poor, H. (1985). An Introduction to Signal Detection and Estimation. New York: Springer- Verlag.
Thomas, T., Pecham, J., & Frangoulis, E. (1989). A Determination of the Sensitivity of Speech Recognisers to Speaker Variability. Proceedings of ICASSP, (págs. 544-547). Glasgow.
Thomas, T., Pecham, J., Frangoulis, E., & Cove, J. (14989). A The Sensitivity of Speech Recognisers to Speaker Variability and Speaker Variation. Proc of Eurospeech, (págs. 408-411). Paris.
Published
How to Cite
Issue
Section
License
The articles and research published by the UTE University are carried out under the Open Access regime in electronic format. This means that all content is freely available without charge to the user or his/her institution. Users are allowed to read, download, copy, distribute, print, search, or link to the full texts of the articles, or use them for any other lawful purpose, without asking prior permission from the publisher or the author. This is in accordance with the BOAI definition of open access. By submitting an article to any of the scientific journals of the UTE University, the author or authors accept these conditions.
The UTE applies the Creative Commons Attribution (CC-BY) license to articles in its scientific journals. Under this open access license, as an author you agree that anyone may reuse your article in whole or in part for any purpose, free of charge, including commercial purposes. Anyone can copy, distribute or reuse the content as long as the author and original source are correctly cited. This facilitates freedom of reuse and also ensures that content can be extracted without barriers for research needs.
This work is licensed under a Creative Commons Attribution 3.0 International (CC BY 3.0).
The Enfoque UTE journal guarantees and declares that authors always retain all copyrights and full publishing rights without restrictions [© The Author(s)]. Acknowledgment (BY): Any exploitation of the work is allowed, including a commercial purpose, as well as the creation of derivative works, the distribution of which is also allowed without any restriction.