Dr Shahana Bano s.bano@rgu.ac.uk
Lecturer
Dr Shahana Bano s.bano@rgu.ac.uk
Lecturer
Pavuluri Jithendra
Gorsa Lakshmi Niharika
Yalavarthi Sikhi
Speech acts as a barrier to communication between two individuals and helps them in expressing their feelings, thoughts, emotions, and ideologies among each other. The process of establishing a communicational interaction between the machine and mankind is known as Natural Language processing. Speech recognition aids in translating the spoken language into text. We have come up with a Speech Recognition model that converts the speech data given by the user as an input into the text format in his desired language. This model is developed by adding Multilingual features to the existent Google Speech Recognition model based on some of the natural language processing principles. The goal of this research is to build a speech recognition model that even facilitates an illiterate person to easily communicate with the computer system in his regional language.
BANO, S., JITHENDRA, P., NIHARIKA, G.L. and SIKHI, Y. 2020. Speech to text translation enabling multilingualism. In Proceedings of the 2020 International conference for innovation in technology (INOCON 2020), 6-8 November 2020, Bangluru, India. Piscataway: IEEE [online]. Available from: https://doi.org/10.1109/INOCON50539.2020.9298280
Presentation Conference Type | Conference Paper (published) |
---|---|
Conference Name | 2020 International conference for innovation in technology (INOCON 2020) |
Start Date | Nov 6, 2020 |
End Date | Nov 8, 2020 |
Acceptance Date | Jul 30, 2020 |
Online Publication Date | Jan 1, 2021 |
Publication Date | Dec 31, 2020 |
Deposit Date | Sep 20, 2023 |
Publicly Available Date | Sep 20, 2023 |
Publisher | Institute of Electrical and Electronics Engineers (IEEE) |
Peer Reviewed | Peer Reviewed |
ISBN | 9781728197456 |
DOI | https://doi.org/10.1109/INOCON50539.2020.9298280 |
Keywords | Speech recognition; Natural language processing; Language translation; Machine learning |
Public URL | https://rgu-repository.worktribe.com/output/2064062 |
BANO 2020 Speech to text translation (AAM)
(337 Kb)
PDF
Copyright Statement
© IEEE
Fabric variation and visualization using light dependent factor.
(2023)
Presentation / Conference Contribution
Vehicle spotting in nighttime using gamma correction.
(2022)
Presentation / Conference Contribution
Comprehending object detection by deep learning methods and algorithms.
(2022)
Presentation / Conference Contribution
Detection of image forgery for forensic analytics.
(2022)
Presentation / Conference Contribution
About OpenAIR@RGU
Administrator e-mail: publications@rgu.ac.uk
This application uses the following open-source libraries:
Apache License Version 2.0 (http://www.apache.org/licenses/)
Apache License Version 2.0 (http://www.apache.org/licenses/)
SIL OFL 1.1 (http://scripts.sil.org/OFL)
MIT License (http://opensource.org/licenses/mit-license.html)
CC BY 3.0 ( http://creativecommons.org/licenses/by/3.0/)
Powered by Worktribe © 2025
Advanced Search