IRMA-International.org: Creator of Knowledge
Information Resources Management Association
Advancing the Concepts & Practices of Information Resources Management in Modern Organizations

Enhancing Robustness in Speech Recognition using Visual Information

Enhancing Robustness in Speech Recognition using Visual Information
View Sample PDF
Author(s): Omar Farooq (Aligarh Muslim University, India)and Sekharjit Datta (Loughborough University, UK)
Copyright: 2012
Pages: 23
Source title: Speech, Image, and Language Processing for Human Computer Interaction: Multi-Modal Advancements
Source Author(s)/Editor(s): Uma Shanker Tiwary (Indian Institute of Information Technology Allahabad, India)and Tanveer J. Siddiqui (University of Allahabad, India)
DOI: 10.4018/978-1-4666-0954-9.ch008

Purchase

View Enhancing Robustness in Speech Recognition using Visual Information on the publisher's website for pricing and purchasing information.

Abstract

The area of speech recognition has been thoroughly researched during the past fifty years; however, robustness is still an important challenge to overcome. It has been established that there exists a correlation between speech produced and lip motion which is helpful in the adverse background conditions to improve the recognition performance. This chapter presents main components used in audio-visual speech recognition systems. Results of a prototype experiment conducted on audio-visual corpora for Hindi speech have been reported of simple phoneme recognition task. The chapter also addresses some of the issues related to visual feature extraction and the integration of audio-visual and finally present future research directions.

Related Content

Aswathy Ravikumar, Harini Sriraman. © 2023. 18 pages.
Ezhilarasie R., Aishwarya N., Subramani V., Umamakeswari A.. © 2023. 10 pages.
Sangeetha J.. © 2023. 13 pages.
Manivannan Doraipandian, Sriram J., Yathishan D., Palanivel S.. © 2023. 14 pages.
T. Kavitha, Malini S., Senbagavalli G.. © 2023. 36 pages.
Uma K. V., Aakash V., Deisy C.. © 2023. 23 pages.
Alageswaran Ramaiah, Arun K. S., Yathishan D., Sriram J., Palanivel S.. © 2023. 17 pages.
Body Bottom