Automatic Speech Recognition and Translation for Low Resource Languages 9781394214174, 9781394213580

Automatic Speech Recognition and Translation for Low Resource Languages contains groundbreaking research from experts an

121 82 43MB

English Pages 646 Year 2024

Table of contents :
Cover
Table of Contents
Series Page
Title Page
Copyright Page
Dedication Page
Foreword
Preface
Acknowledgement
1 A Hybrid Deep Learning Model for Emotion Conversion in Tamil Language
1.1 Introduction
1.2 Dataset Collection and Database Preparation
1.3 Pre-Trained CNN Architectural Models
1.4 Proposed Method for Emotion Transformation
1.5 Synthesized Speech Evaluation
1.6 Conclusion
References
2 Attention-Based End-to-End Automatic Speech Recognition System for Vulnerable Individuals in Tamil
2.1 Introduction
2.2 Related Work
2.3 Dataset Description
2.4 Implementation
2.5 Results and Discussion
2.6 Conclusion
References
3 Speech-Based Dialect Identification for Tamil
3.1 Introduction
3.2 Literature Survey
3.3 Proposed Methodology
3.4 Experimental Setup and Results
3.5 Conclusion
References
4 Language Identification Using Speech Denoising Techniques: A Review
4.1 Introduction
4.2 Speech Denoising and Language Identification
4.3 The Noisy Speech Signal is Denoised Using Temporal and Spectral Processing
4.4 The Denoised Signal is Classified to Identify the Language Spoken Using Recent Machine Learning Algorithm
4.5 Conclusion
References
5 Domain Adaptation-Based Self-Supervised ASR Models for Low-Resource Target Domain
5.1 Introduction
5.2 Literature Survey
5.3 Dataset Description
5.4 Self-Supervised ASR Model
5.5 Domain Adaptation for Low-Resource Target Domain
5.6 Implementation of Domain Adaptation on wav2vec2 Model for Low-Resource Target Domain
5.7 Results Analysis
5.8 Conclusion
Acknowledgements
References
6 ASR Models from Conventional Statistical Models to Transformers and Transfer Learning
6.1 Introduction
6.2 Preprocessing
6.3 Feature Extraction
6.4 Generative Models for ASR
6.5 Discriminative Models for ASR
6.6 Deep Architectures for Low-Resource Languages
6.7 The DNN-HMM Hybrid System
6.8 Summary
References
7 Syllable-Level Morphological Segmentation of Kannada and Tulu Words
7.1 Introduction
7.2 Related Work
7.3 Corpus Construction and Annotation
7.4 Methodology
7.5 Experiments and Results
7.6 Conclusion and Future Work
References
8 A New Robust Deep Learning-Based Automatic Speech Recognition and Machine Transition Model for Tamil and Gujarati
8.1 Introduction
8.2 Literature Survey
8.3 Proposed Architecture
8.4 Experimental Setup
8.5 Results
8.6 Conclusion
References
9 Forensic Voice Comparison Approaches for Low-Resource Languages
9.1 Introduction
9.2 Challenges of Forensic Voice Comparison
9.3 Motivation
9.4 Review on Forensic Voice Comparison Approaches
9.5 Low-Resource Language Datasets
9.6 Applications of Forensic Voice Comparison
9.7 Future Research Scope
9.8 Conclusion
References
10 CoRePooL—Corpus for Resource-Poor Languages: Badaga Speech Corpus
10.1 Introduction
10.2 CoRePooL
10.3 Benchmarking
10.4 Conclusion
Acknowledgement
References
11 Bridging the Linguistic Gap: A Deep Learning-Based Image-to-Text Converter for Ancient Tamil with Web Interface
11.1 Introduction
11.2 The Historical Significance of Ancient Tamil Scripts
11.3 Realization Process
11.4 Dataset Preparation
11.5 Convolution Neural Network
11.6 Webpage with Multilingual Translator
11.7 Results and Discussions
11.8 Conclusion and Future Work
References
12 Voice Cloning for Low-Resource Languages: Investigating the Prospects for Tamil
12.1 Introduction
12.2 Literature Review
12.3 Dataset
12.4 Methodology
12.5 Results and Discussion
12.6 Conclusion
References
13 Transformer-Based Multilingual Automatic Speech Recognition (ASR) Model for Dravidian Languages
13.1 Introduction
13.2 Literature Review
13.3 Dataset Description
13.4 Methodology
13.5 Experimentation Results and Analysis
13.6 Conclusion
References
14 Language Detection Based on Audio for Indian Languages
14.1 Introduction
14.2 Literature Review
14.3 Language Detector System
14.4 Experiments and Outcomes
14.5 Conclusion
References
15 Strategies for Corpus Development for Low-Resource Languages: Insights from Nepal
15.1 Low-Resource Languages and the Constraints
15.2 Language Resources Map for the Languages of Nepal
15.3 Unicode Inception and Advent in Nepal
15.4 Speech and Translation Initiatives
15.5 Corpus Development Efforts—Sharing Our Experiences
15.6 Constraints to Competitive Language Technology Research for Nepali and Nepal’s Languages
15.7 Roadmap for the Future
15.8 Conclusion
References
16 Deep Neural Machine Translation (DNMT): Hybrid Deep Learning Architecture-Based English-to-Indian Language Translation
16.1 Introduction
16.2 Literature Survey
16.3 Background
16.4 Proposed System
16.5 Experimental Setup and Results Analysis
16.6 Conclusion and Future Work
References
17 Multiview Learning-Based Speech Recognition for Low-Resource Languages
17.1 Introduction
17.2 Approaches of Information Fusion in ASR
17.3 Partition-Based Multiview Learning
17.4 Data Augmentation Techniques
17.5 Conclusion
References
18 Automatic Speech Recognition Based on Improved Deep Learning
18.1 Introduction
18.2 Literature Review
18.3 Proposed Methodology
18.4 Results and Discussion
18.5 Conclusion
References
19 Comprehensive Analysis of State-of-the-Art Approaches for Speaker Diarization
19.1 Introduction
19.2 Generic Model of Speaker Diarization System
19.3 Review of Existing Speaker Diarization Techniques
19.4 Challenges
19.5 Applications
19.6 Conclusion
References
20 Spoken Language Translation in Low-Resource Language
20.1 Introduction
20.2 Related Work
20.3 MT Algorithms
20.4 Dataset Collection
20.5 Conclusion
References
Index
End User License Agreement

Automatic Speech Recognition and Translation for Low Resource Languages
9781394214174, 9781394213580

Author / Uploaded
L. Ashok Kumar
D. Karthika Renuka
Bharathi Raja Chakravarthi
Thomas Mandl

0 0 0
Like this paper and download? You can publish your own PDF file online for free in a few minutes! Sign Up

Recommend Papers

Interacting with Computers by Voice Automatic Speech Recognition and Synthesis

511 118 568KB Read more

Efficient Algorithms for Speech Recognition

446 113 1MB Read more

Automatic Speech Translation: Fundamental Technology for Future Cross-Language Communications 9782919875023

Automatic Speech Translation introduces recent results of Japanese research and development in speech translation and sp

170 5 12MB Read more

Speech Recognition and Understanding 9781774691847

This book entitled, "Introduction to Speech and Language Therapy", has been designed to add to the knowledge o

158 73 14MB Read more

Speech and Language Technologies for Low-Resource Languages - First International Conference, SPELLL 2022, Kalavakkam, India, November 23–25, 2022, Proceedings [1 ed.] 9783031332302, 9783031332319

This book constitutes refereed proceedings from the First International Conference on Speech and Language Technologies f

98 5 28MB Read more

Speech and Language Technologies for Low-Resource Languages : First International Conference, SPELLL 2022, Kalavakkam, India, November 23–25, 2022, Proceedings [1 ed.] 9783031332319, 9783031332302

This book constitutes refereed proceedings from the First International Conference on Speech and Language Technologies f

97 81 34MB Read more

Deep learning for NLP and speech recognition 9783030145958, 9783030145965, 3030145956

This textbook explains Deep Learning Architecture, with applications to various NLP Tasks, including Document Classifica

432 112 19MB Read more

Translation (Resource Books for Teachers) 0194371042, 9780194371049

This book is designed to help EFL and ESL teachers with the practicalities involved in setting up and managing self-acce

392 113 4MB Read more

Toward Spontaneous Speech Recognition and Understanding

432 86 1MB Read more

Speech Recognition using Neural Networks

450 45 847KB Read more