R, Deepa and V, Jayalakshmi and P, Thilakavathy and G, Manikandan and R, Surendran (2024) Custom Transformer-Based Approach for Enhanced Bengali Automatic Speech Recognition. In: 2024 First International Conference on Innovations in Communications, Electrical and Computer Engineering (ICICEC), Davangere, India.
Full text not available from this repository. (Request a copy)Abstract
Automated speech recognition (ASR) systems struggle with Bengali, which is the fifth most spoken language. Bengali has a varied morphology, many dialects, and limited high-quality annotated voice data. Traditional voice recognition techniques work well however struggle with Bengali phonetics and syntax. This method eliminates the need for large amounts of tagged data and the difficulties of differentiating sounds in different languages, both issues are addressed by this procedure. The research in this paper offers an Intellectual Bengali Speech Recognition System using Deep Learning Techniques (IBSRS-DLT). The model in this system uses transformer architecture and attention processes to manage Bengali speech pattern long-range dependencies. The IBSRS-DLT outperforms standard models in recognition accuracy using a pre-trained transformer model modified for phonetic and lexical correctness and fine-tuned on Bengali datasets. After training on a vast Bengali speech corpus, real-world datasets and rigorous simulations confirm the system. IBSRS-DLT can be used for automatic transcription, voice-controlled devices, and Bengali virtual assistants, such applications are efficient and accessible. The simulation showed a considerable boost in recognition speed and a decrease in keyword mistakes compared to current methods. This paper shows that deep learning can solve linguistic issues in ASR systems, enabling scalable regional language detection.
Item Type: | Conference or Workshop Item (Paper) |
---|---|
Subjects: | Computer Science Engineering > Automated Machine Learning |
Domains: | Computer Science Engineering |
Depositing User: | Mr IR Admin |
Date Deposited: | 23 Aug 2025 05:47 |
Last Modified: | 23 Aug 2025 05:47 |
URI: | https://ir.vistas.ac.in/id/eprint/10348 |