16+
DOI: 10.18413/2518-1092-2026-11-1-0-3

SYSTEM ARCHITECTURE FOR ASR OF AGGLUTINATIVE LOW-RESOURCE LANGUAGES

The relevance of the research is driven by the need to overcome the digital divide, which is particularly acute for low-resource languages. While speakers of widely spoken languages actively use voice assistants, transcription systems, and other speech technologies, small indigenous peoples are left behind in the digital progress. This inequality deprives people of access to modern means of communication, education, and information in their native language, leading to their further marginalization and accelerating the process of language extinction. The development of specialized solutions for automatic speech recognition (ASR) under low-resource conditions is a key step towards expanding technological accessibility. The article addresses the problem
of developing automatic speech recognition (ASR) systems for low-resource languages, specifically Kabardian. It presents a comprehensive approach, including the adaptation of the Massively Multilingual Speech (MMS) model, data preprocessing, as well as the development and integration of language models for post-processing. The main focus is on the MMS model architecture, based on Wav2Vec 2.0, and its modification using Language-Specific Adapter Heads (LSAH), which enables efficient fine-tuning of the model on limited datasets. The stages of audio and text data preprocessing are described. The architectures and results of applying n-gram (3-gram, 5-gram) and neural network (mT5-base) language models for correcting errors in the ASR output are considered. The practical significance of the work is confirmed by the creation of a functional open-source system with a web interface on the Hugging Face Spaces platform, demonstrating the feasibility of building effective ASR solutions for minority languages.

Number of views: 135 (view statistics)
Количество скачиваний: 462
Full text (PDF)Скачать XMLTo articles list
  • User comments
  • Reference lists

While nobody left any comments to this publication.
You can be first.

Leave comment: