<?xml version='1.0' encoding='utf-8'?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.2 20190208//EN" "http://jats.nlm.nih.gov/publishing/1.2/JATS-journalpublishing1.dtd">
<article article-type="research-article" dtd-version="1.2" xml:lang="ru" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink"><front><journal-meta><journal-id journal-id-type="issn">2518-1092</journal-id><journal-title-group><journal-title>Research result. Information technologies</journal-title></journal-title-group><issn pub-type="epub">2518-1092</issn></journal-meta><article-meta><article-id pub-id-type="doi">10.18413/2518-1092-2022-8-3-0-5</article-id><article-id pub-id-type="publisher-id">3225</article-id><article-categories><subj-group subj-group-type="heading"><subject>ARTIFICIAL INTELLIGENCE AND DECISION MAKING</subject></subj-group></article-categories><title-group><article-title>CLASSIFICATION OF SPEECH DATA BY EMOTIONAL BACKGROUND</article-title><trans-title-group xml:lang="en"><trans-title>CLASSIFICATION OF SPEECH DATA BY EMOTIONAL BACKGROUND</trans-title></trans-title-group></title-group><contrib-group><contrib contrib-type="author"><name-alternatives><name xml:lang="ru"><surname>Zhikharev</surname><given-names>Alexander Gennadievich</given-names></name><name xml:lang="en"><surname>Zhikharev</surname><given-names>Alexander Gennadievich</given-names></name></name-alternatives><email>zhikharev@bsu.edu.ru</email></contrib><contrib contrib-type="author"><name-alternatives><name xml:lang="ru"><surname>Chernykh</surname><given-names>Vladimir Sergeevich</given-names></name><name xml:lang="en"><surname>Chernykh</surname><given-names>Vladimir Sergeevich</given-names></name></name-alternatives></contrib></contrib-group><pub-date pub-type="epub"><year>2023</year></pub-date><volume>8</volume><issue>3</issue><fpage>0</fpage><lpage>0</lpage><self-uri content-type="pdf" xlink:href="/media/information/2023/3/ИТ_НР_8.3_5_s4Kl0bc.pdf" /><abstract xml:lang="ru"><p>This paper presents an algorithm, developed by the authors, for classifying speech data by emotional background. In particular, it describes a neural network created to recognize eight different emotions in speech. The neural network was trained on a sample drawn from the RAVDESS dataset, which contains 1440 audio files recorded by 24 actors (12 women and 12 men) with a neutral North American accent.
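For readers reproducing the setup: RAVDESS encodes each recording's metadata in its file name as seven dash-separated fields, with the third field holding the emotion code (01-08). A minimal sketch of deriving the eight emotion labels from file names — illustrative only, not the authors' code, and assuming the standard RAVDESS naming convention:

```python
# Emotion codes as defined by the RAVDESS file-naming convention.
RAVDESS_EMOTIONS = {
    "01": "neutral", "02": "calm", "03": "happy", "04": "sad",
    "05": "angry", "06": "fearful", "07": "disgust", "08": "surprised",
}

def emotion_from_filename(name):
    """Return the emotion label encoded in a RAVDESS file name,
    e.g. '03-01-05-01-02-01-12.wav' (third field '05' means 'angry')."""
    fields = name.removesuffix(".wav").split("-")
    return RAVDESS_EMOTIONS[fields[2]]
```

Labels obtained this way can serve as the training targets for the classifier.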

The paper describes the process of training the neural network with the Keras library, covering the network architecture, layer sizes, activation functions, and optimization methods. It also discusses the stages of preprocessing and preparing the original audio data before training.
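A setup of this general kind might be sketched, purely illustratively, as a small Keras model; the input shape, layer sizes, and optimizer below are assumptions rather than the authors' published architecture, and the LSTM layer is chosen only because the paper's references discuss LSTM networks:

```python
# Hypothetical sketch, NOT the authors' architecture: an 8-way
# speech-emotion classifier over framed audio features.
from tensorflow import keras

model = keras.Sequential([
    keras.layers.Input(shape=(130, 64)),   # (time steps, feature bands) - assumed
    keras.layers.LSTM(128),                # sequence summary vector
    keras.layers.Dense(64, activation="relu"),
    keras.layers.Dropout(0.3),             # regularization
    keras.layers.Dense(8, activation="softmax"),  # eight emotion classes
])
model.compile(optimizer="adam",
              loss="sparse_categorical_crossentropy",
              metrics=["accuracy"])
```

With integer emotion labels, such a model would be trained via `model.fit(features, labels, epochs=..., validation_split=...)`.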

The results of the study show that the developed neural network achieves high performance, recognizing emotions with an accuracy of 80%.</p></abstract><trans-abstract xml:lang="en"><p>This paper presents an algorithm, developed by the authors, for classifying speech data by emotional background. In particular, it describes a neural network created to recognize eight different emotions in speech. The neural network was trained on a sample drawn from the RAVDESS dataset, which contains 1440 audio files recorded by 24 actors (12 women and 12 men) with a neutral North American accent.

The paper describes the process of training the neural network with the Keras library, covering the network architecture, layer sizes, activation functions, and optimization methods. It also discusses the stages of preprocessing and preparing the original audio data before training.

The results of the study show that the developed neural network achieves high performance, recognizing emotions with an accuracy of 80%.</p></trans-abstract><kwd-group xml:lang="ru"><kwd>audio attributes</kwd><kwd>audio</kwd><kwd>audio file</kwd><kwd>audio data</kwd><kwd>emotional background</kwd><kwd>classification</kwd><kwd>model</kwd><kwd>layer</kwd></kwd-group><kwd-group xml:lang="en"><kwd>audio attributes</kwd><kwd>audio</kwd><kwd>audio file</kwd><kwd>audio data</kwd><kwd>emotional background</kwd><kwd>classification</kwd><kwd>model</kwd><kwd>layer</kwd></kwd-group></article-meta></front><back><ref-list><title>References</title><ref id="B1"><mixed-citation>Chollet F. Deep Learning with Python. 2nd ed. – SPb.: Piter. – 576 p. – ISBN 978-5-4461-1909-7.</mixed-citation></ref><ref id="B2"><mixed-citation>Han K., Lee K., Kim H.G. Music emotion recognition using chroma feature-based probabilistic neural network. Multimedia Tools and Applications. – 2017. – Vol. 76, No. 3. – P. 3691-3710.</mixed-citation></ref><ref id="B3"><mixed-citation>Getting to Know the Mel-Spectrogram. [Electronic resource] – Electronic data, 2019. – URL: https://towardsdatascience.com/getting-to-know-the-mel-spectrogram-31bca3e2d9d0.</mixed-citation></ref><ref id="B4"><mixed-citation>Graves A., Schmidhuber J. Framewise phoneme classification with bidirectional LSTM and other neural network architectures. Neural Networks. – 2005. – Vol. 18, No. 5-6. – P. 602-610.</mixed-citation></ref><ref id="B5"><mixed-citation>Understanding LSTM Networks. [Electronic resource] – Electronic data, 2015. – URL: http://colah.github.io/posts/2015-08-Understanding-LSTMs/.</mixed-citation></ref></ref-list></back></article>