ABOUT THE SOUNDS RECOGNITION ALGORITHM BASED ON THE COSINE TRANSFORM
This article addresses the problem of recognizing various environmental sounds. Such recognition is widely used in surveillance and control systems and makes it possible to identify objects of different kinds, for example cars, boats, airplanes, animals, and birds. The paper proposes an algorithm for recognizing sounds in an audio signal by analyzing the frequency components of the signal that correspond to the discrete cosine transform coefficients of its fragments. Unlike the Fourier transform, the discrete cosine transform decomposes the signal into real-valued frequency components, which reduces the computational cost of implementing the algorithm. As an example, the developed algorithm uses frequency analysis of the audio signal to identify notes of different octaves. At the preprocessing stage, fragments corresponding to pauses are detected in the original signal and the informative fragments of the audio signal are formed; at the next stage, these fragments are analyzed to recognize the notes. Computational experiments with a model sound signal demonstrated the operation of the developed algorithm.
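The two stages described in the abstract (pause detection, then note recognition from discrete cosine transform coefficients) can be illustrated with a minimal sketch. It is not the authors' implementation: the energy-based pause threshold, the equal-temperament mapping with A4 = 440 Hz, and all function names and parameters (`tone`, `informative_fragments`, `dct2`, `note_name`, `recognize_notes`, `frame_len`, `window`, `max_freq`) are assumptions made for illustration. The DCT-II is computed directly from its definition using only the standard library; a practical implementation would use a fast DCT routine.

```python
import math

NOTE_NAMES = ["C", "C#", "D", "D#", "E", "F", "F#", "G", "G#", "A", "A#", "B"]


def tone(freq_hz, duration_s, fs):
    """Synthesize a pure sine tone (a stand-in for a model sound signal)."""
    count = round(duration_s * fs)
    return [math.sin(2 * math.pi * freq_hz * n / fs) for n in range(count)]


def informative_fragments(signal, frame_len, rel_threshold=0.01):
    """Return (start, end) sample ranges of consecutive non-pause frames.

    Assumption: a frame is a pause when its mean energy does not exceed
    rel_threshold times the largest frame energy in the signal.
    """
    energies = [
        sum(s * s for s in signal[i:i + frame_len]) / frame_len
        for i in range(0, len(signal) - frame_len + 1, frame_len)
    ]
    if not energies:
        return []
    limit = rel_threshold * max(energies)
    fragments, start = [], None
    for idx, energy in enumerate(energies):
        if energy > limit and start is None:
            start = idx * frame_len                       # a fragment begins
        elif energy <= limit and start is not None:
            fragments.append((start, idx * frame_len))    # a pause ends it
            start = None
    if start is not None:
        fragments.append((start, len(energies) * frame_len))
    return fragments


def dct2(x, k_max):
    """First k_max coefficients of the unnormalized DCT-II of x (all real)."""
    step = math.pi / len(x)
    return [
        sum(v * math.cos(step * (n + 0.5) * k) for n, v in enumerate(x))
        for k in range(k_max)
    ]


def note_name(freq_hz):
    """Nearest equal-temperament note, assuming A4 = 440 Hz (MIDI note 69)."""
    midi = round(69 + 12 * math.log2(freq_hz / 440.0))
    return f"{NOTE_NAMES[midi % 12]}{midi // 12 - 1}"


def recognize_notes(signal, fs, frame_len=200, window=2048, max_freq=1000.0):
    """Name the dominant note in each informative fragment of the signal."""
    notes = []
    for start, end in informative_fragments(signal, frame_len):
        fragment = signal[start:min(end, start + window)]
        n = len(fragment)
        # DCT-II coefficient k corresponds to the frequency k * fs / (2 * n).
        k_max = min(n, int(2 * n * max_freq / fs) + 1)
        spectrum = dct2(fragment, k_max)
        # Skip k = 0 (the constant component) when looking for the peak.
        peak = max(range(1, k_max), key=lambda k: abs(spectrum[k]))
        notes.append(note_name(peak * fs / (2 * n)))
    return notes


if __name__ == "__main__":
    fs = 8000
    signal = tone(440.0, 0.3, fs) + [0.0] * 800 + tone(659.26, 0.3, fs)
    print(recognize_notes(signal, fs))
```

The sketch picks only the single strongest coefficient per fragment, so it suits clean single-note fragments; sounds with strong overtones or several simultaneous notes would need a more careful analysis of the coefficient pattern.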
Ursol D.V., Bolgova E.V., Chernomorets D.A., Chernomorets A.A. About the sounds recognition algorithm based on the cosine transform // Research result. Information technologies. Vol. 7, No. 4, 2022. P. 67-75. DOI: 10.18413/2518-1092-2022-7-4-0-8