An Expert System for Automatic Classification of Sound Signals
2020, Journal of Telecommunications and Information Technology
https://doi.org/10.26636/JTIT.2020.143220Abstract
In this paper, we present the results of research focusing on methods for recognition/classification of audio signals. We consider the results of the research project to serve as a basis for the main module of a hybrid expert system currently under development. In our earlier studies, we conducted research on the effectiveness of three classifiers: fuzzy classifier, neural classifier and WEKA system for reference data. In this project, a particular emphasis was placed on fine-tuning the fuzzy classifier model and on identifying neural classifier applications, taking into account new neural networks that we have not studied so far in connection with sounds classification methods. Keywords—audio descriptors, bird species, fuzzy classification of audio signals, MPEG-7, spectral features of sound.
FAQs
AI
What neural network architecture was employed for bird species classification?
A three-layer neural network was used, analyzed with different activation functions, achieving a 78% mean classification efficiency.
How were sound signal features extracted for bird classification?
Low-level descriptors (LLD) from the MPEG-7 standard provided the input vectors, quantifying signal characteristics.
What role does fuzzy logic play in the expert system design?
Fuzzy logic enhances classification by defining triangular fuzzy sets for bird species, achieving 65% effectiveness.
What implications does the research have for agriculture and species monitoring?
Successful identification of bird species can inform crop protection strategies and aid ornithological population studies.
How does sample size influence the neural network's classification performance?
The tests were conducted with a 50-sample learning set; larger sets could improve recognition levels.
References (24)
- J. Hook, F. Noroozi, O. Toygar, and G. Anbarjafari, "Automatic speech based emotion recognition using paralinguistics features". Bull. of the Polish Acad. of Sci.: Techn. Sci., vol. 67, no. 3, pp. 479-488, 2019 (DOI: 10.24425/bpasts.2019.129647).
- Ł. Mik, A. Lorenc, D. Król, R. Wielgat, R. Święciński, and R. Jędryka "Fusing the electromagnetic articulograph, high-speed video cameras and a 16-channel microphone array for speech anal- ysis", Bull. of the Polish Acad. of Sci.: Techn. Sci., vol. 66, no. 3, pp. 256-266, 2018 (DOI: 10.24425/122106).
- K. Tyburek, P. Prokopowicz, P. Kotlarz, and M. Repka. "Comparison of the efficiency of time and frequency descriptors based on different classification conceptions", in Proc. 14th Int. Conf. on Artif. Intell. and Soft Comput. ICAISC 2015 , Zakopane, Poland, 2015 (DOI: 10.1007/978-3-319-19324-3-44).
- M. Kubanek, J. Bobulski, and L. Adrjanowicz "Characteristics of the use of coupled hidden Markov models for audio-visual Polish speech recognition", Bull. of the Polish Acad. of Sci.: Techn. Sci., vol. 60, no. 2, pp. 307-316, 2012 (DOI: 10.2478/V10175-012-0041-6).
- K. Tyburek "Klasyfikacja instrumentów strunowych w multimedi- alnych bazach danych ze szczególnym uwzględnieniem artykulacji pizzicato (Classification of string instruments in multimedia database especially for pizzicato articulation)", Ph.D. Thesis, Institute of Fun- damental Technological, Research Polish Academy of Sciences, War- saw, 2008 (in Polish).
- K. Tyburek, W. Cudny, and W. Kosińnski "Pizzicato sound analysis of selected instruments in the frequency domain", Image Process. & Commun., vol. 11, no. 1, pp. 53-57, 2006.
- J. Niemi and J. T. Tanttu, "Deep learning case study for automatic bird identification", Appl. Sci., vol. 8, no. 11, 2018 (DOI: 10.3390/app8112089).
- J. Niemi and J. T. Tanttu, "Automatic bird identification for offshore wind farms: A case study for deep learning" , in Proc. of Int. Symp. ELMAR 2017, Zadar, Croatia, 2017 (DOI: 10.23919/ELMAR.2017.8124482).
- S.-J. Hong, Y. Han, S.-Y. Kim, A.-Y. Lee, and G. Kim, "Application of deep-learning methods to bird detection using unmanned aerial vehicle imagery", Sensors, vol. 19, no. 7, pp. 1-16, 2019 (DOI: 10.3390/s19071651).
- S. Balemarthy, A. Sajjanhar, and X. Zheng, "Our practice of using machine learning to recognize species by voice", 2018 [Online]. Available: https://arxiv.org/ftp/arxiv/papers/1810/1810.09078.pdf
- M. Lasseck, "Improved Automatic Bird Identification through Deci- sion Tree based Feature Selection and Bagging". in Proc. Working Notes of CLEF 2015 -Conf. and Labs of the Eval. forum, Toulouse, France, 2015 [Online]. Available: http://ceur-ws.org/ Vol-1391/160-CR.pdf
- B. S. Manjunath, P. Salembier, and T. Sikora, Eds., Introduction to MPEG-7: Multimedia Content Description Interface. Chichester: Wiley, 2002 (ISBN: 978-0-471-48678-7).
- H. G. Kim, N. Moreau, and T. Sikora, MPEG7 Audio and Beyond: Audio Content Indexing and Retrieval. Wiley, 2005 (ISBN: 9780470093344).
- K. K. Pathak, S. Panthi, and N. Ramakrishnan "Application of neural network in sheet metal bending process", Defence Sci. J., vol. 55, no. 2, pp. 125-131, 2005 (DOI: 10.14429/dsj.55.1976).
- T. Hannagan "The delta rule does Bubbles", J. of Vision, vol. 13, no. 8, pp. 1-11, 2013 (DOI: 10.1167/13.8.17).
- L. C. de Barros, R. C. Bassanezi, and W. A. Lodwick, A First Course in Fuzzy Logic, Fuzzy Dynamical Systems, and Biomathematics. Theory and Applications. Studies in Fuzziness and Soft Computing, vol. 347. Berlin, Heidelberg: Springer, 2017 (ISBN: 9783662533222).
- P. Prokopowicz and D. Ślęzak, "Ordered fuzzy numbers: Definitions and operations", in Theory and Applications of Ordered Fuzzy Num- bers, P. Prokopowicz, J. Czerniak, D. Mikołajewski, Ł. Apiecionek, and D. Ślęzak, Eds. Studies in Fuzziness and Soft Computing, vol. 356, pp. 57-79. Springer, 2017 (DOI: 10.1007/978-3-319-59614-3 4).
- P. Prokopowicz, "Processing direction with ordered fuzzy numbers", in Theory and Applications of Ordered Fuzzy Numbers, P. Prokopow- icz, J. Czerniak, D. Mikołajewski, Ł. Apiecionek, and D. Ślęzak, Eds. Studies in Fuzziness and Soft Computing, vol. 356, pp. 81-98.
- Springer, 2017 (DOI: 10.1007/978-3-319-59614-3 5).
- W. Siler and J. J. Buckley, Fuzzy Expert Systems and Fuzzy Reason- ing.
- Wiley, 2005 (ISBN: 9780471388593).
- P. Prokopowicz, "The use of ordered fuzzy numbers for modelling changes in dynamic processes", Inform. Sci., vol. 470, pp. 1-14, 2019 (DOI: https://doi.org/10.1016/j.ins.2018.08.045).
- C. Marechal et al., "Survey on AI-based multimodal methods for emotion detection", in High-Performance Modelling and Simulation for Big Data Applications, J. Kołodziej and H. Gonzalez-Velez, Eds. LNCS, vol. 11400, pp. 307-324. Springer, 2019 (DOI 10.1007/978-3-030-16272-6 11).
- M. Grochowski, A. Kwasigroch, and A. Mikołajczyk, "Selected tech- nical issues of deep neural networks for image classification pur- poses", Bull. of the Polish Acad. of Sci.: Techn. Sci., vol. 67, no. 2, pp. 363-376, 2019 (DOI: 10.24425/bpas.2019.128485).