Supplemental Files:
Creator:
Date:
Abstract:
Binaural sound source localization is the determination of the position of a sound source based on two data sensors, microphones, mimicking the human auditory system. Many audio processing systems in our daily work and life rely on sound source localization, such as speech enhancement/recognition and human-robot interaction. However, the accuracy of sound source localization under adverse acoustic scenarios is still hard to ensure. This thesis proposes machine learning with feature extractions to estimate the sound source localization by manipulating and analyzing data collected by public Head Related Transfer Function databases. The two proposed methods are wavelet scattering long short-term memory and wavelet scattering convolutional neural network. These developed methods are studied in classification and regression approaches for different scenarios. The results demonstrate that the proposed methods achieve excellent performance in multiple noisy environments compared to recent literature, especially in regression binaural sound source localization.