dc.contributor |
Graduate Program in Electrical and Electronic Engineering. |
|
dc.contributor.advisor |
Arslan, Levent M. |
|
dc.contributor.author |
Akyürek, Muhammed Furkan. |
|
dc.date.accessioned |
2023-03-16T10:20:41Z |
|
dc.date.available |
2023-03-16T10:20:41Z |
|
dc.date.issued |
2020. |
|
dc.identifier.other |
EE 2020 A58 |
|
dc.identifier.uri |
http://digitalarchive.boun.edu.tr/handle/123456789/12984 |
|
dc.description.abstract |
Array microphone processing is a complex application with multiple interlinked components, such as direction-of-arrival estimation for the audio sources, beamforming, and postfiltering, all of which depend on the array geometry. Array microphones gained popularity with the advent of smart speakers. In this thesis, an end-to-end solution is provided that contains all of the array microphone processing components, with denoising integrated into the core of the system using a deep learning method called autoencoders. The neural network is trained on magnitude spectra generated from a dataset created exclusively for this thesis by combining several publicly available speech and noise datasets. This thesis proposes a single-channel and a multichannel speech enhancement model to solve the beamforming problem. The multichannel autoencoder model is shown to outperform some common conventional beamforming methods under objective evaluation measures. The results of this thesis indicate room for improvement in this field through the use of neural networks. |
|
dc.format.extent |
30 cm. |
|
dc.publisher |
Thesis (M.S.) - Bogazici University. Institute for Graduate Studies in Science and Engineering, 2020. |
|
dc.subject.lcsh |
Microphone arrays. |
|
dc.subject.lcsh |
Speech processing systems. |
|
dc.title |
Application of deep learning for array microphone processing |
|
dc.format.pages |
xvii, 71 leaves ; |
|