Please use this identifier to cite or link to this item: http://dr.iiserpune.ac.in:8080/xmlui/handle/123456789/6003
Full metadata record
DC Field | Value | Language
dc.contributor.advisor | Borwankar, Prabhanjan | en_US
dc.contributor.author | MOHAN, ANIKET | en_US
dc.date.accessioned | 2021-07-02T11:00:33Z |
dc.date.available | 2021-07-02T11:00:33Z |
dc.date.issued | 2021-07 |
dc.identifier.citation | 50 | en_US
dc.identifier.uri | http://dr.iiserpune.ac.in:8080/xmlui/handle/123456789/6003 |
dc.description.abstract | In this project we focus on deep learning algorithms for enhancing noisy audio in cases where traditional digital signal processing (DSP) techniques fail. We also worked on the classification of enhanced industrial sounds and compared the results with those for non-enhanced industrial audio. For sound enhancement, we used the magnitude spectrum of the audio. Considering temporal and spatial features, we investigated four different deep learning architectures on speech datasets to select the most suitable architecture for enhancing industrial sounds. The architectures included feed-forward, convolutional, and recurrent neural networks. We trained the models on pairs of noisy and clean audio, so that the trained model acted as a filter for background noise. To examine enhancement performance, we measured noise reduction, speech distortion, and perceptual estimation of speech quality. The experimental results show that convolutional and recurrent layers improved the performance of the models. For the classification of audio clips, we used Mel spectrogram features and investigated different deep learning architectures: fully convolutional neural networks, as well as ResNet50 and EfficientNet implemented through transfer learning. To measure model performance, we used precision, recall, and F1-score. The experimental results showed that most of the architectures did not give good results on enhanced audio compared with non-enhanced audio. | en_US
dc.language.iso | en | en_US
dc.subject | Deep Learning | en_US
dc.subject | Machine Learning | en_US
dc.subject | Spectrograms | en_US
dc.subject | Neural Networks | en_US
dc.title | Machine Learning in Sound Analytic | en_US
dc.type | Thesis | en_US
dc.type.degree | BS-MS | en_US
dc.contributor.department | Dept. of Data Science | en_US
dc.contributor.registration | 20161030 | en_US
Appears in Collections: MS THESES

Files in This Item:
File | Description | Size | Format
ms_thesis_final.pdf | | 4.34 MB | Adobe PDF


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.