Machine Learning in Sound Analytic

MOHAN, ANIKET

DR Home
→
THESES & PROJECT REPORTS
→
MS THESES
→
View Item

dc.contributor.advisor	Borwankar, Prabhanjan	en_US
dc.contributor.author	MOHAN, ANIKET	en_US
dc.date.accessioned	2021-07-02T11:00:33Z
dc.date.available	2021-07-02T11:00:33Z
dc.date.issued	2021-07
dc.identifier.citation	50	en_US
dc.identifier.uri	http://dr.iiserpune.ac.in:8080/xmlui/handle/123456789/6003
dc.description.abstract	In this project we focus on different deep Learning algorithms for noisy audio enhancement where traditional Digital signal Processing (DSP) techniques fail to enhance noisy audio clips, we also worked on the classification of different enhanced Industrial sounds and compared the results with not enhanced Industrial audio. For sound enhancement, we used the magnitude spectrum of audio. Considering the temporal and spatial features we investigated four different deep learning architectures on speech datasets to select the most suitable architecture for the enhancement of Industrial sounds. The architectures consisted of Feed Forward Neural Network, Convolution Neural Network, Recurrent Neural Network. We trained the models using noisy clean training pairs. The trained model acted as a filter for background noise. To examine the enhancement performance we measured Noise reduction, speech distortion, and perceptual estimation of speech quality. The Experimental results show Convolution and recurrent neural network layers increased the performance of the models. For the classification of audio clips, we used Mel spectrogram features of audio clips. In this problem, we investigated different deep learning architectures. Here we use Full convolution neural networks for classification and also used transfer learning to implement ResNet50 and efficient net for classification. To measure the model performance we used Precision, Recall F1-Score as metrics. The experiment results showed that most of the architecture did not give good results as compared to not enhanced Audio.	en_US
dc.language.iso	en	en_US
dc.subject	Deep Learning	en_US
dc.subject	Machine Learning	en_US
dc.subject	Spectrograms	en_US
dc.subject	Neural Networks	en_US
dc.title	Machine Learning in Sound Analytic	en_US
dc.type	Thesis	en_US
dc.type.degree	BS-MS	en_US
dc.contributor.department	Dept. of Data Science	en_US
dc.contributor.registration	20161030	en_US

Files in this item

Name: ms_thesis_final.pdf

Size: 4.241Mb

Format: PDF

View/Open

This item appears in the following Collection(s)

MS THESES [1980]
Thesis submitted to IISER Pune in partial fulfilment of the requirements for the BS-MS Dual Degree Programme/MSc. Programme/MS-Exit Programme

Show simple item record

Search Repository

Advanced Search

Browse

All of Repository
This Collection
- Titles
- Authors
- By Advisor
- By Issue Date
- Subjects
- By Type
- By Department

Machine Learning in Sound Analytic

Files in this item

This item appears in the following Collection(s)

Search Repository

Browse

All of Repository

This Collection

My Account