Probabilistic Unsupervised Learning with Heterogeneous Noisy Data

PARDHI, SAMARTH

DR Home
→
THESES & PROJECT REPORTS
→
MS THESES
→
View Item

dc.contributor.advisor	NARLIKAR, LEELAVATI
dc.contributor.author	PARDHI, SAMARTH
dc.date.accessioned	2024-05-17T09:18:23Z
dc.date.available	2024-05-17T09:18:23Z
dc.date.issued	2024-05
dc.identifier.citation	59	en_US
dc.identifier.uri	http://dr.iiserpune.ac.in:8080/xmlui/handle/123456789/8825
dc.description.abstract	Mixture models are widely used in situations where the interest is in the probability densities. In such cases, the focus is usually to estimate parameters and mixing probabilities with successful clustering. But in real-life applications, analysis becomes substantially challenging for two reasons; the first one is when we introduce heterogeneous features of continuous, categorical, and even ordinal type, and the second one is too many features make the algorithm computationally expensive and not all features contribute for the inference. Feature selection will be implemented for the same. Goal of this project is to summarize the existing methods and develop model-based approaches that are robust and scalable. These approaches will enable model-based clustering simultaneously selecting relevant features within heterogeneous data. Idea is to use Bayesian framework using the Gibbs sampling techniques. These are very popular techniques in mixture models. We investigate Gaussian and Categorical features in model-based clustering, assuming the number of cluster is finite and does not grow with sample size; such frameworks are called finite mixture model. These proposed methods are compared with maximum likelihood approach, which uses the Expected-Maximisation algorithm.	en_US
dc.language.iso	en	en_US
dc.subject	Bayesian Theory	en_US
dc.subject	Model-based Clustering	en_US
dc.subject	Mixture Model	en_US
dc.subject	Markov Chain Monte Carlo	en_US
dc.subject	Feature Selection	en_US
dc.title	Probabilistic Unsupervised Learning with Heterogeneous Noisy Data	en_US
dc.type	Thesis	en_US
dc.description.embargo	One Year	en_US
dc.type.degree	BS-MS	en_US
dc.contributor.department	Dept. of Mathematics	en_US
dc.contributor.registration	20191078	en_US

Files in this item

Name: 20191078_Samarth_ ...

Size: 6.154Mb

Format: PDF

Description: MS Thesis

View/Open

This item appears in the following Collection(s)

MS THESES [1705]
Thesis submitted to IISER Pune in partial fulfilment of the requirements for the BS-MS Dual Degree Programme/MSc. Programme/MS-Exit Programme

Show simple item record

Search Repository

Advanced Search

Browse

All of Repository
This Collection
- Titles
- Authors
- By Advisor
- By Issue Date
- Subjects
- By Type
- By Department

Probabilistic Unsupervised Learning with Heterogeneous Noisy Data

Files in this item

This item appears in the following Collection(s)

Search Repository

Browse

All of Repository

This Collection

My Account