Digital Repository

Probabilistic Unsupervised Learning with Heterogeneous Noisy Data

Show simple item record

dc.contributor.advisor NARLIKAR, LEELAVATI
dc.contributor.author PARDHI, SAMARTH
dc.date.accessioned 2024-05-17T09:18:23Z
dc.date.available 2024-05-17T09:18:23Z
dc.date.issued 2024-05
dc.identifier.citation 59 en_US
dc.identifier.uri http://dr.iiserpune.ac.in:8080/xmlui/handle/123456789/8825
dc.description.abstract Mixture models are widely used in situations where the interest is in the probability densities. In such cases, the focus is usually to estimate parameters and mixing probabilities with successful clustering. But in real-life applications, analysis becomes substantially challenging for two reasons; the first one is when we introduce heterogeneous features of continuous, categorical, and even ordinal type, and the second one is too many features make the algorithm computationally expensive and not all features contribute for the inference. Feature selection will be implemented for the same. Goal of this project is to summarize the existing methods and develop model-based approaches that are robust and scalable. These approaches will enable model-based clustering simultaneously selecting relevant features within heterogeneous data. Idea is to use Bayesian framework using the Gibbs sampling techniques. These are very popular techniques in mixture models. We investigate Gaussian and Categorical features in model-based clustering, assuming the number of cluster is finite and does not grow with sample size; such frameworks are called finite mixture model. These proposed methods are compared with maximum likelihood approach, which uses the Expected-Maximisation algorithm. en_US
dc.language.iso en en_US
dc.subject Bayesian Theory en_US
dc.subject Model-based Clustering en_US
dc.subject Mixture Model en_US
dc.subject Markov Chain Monte Carlo en_US
dc.subject Feature Selection en_US
dc.title Probabilistic Unsupervised Learning with Heterogeneous Noisy Data en_US
dc.type Thesis en_US
dc.description.embargo One Year en_US
dc.type.degree BS-MS en_US
dc.contributor.department Dept. of Mathematics en_US
dc.contributor.registration 20191078 en_US


Files in this item

This item appears in the following Collection(s)

  • MS THESES [1705]
    Thesis submitted to IISER Pune in partial fulfilment of the requirements for the BS-MS Dual Degree Programme/MSc. Programme/MS-Exit Programme

Show simple item record

Search Repository


Advanced Search

Browse

My Account