Digital Repository

Geometric Interpretations of the k-Nearest Neighbour Distributions

Show simple item record

dc.contributor.advisor BANERJEE, ARKA
dc.contributor.advisor Abel, Tom
dc.contributor.author GANGOPADHYAY, KWANIT
dc.date.accessioned 2025-05-15T04:23:35Z
dc.date.available 2025-05-15T04:23:35Z
dc.date.issued 2025-05
dc.identifier.citation 81 en_US
dc.identifier.uri http://dr.iiserpune.ac.in:8080/xmlui/handle/123456789/9869
dc.description.abstract The k-Nearest Neighbour Cumulative Distribution Functions are measures of clustering for discrete datasets that are fast and efficient to compute. They are significantly more informative than the 2-point correlation function. Their connection to N-point correlation functions, void probability functions and Counts-in-Cells is known. However, the connections between the CDFs and geometric and topological summary statistics are yet to be fully explored.This understanding will be crucial to find optimally informative summary statistics to analyse data from stage-4 cosmological surveys. We explore quantitatively the geometric interpretations of the kNN CDF summary statistics. We establish an equivalence between the 1NN CDF at radius r and the volume of spheres with the same radius around data points. We show that higher kNN CDFs represent the volumes of intersections of at least k spheres around data points. We present similar geometric interpretations for the kNN cross-correlation CDFs. We further show that the full shape of the CDFs have information about planar angles, solid angles and arc lengths created at the intersections of spheres around the data points, and can be accessed through the derivatives of the CDF. We show that this information is equivalent to that captured by Germ Grain Minkowski Functionals. Using Fisher analyses we compare the constraining power of various data vectors constructed from kNN CDFs and Minkowski Functionals. We find that the CDFs and their derivatives and the Minkowski Functionals have nearly identical constraining power. However, the CDFs are computationally orders of magnitude faster to evaluate. en_US
dc.description.sponsorship Startup Research Grant (SRG/2023/000378) from the Science and Engineering Research Board (SERB), India en_US
dc.language.iso en en_US
dc.subject Research Subject Categories::NATURAL SCIENCES en_US
dc.subject Physics en_US
dc.subject Cosmology en_US
dc.title Geometric Interpretations of the k-Nearest Neighbour Distributions en_US
dc.type Thesis en_US
dc.description.embargo No Embargo en_US
dc.type.degree BS-MS en_US
dc.contributor.department Dept. of Physics en_US
dc.contributor.registration 20201162 en_US


Files in this item

This item appears in the following Collection(s)

  • MS THESES [2003]
    Thesis submitted to IISER Pune in partial fulfilment of the requirements for the BS-MS Dual Degree Programme/MSc. Programme/MS-Exit Programme

Show simple item record

Search Repository


Advanced Search

Browse

My Account