Please use this identifier to cite or link to this item: http://dr.iiserpune.ac.in:8080/xmlui/handle/123456789/10801
Title: Geometric interpretations of the k-nearest neighbour distributions
Authors: GANGOPADHYAY, KWANIT
BANERJEE, ARKA
Abel, Tom
Dept. of Physics
Keywords: Methods: statistical
Cosmological parameters
Large-scale structure of Universe
2025
Issue Date: Nov-2025
Publisher: Oxford University Press
Citation: Monthly Notices of the Royal Astronomical Society, 543(04), 3409–3427.
Abstract: The k-nearest neighbour (NN) cumulative distribution functions (CDFs) are measures of clustering for discrete data sets that are fast and efficient to compute. They are significantly more informative than the two-point correlation function. Their connection to N-point correlation functions, void probability functions, and counts-in-cells is known. However, the connections between the CDFs and geometric and topological summary statistics are yet to be fully explored. This understanding will be crucial to find optimally informative summary statistics to analyse data from stage-4 cosmological surveys. We explore quantitatively the geometric interpretations of the kNN CDF summary statistics. We establish an equivalence between the 1NN CDF at radius r and the volume of spheres with the same radius around data points. We show that higher kNN CDFs represent the volumes of intersections of spheres around data points. We present similar geometric interpretations for the kNN cross-correlation CDFs. We further show that the full shape of the CDFs have information about planar angles, solid angles, and arc lengths created at the intersections of spheres around the data points, and can be accessed through the derivatives of the CDF. We show that this information is equivalent to that captured by Germ–Grain Minkowski Functionals. Using Fisher analyses, we compare the constraining power of various data vectors constructed from kNN CDFs and Minkowski Functionals. We find that the CDFs and their derivatives and the Minkowski Functionals have nearly identical constraining power. However, the CDFs are computationally orders of magnitude faster to evaluate.
URI: https://doi.org/10.1093/mnras/staf1637
http://dr.iiserpune.ac.in:8080/xmlui/handle/123456789/10801
ISSN: 1365-2966
0035-8711
Appears in Collections:JOURNAL ARTICLES

Files in This Item:
There are no files associated with this item.


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.