Comparison of density estimation methods for astronomical datasets

B. J. Ferdosi*, H. Buddelmeijer, S.C. Trager, M. H. F. Wilkinson, J. B. T. M. Roerdink

*Corresponding author for this work

Research output: Contribution to journalArticleAcademicpeer-review

23 Citations (Scopus)
315 Downloads (Pure)

Abstract

Context. Galaxies are strongly influenced by their environment. Quantifying the galaxy density is a difficult but critical step in studying the properties of galaxies.

Aims. We aim to determine differences in density estimation methods and their applicability in astronomical problems. We study the performance of four density estimation techniques: k-nearest neighbors (kNN), adaptive Gaussian kernel density estimation (DEDICA), a special case of adaptive Epanechnikov kernel density estimation (MBE), and the Delaunay tessellation field estimator (DTFE).

Methods. The density estimators are applied to six artificial datasets and on three astronomical datasets, the Millennium Simulation and two samples from the Sloan Digital Sky Survey. We compare the performance of the methods in two ways: first, by measuring the integrated squared error and Kullback-Leibler divergence of each of the methods with the parametric densities of the datasets (in case of the artificial datasets); second, by examining the applicability of the densities to study the properties of galaxies in relation to their environment (for the SDSS datasets).

Results. The adaptive kernel based methods, especially MBE, perform better than the other methods in terms of calculating the density properly and have stronger predictive power in astronomical use cases.

Conclusions. We recommend the modified Breiman estimator as a fast and reliable method to quantify the environment of galaxies.

Original languageEnglish
Article numberA114
Number of pages16
JournalAstronomy & astrophysics
Volume531
DOIs
Publication statusPublished - Jul-2011

Keywords

  • methods: data analysis
  • methods: statistical
  • methods: miscellaneous
  • DIGITAL SKY SURVEY
  • SCALE-INDEPENDENT METHOD
  • PARTICLE HYDRODYNAMICS
  • PROBABILITY DENSITY
  • LUMINOSITY FUNCTION
  • CLUSTER-ANALYSIS
  • LEAST-SQUARES
  • GALAXY COLOR
  • DATA RELEASE
  • COSMIC WEB

Cite this