TY - JOUR
T1 - A multi-band AGN-SFG classifier for extragalactic radio surveys using machine learning
AU - Karsten, J.
AU - Wang, L.
AU - Margalef-Bentabol, B.
AU - Best, P. N.
AU - Kondapally, R.
AU - La Marca, A.
AU - Morganti, R.
AU - Röttgering, H. J.A.
AU - Vaccari, M.
AU - Sabater, J.
N1 - Publisher Copyright:
© 2023 Authors. All rights reserved.
PY - 2023/7
Y1 - 2023/7
N2 - Context. Extragalactic radio continuum surveys play an increasingly more important role in galaxy evolution and cosmology studies. While radio galaxies and radio quasars dominate at the bright end, star-forming galaxies (SFGs) and radio-quiet active galactic nuclei (AGNs) are more common at fainter flux densities. Aims. Our aim is to develop a machine-learning classifier that can efficiently and reliably separate AGNs and SFGs in radio continuum surveys. Methods. We performed a supervised classification of SFGs versus AGNs using the light gradient boosting machine (LGBM) on three LOFAR Deep Fields (Lockman Hole, Boötes, and ELAIS-N1), which benefit from a wide range of high-quality multi-wavelength data and classification labels derived from extensive spectral energy distribution (SED) analyses. Results. Our trained model has a precision of 0.92±0.01 and a recall of 0.87±0.02 for SFGs. For AGNs, the model performs slightly worse, with a precision of 0.87±0.02 and a recall of 0.78±0.02. These results demonstrate that our trained model can successfully reproduce the classification labels derived from a detailed SED analysis. The model performance decreases towards higher redshifts, which is mainly due to smaller training sample sizes. To make the classifier more adaptable to other radio galaxy surveys, we also investigate how our classifier performs with a poorer multi-wavelength sampling of the SED. In particular, we find that the far-infrared and radio bands are of great importance. We also find that a higher signal-to-noise ratio in some photometric bands leads to a significant boost in the model performance. In addition to using the 150 MHz radio data, our model can also be used with 1.4 GHz radio data. Converting 1.4 GHz to 150 MHz radio data reduces the performance by ∼4% in precision and ∼3% in recall.
AB - Context. Extragalactic radio continuum surveys play an increasingly more important role in galaxy evolution and cosmology studies. While radio galaxies and radio quasars dominate at the bright end, star-forming galaxies (SFGs) and radio-quiet active galactic nuclei (AGNs) are more common at fainter flux densities. Aims. Our aim is to develop a machine-learning classifier that can efficiently and reliably separate AGNs and SFGs in radio continuum surveys. Methods. We performed a supervised classification of SFGs versus AGNs using the light gradient boosting machine (LGBM) on three LOFAR Deep Fields (Lockman Hole, Boötes, and ELAIS-N1), which benefit from a wide range of high-quality multi-wavelength data and classification labels derived from extensive spectral energy distribution (SED) analyses. Results. Our trained model has a precision of 0.92±0.01 and a recall of 0.87±0.02 for SFGs. For AGNs, the model performs slightly worse, with a precision of 0.87±0.02 and a recall of 0.78±0.02. These results demonstrate that our trained model can successfully reproduce the classification labels derived from a detailed SED analysis. The model performance decreases towards higher redshifts, which is mainly due to smaller training sample sizes. To make the classifier more adaptable to other radio galaxy surveys, we also investigate how our classifier performs with a poorer multi-wavelength sampling of the SED. In particular, we find that the far-infrared and radio bands are of great importance. We also find that a higher signal-to-noise ratio in some photometric bands leads to a significant boost in the model performance. In addition to using the 150 MHz radio data, our model can also be used with 1.4 GHz radio data. Converting 1.4 GHz to 150 MHz radio data reduces the performance by ∼4% in precision and ∼3% in recall.
KW - Catalogs
KW - Galaxies: active
KW - Methods: data analysis
UR - http://www.scopus.com/inward/record.url?scp=85166271737&partnerID=8YFLogxK
U2 - 10.1051/0004-6361/202346770
DO - 10.1051/0004-6361/202346770
M3 - Article
AN - SCOPUS:85166271737
SN - 0004-6361
VL - 675
JO - Astronomy and Astrophysics
JF - Astronomy and Astrophysics
M1 - A159
ER -