4.6 Article

Machine-learning identification of galaxies in the WISE x SuperCOSMOS all-sky catalogue

期刊

ASTRONOMY & ASTROPHYSICS
卷 596, 期 -, 页码 -

出版社

EDP SCIENCES S A
DOI: 10.1051/0004-6361/201629165

关键词

methods: data analysis; methods: numerical; astronomical databases: miscellaneous; galaxies: statistics; large-scale structure of Universe

资金

  1. National Aeronautics and Space Administration
  2. UK Science and Technology Facilities Council
  3. Alfred P. Sloan Foundation
  4. National Science Foundation
  5. University of Arizona
  6. Brazilian Participation Group
  7. Brookhaven National Laboratory
  8. Carnegie Mellon University
  9. University of Florida
  10. French Participation Group
  11. German Participation Group
  12. Harvard University
  13. Instituto de Astrofisica de Canarias
  14. Michigan State/Notre Dame/JINA Participation Group
  15. Johns Hopkins University
  16. Lawrence Berkeley National Laboratory
  17. Max Planck Institute for Astrophysics
  18. Max Planck Institute for Extraterrestrial Physics
  19. New Mexico State University
  20. New York University
  21. Ohio State University
  22. Pennsylvania State University
  23. University of Portsmouth
  24. Princeton University
  25. Spanish Participation Group
  26. University of Tokyo
  27. University of Utah
  28. Vanderbilt University
  29. University of Virginia
  30. University of Washington
  31. Yale University
  32. Polish National Science Center [UMO-2012/07/D/ST9/02785]
  33. National Science Centre [UMO-2012/07/B/ST9/04425, UMO-2013/09/D/ST9/04030]
  34. Polish-Swiss Astro Project
  35. European Associated Laboratory Astrophysics Poland-France HECOLS
  36. Netherlands Organization for Scientific Research, NWO [614.001.451]
  37. European Research Council [279396]
  38. Swiss Contribution to the enlarged European Union
  39. US Department of Energy Office of Science

向作者/读者索取更多资源

Context. The two currently largest all-sky photometric datasets, WISE and SuperCOSMOS, have been recently cross-matched to construct a novel photometric redshift catalogue on 70% of the sky. Galaxies were separated from stars and quasars through colour cuts, which may leave imperfections because different source types may overlap in colour space. Aims. The aim of the present work is to identify galaxies in the WISE x SuperCOSMOS catalogue through an alternative approach of machine learning. This allows us to define more complex separations in the multi-colour space than is possible with simple colour cuts, and should provide a more reliable source classification. Methods. For the automatised classification we used the support vector machines (SVM) learning algorithm and employed SDSS spectroscopic sources that we cross-matched with WISE x SuperCOSMOS to construct the training and verification set. We performed a number of tests to examine the behaviour of the classifier (completeness, purity, and accuracy) as a function of source apparent magnitude and Galactic latitude. We then applied the classifier to the full-sky data and analysed the resulting catalogue of candidate galaxies. We also compared the resulting dataset with the one obtained through colour cuts. Results. The tests indicate very high accuracy, completeness, and purity (>95%) of the classifier at the bright end; this deteriorates for the faintest sources, but still retains acceptable levels of similar to 85%. No significant variation in the classification quality with Galactic latitude is observed. When we applied the classifier to all-sky WISE x SuperCOSMOS data, we found 15 million galaxies after masking problematic areas. The resulting sample is purer than the one produced by applying colour cuts, at the price of a lower completeness across the sky. Conclusions. The automatic classification is a successful alternative approach to colour cuts for defining a reliable galaxy sample. The identifications we obtained are included in the public release of the WISE x SuperCOSMOS galaxy catalogue.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.6
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

暂无数据
暂无数据