4.7 Article

Transient-optimized real-bogus classification with Bayesian convolutional neural networks - sifting the GOTO candidate stream

Journal

MONTHLY NOTICES OF THE ROYAL ASTRONOMICAL SOCIETY
Volume 503, Issue 4, Pages 4838-4854

Publisher

OXFORD UNIV PRESS
DOI: 10.1093/mnras/stab633

Keywords

methods: data analysis; techniques: photometric; surveys

Funding

  1. Monash-Warwick Alliance
  2. Warwick University
  3. Monash University
  4. Sheffield University
  5. University of Leicester
  6. Armagh Observatory Planetarium
  7. National Astronomical Research Institute of Thailand (NARIT)
  8. University of Turku
  9. University of Manchester
  10. University of Portsmouth
  11. Instituto de Astrofisica de Canarias (IAC)
  12. Science and Technology Facilities Council (STFC)
  13. STFC [ST/T007184/1, ST/T003103/1, ST/P000495/1]
  14. UK Research and Innovation Fellowship [MR/T020784/1]
  15. ERC under the European Union's Horizon 2020 research and innovation programme [715051]
  16. STFC
  17. NASA [NAG5-4741]
  18. NASA PDART [NNX16AG52G]
  19. STFC [2296723] Funding Source: UKRI
  20. NASA [904558, NNX16AG52G] Funding Source: Federal RePORTER

Ask authors/readers for more resources

This paper introduces a new real-bogus classifier based on a Bayesian convolutional neural network for nuanced classification and uncertainty-aware evaluation of transient candidates. The approach achieves competitive classification accuracy with classifiers trained with fully human-labelled data sets, through fully automated training set generation and data-driven augmentation methods.
Large-scale sky surveys have played a transformative role in our understanding of astrophysical transients, only made possible by increasingly powerful machine learning-based filtering to accurately sift through the vast quantities of incoming data generated. In this paper, we present a new real-bogus classifier based on a Bayesian convolutional neural network that provides nuanced, uncertainty-aware classification of transient candidates in difference imaging, and demonstrate its application to the datastream from the GOTO wide-field optical survey. Not only are candidates assigned a well-calibrated probability of being real, but also an associated confidence that can be used to prioritize human vetting efforts and inform future model optimization via active learning. To fully realize the potential of this architecture, we present a fully automated training set generation method which requires no human labelling, incorporating a novel data-driven augmentation method to significantly improve the recovery of faint and nuclear transient sources. We achieve competitive classification accuracy (FPR and FNR both below 1 percent) compared against classifiers trained with fully human-labelled data sets, while being significantly quicker and less labour-intensive to build. This data-driven approach is uniquely scalable to the upcoming challenges and data needs of next-generation transient surveys. We make our data generation and model training codes available to the community.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available