4.7 Article

SKA Science Data Challenge 2: analysis and results

Journal

MONTHLY NOTICES OF THE ROYAL ASTRONOMICAL SOCIETY
Volume 523, Issue 2, Pages 1967-1993

Publisher

OXFORD UNIV PRESS
DOI: 10.1093/mnras/stad1375

Keywords

methods: data analysis; techniques: imaging spectroscopy; surveys; software: simulations; galaxies: statistics; radio lines: galaxies

Ask authors/readers for more resources

The Square Kilometre Array Observatory (SKAO) will explore the radio sky to conduct transformational science, with large and complex data products that require advanced analysis techniques. SKAO is conducting Science Data Challenges to familiarize the scientific community with the data and drive the development of new analysis techniques. Challenge 2 saw over 100 participants develop new techniques, with the winning strategy combining predictions from two machine learning techniques to achieve a 20% improvement in overall performance.
The Square Kilometre Array Observatory (SKAO) will explore the radio sky to new depths in order to conduct transformational science. SKAO data products made available to astronomers will be correspondingly large and complex, requiring the application of advanced analysis techniques to extract key science findings. To this end, SKAO is conducting a series of Science Data Challenges, each designed to familiarize the scientific community with SKAO data and to drive the development of new analysis techniques. We present the results from Science Data Challenge 2 (SDC2), which invited participants to find and characterize 233 245 neutral hydrogen (H i) sources in a simulated data product representing a 2000 h SKA-Mid spectral line observation from redshifts 0.25-0.5. Through the generous support of eight international supercomputing facilities, participants were able to undertake the Challenge using dedicated computational resources. Alongside the main challenge, 'reproducibility awards' were made in recognition of those pipelines which demonstrated Open Science best practice. The Challenge saw over 100 participants develop a range of new and existing techniques, with results that highlight the strengths of multidisciplinary and collaborative effort. The winning strategy - which combined predictions from two independent machine learning techniques to yield a 20 per cent improvement in overall performance - underscores one of the main Challenge outcomes: that of method complementarity. It is likely that the combination of methods in a so-called ensemble approach will be key to exploiting very large astronomical data sets.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.7
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available