4.6 Article

A semi-supervised approach to dark matter searches in direct detection data with machine learning

Journal

Publisher

IOP Publishing Ltd
DOI: 10.1088/1475-7516/2022/02/039

Keywords

Machine learning; dark matter simulations; dark matter experiments; dark matter detectors

Funding

  1. Generalitat Valenciana through the GenT Excellence Program [CIDEGENT/2020/020]
  2. Ministerio de Ciencia e Innovacion MICIN/AEI [PID2020-113334GB-I00, PID2020-113644GB-I00]
  3. research grant The Dark Universe: A Synergic Multimessenger Approach - MIUR [2017X7X85K]
  4. project Theoretical Astroparticle Physics (TAsP) - INFN
  5. European Union through the 2014-2020 FEDER Operative Programme of Comunitat Valenciana [IDIFEDER/2018/048]

Ask authors/readers for more resources

The paper applies modern machine learning techniques to dark matter direct detection, utilizing a convolutional variational autoencoder and a convolutional neural network in a semi-supervised manner for anomaly detection. The results show that the optimum performance is achieved when both unsupervised and supervised anomaly scores are considered together, and datasets with higher anomaly scores are deemed anomalous and require further investigation. These learning-focused anomaly detection methods have the potential to outperform likelihood-based methods.
The dark matter sector remains completely unknown. It is therefore crucial to keep an open mind regarding its nature and possible interactions. Focusing on the case of Weakly Interacting Massive Particles, in this work we make this general philosophy more concrete by applying modern machine learning techniques to dark matter direct detection. We do this by encoding and decoding the graphical representation of background events in the XENONnT experiment with a convolutional variational autoencoder. We describe a methodology that utilizes the 'anomaly score' derived from the reconstruction loss of the convolutional variational autoencoder as well as a pre-trained standard convolutional neural network, in a semi-supervised fashion. Indeed, we observe that optimum results are obtained only when both unsupervised and supervised anomaly scores are considered together. A data set that has a higher proportion of anomaly score is deemed anomalous and deserves further investigation. Contrary to classical analyses, in principle all information about the events is used, preventing unnecessary information loss. Lastly, we demonstrate the reach of learning-focused anomaly detection in this context by comparing results with classical inference, observing that, if tuned properly, these techniques have the potential to outperform likelihood-based methods.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available