4.6 Article

Unsupervised outlier detection in heavy-ion collisions

Journal

PHYSICA SCRIPTA
Volume 96, Issue 6, Pages -

Publisher

IOP PUBLISHING LTD
DOI: 10.1088/1402-4896/abf214

Keywords

Heavy-ion collisions; Machine learning; Outlier detection

Funding

  1. Development and Promotion of Science and Technology Talents Project (DPST) - Royal Thai Government Scholarship, Suranaree University of Technology (SUT)
  2. Samson AG
  3. BMBF through the ErUM-Data project
  4. Walter Greiner Gesellschaft zur Forderung der physikalischen Grundlagenforschung e.V.
  5. NVIDIA Corporation
  6. GPU Grant
  7. DAAD through a PPP exchange grant

Ask authors/readers for more resources

This study presents various unsupervised learning methods for outlier detection in high energy nuclear collisions, especially in heavy ion collisions. Dimensional reduction algorithms like PCA and AEN are used to distinguish outliers from background. The research finds that the most effective model for separating outlier events requires good performance in event reconstruction while maintaining a small number of parameters.
We present different methods of unsupervised learning which can be used for outlier detection in high energy nuclear collisions. This method is of particular interest for heavy ion collisions where a direct comparison of experimental data to model simulations is often ambiguous and it is not easy to determine whether an observation is due to new physics, an incomplete understanding of the known physics or an experimental artefact. The UrQMD model is used to generate the bulk background of events as well as different variants of outlier events which may result from misidentified centrality or detector malfunctions. The methods presented here can be generalized to different and novel physics effects. To detect the outliers, dimensional reduction algorithms are implemented, speciftically the Principle Component Analysis (PCA) and Autoencoders (AEN). We find that mainly the reconstruction error is a good measure to distinguish outliers from background. The performance of the algorithms is compared using a ROC curve. It is shown that the number of reduced (encoded) dimensions to describe a single event contributes significantly to the performance of the outlier detection task. We find that the model which is best suited to separate outlier events requires a good performance in reconstructing events and at the same time a small number of parameters.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available