March 21, 2026
Journal Article

Dense Autoencoders, Clustering Techniques, and Semi-Supervised Learning for HPGe Gamma-Spectra

Abstract

Classifying high-resolution gamma spectra by their isotopic content is an essential task in nuclear forensics and other applications. Traditional analysis methods are often time-intensive, but machine learning (ML) may help analysts quickly process many spectra. Such methods tend to rely on abundant, well-labeled data for training. Historical gamma data exists in various fields but is not uniformly useful for supervised ML due to inconsistent labeling. To address some of these challenges, we present a method to classify and organize unlabeled data from high-purity germanium detectors using an autoencoding neural network (autoencoder). We trained dense autoencoders to compress gamma data into latent representations that enable efficient data characterization. By clustering the encoded spectra or lower-dimensional mappings of them, we identified and removed portions of over-abundant data categories, resulting in a more balanced dataset and improved autoencoder performance. This encoding and clustering pipeline also enabled the organization of spectra into self-consistent categories. Finally, we found that encoded representations showed potential as inputs for semi-supervised learning of nuclide identification (NID) labels, achieving an average F1 score of 0.85 +/- 0.03 when mapping encodings to a set of 65 isotope labels.

Published: March 21, 2026

Citation

Lonsway M.J., K.L. Truax, S.B. Emmons, B.D. Pierson, and B.C. Archambault. 2026. Dense Autoencoders, Clustering Techniques, and Semi-Supervised Learning for HPGe Gamma-Spectra. Nuclear Instruments and Methods in Physics Research. Section A: Accelerators, Spectrometers, Detectors and Associated Equipment 1088:171484. PNNL-SA-217987. doi:10.1016/j.nima.2026.171484

Research topics