AG-2026.04-1824·astro-ph.GA
TwinSpecNet: Extending APOGEE's chemical reach to low-S/N spectra via empirical paired learning
Authors
- Weijia Sun
- Cristina Chiappini
- Samir Nepal
Abstract
Large spectroscopic surveys rely on automated pipelines to deliver homogeneous stellar labels, but a substantial fraction of observations have low signal-to-noise ratio (S/N), where label estimates become imprecise or are omitted. In APOGEE, these low-S/N visits sample faint and distant populations -- the bulge, outer halo, and satellite systems -- yet still encode recoverable chemical information. We present TwinSpecNet (TSN), a paired-learning framework that exploits APOGEE's multi-visit observing strategy: by training on empirical low-/high-S/N spectral twins of the same stars, TSN learns to suppress stochastic noise while preserving the ASPCAP label scale. TSN employs a Vision Transformer encoder with dual objectives: reconstructing high-S/N flux from low-S/N visits and predicting stellar parameters and abundances with calibrated uncertainties. TSN reduces label scatter relative to visit-level ASPCAP for visits with S/N < 60. TSN reproduces the ASPCAP scale with residual scatters of $\sigma < 19$ K in $T_{\mathrm{eff}}$, $\sigma \sim 0.06$ dex in $\log g$, and $\sigma \sim 0.03$ dex in [Fe/H]. TSN tightens intra-cluster abundance dispersions, recovers cleaner chemical sequences in inner-disk, bulge, and satellite samples, and improves [C/N]-based age precision for APOKASC giants from 1.70 to 1.59 Gyr. By learning survey-specific noise patterns from repeated observations, TSN demonstrates how empirical paired learning can extend the chemical reach of existing spectroscopic data, providing a template applicable to other multi-visit surveys.
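The paired-learning idea and the dual objective described in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation. The function names `make_twin_pairs` and `dual_objective_loss`, the `high_min` cut, the pairing of individual visits with each other, the Gaussian negative log-likelihood used for the label term, and the `weight` parameter are all assumptions made for illustration; only the S/N < 60 regime comes from the abstract.

```python
import numpy as np


def make_twin_pairs(star_ids, snr, low_max=60.0, high_min=100.0):
    """Pair each low-S/N visit with every high-S/N visit of the same star.

    low_max mirrors the S/N < 60 regime quoted in the abstract.
    high_min is a hypothetical cut chosen only for this illustration.
    Returns a list of (low_index, high_index) tuples.
    """
    star_ids = np.asarray(star_ids)
    snr = np.asarray(snr, dtype=float)
    pairs = []
    for star in np.unique(star_ids):
        idx = np.flatnonzero(star_ids == star)   # visits of this star
        low = idx[snr[idx] < low_max]            # noisy inputs
        high = idx[snr[idx] >= high_min]         # cleaner targets
        pairs.extend((int(i), int(j)) for i in low for j in high)
    return pairs


def dual_objective_loss(flux_pred, flux_high, label_mu, label_log_var,
                        label_true, weight=1.0):
    """Reconstruction MSE plus a Gaussian negative log-likelihood on labels.

    The NLL term (constant dropped) is one common way to obtain per-label
    uncertainties; whether TSN uses it is an assumption of this sketch.
    """
    recon = np.mean((flux_pred - flux_high) ** 2)
    nll = 0.5 * np.mean(
        label_log_var + (label_true - label_mu) ** 2 / np.exp(label_log_var)
    )
    return recon + weight * nll


if __name__ == "__main__":
    # Toy catalogue: star "a" has two noisy visits and one clean visit;
    # star "b" has no visit above the hypothetical high_min cut.
    pairs = make_twin_pairs(["a", "a", "a", "b", "b"], [30, 150, 45, 20, 80])
    print(pairs)

    loss = dual_objective_loss(
        flux_pred=np.zeros(4), flux_high=np.ones(4),
        label_mu=np.zeros(2), label_log_var=np.zeros(2),
        label_true=np.ones(2),
    )
    print(loss)
```

In this toy setup a star contributes training pairs only if it has at least one visit on each side of the two cuts, which is why multi-visit coverage matters for this kind of training set.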
Submitted
29 April 2026
Version
v1
License
CC-BY-4.0
DOI
10.48550/arXiv.2604.26491
Summary
A machine-learning method called TwinSpecNet improves chemical measurements from noisy (low signal-to-noise) stellar spectra by learning from the same stars observed multiple times, extending APOGEE's ability to study distant and faint stellar populations.
- TwinSpecNet trains on repeated observations of the same stars to learn how to clean up noisy spectra while preserving accurate chemical signatures, reducing measurement scatter for low-quality data.
- The method reproduces the label scale of the survey's own pipeline (ASPCAP) with small residual scatter (about 0.03 dex in [Fe/H]), making noisy faint-star spectra more useful scientifically.
- This approach works because it's tailored to a specific survey's noise patterns; the technique could be adapted to other multi-visit spectroscopic surveys to unlock chemistry in their faintest observations.
curious · generated by claude-haiku-4-5