AG-2026.04-2094·physics.optics
YOSO: single-frame Gerchberg-Saxton phase retrieval with AI-based data augmentation for in-line holography
Authors
- Julianna Winnik
- Adam Walocha
- Wojciech Ogonowski
- Wiktor Forjasz
- Piotr Arcab
- Mikołaj Rogalski
- Aleksandra Rutkowska
- Marzena Stefaniuk
- José Ángel Picazo-Bueno
- Vicente Micó
- Maciej Trusiak
- Maria Cywińska
Abstract
We present YOSO (You Only Shot Once), a single-frame phase retrieval framework for digital in-line holographic microscopy (DIHM) in which supervised deep learning is used to numerically generate an additional hologram corresponding to different defocus distance, creating a so-called multi-height dataset, which is then conventionally processed with a well-established Gerchberg-Saxton (GS) algorithm. YOSO is trained on computer-generated data derived from natural images, enabling strong generalization. The selected multi-scale ResNet architecture enables rapid training in under two hours on a mid-range workstation, which is done only once, enabling efficient inference thereafter. We further show that YOSO network can process inputs of varying spatial dimensions, allowing training on small inputs and direct inference on full-sized holograms while bypassing patch-and-stitch procedure. A further advantage of YOSO is its physics-consistent hologram padding, which replaces conventional zero or edge-value padding with a physically grounded approach compatible with the GS framework. The YOSO framework is tested on various systems (lens-based and lensless DIHM) and diverse samples: a resolution test target, adherent and suspended biological cells, and a mouse brain slice. The results show that YOSO is compatible with 3D objects and correctly recovers defocused object wave features, enabling holographic postprocessing such as numerical refocusing. The results of this work are available publicly as software for end-to-end implementation.
Submitted
30 April 20261 month ago
Version
v1
License
CC-BY-4.0
DOI
10.48550/arXiv.2604.27777
Summary
YOSO uses AI to simulate extra holograms from a single photo, then applies classical image-recovery math to reconstruct 3D objects from in-line holograms—making the technique faster and more practical.
- A deep learning model generates synthetic holograms at different focal depths from one real image, effectively multiplying your data without taking more pictures.
- The approach combines neural networks with the Gerchberg-Saxton algorithm (a well-understood iterative method), so it inherits both AI speed and classical physics reliability.
- It works on real biological samples and doesn't require breaking images into patches, making it practical for high-resolution microscopy without extra computational tricks.
curious · generated by claude-haiku-4-5
Chat with this PDF
Ask questions, probe assumptions, request a plain-English summary. Answers cite sections from the preprint itself.
Community
Questions and answers about this paper from other readers. No formal peer review — just a place to think out loud.