AG-2024.02-1635·quant-ph·cross-listed: cs.LG
Arbitrary Polynomial Separations in Trainable Quantum Machine Learning
Authors
- Eric R. Anschuetz
- Xun Gao
Abstract
Recent theoretical results in quantum machine learning have demonstrated a general trade-off between the expressive power of quantum neural networks (QNNs) and their trainability; as a corollary of these results, practical exponential separations in expressive power over classical machine learning models are believed to be infeasible as such QNNs take a time to train that is exponential in the model size. We here circumvent these negative results by constructing a hierarchy of efficiently trainable QNNs that exhibit unconditionally provable, polynomial memory separations of arbitrary constant degree over classical neural networks -- including state-of-the-art models, such as Transformers -- in performing a classical sequence modeling task. This construction is also computationally efficient, as each unit cell of the introduced class of QNNs only has constant gate complexity. We show that contextuality -- informally, a quantitative notion of semantic ambiguity -- is the source of the expressivity separation, suggesting that other learning tasks with this property may be a natural setting for the use of quantum learning algorithms.
Submitted
13 February 2024
Version
v1
License
CC-BY-4.0
DOI
10.48550/arXiv.2402.08606
Summary
The authors construct a hierarchy of efficiently trainable quantum neural networks that exhibit provable polynomial memory separations, of arbitrary constant degree, over classical neural networks on a classical sequence modeling task, circumventing the usual trade-off between expressive power and trainability.
- The key innovation is a hierarchy of quantum neural networks that remain efficiently trainable while achieving polynomial memory separations over classical neural networks, including Transformers. This avoids the exponential training times that earlier results associate with highly expressive quantum models.
- Contextuality, described informally in the abstract as a quantitative notion of semantic ambiguity, is identified as the source of the expressivity separation, suggesting that other learning tasks with this property may be natural settings for quantum learning algorithms.
- Each unit cell of the introduced quantum neural networks has only constant gate complexity, so the construction is also computationally efficient.
curious · generated by claude-haiku-4-5