AG-2024.02-1665·cs.LG·cross-listed: cs.AIquant-ph
ResQuNNs: Towards Enabling Deep Learning in Quantum Convolution Neural Networks
Authors
- Muhammad Kashif
- Muhammad Shafique
Abstract
In this paper, we present a novel framework for enhancing the performance of Quanvolutional Neural Networks (QuNNs) by introducing trainable quanvolutional layers and addressing the critical challenges associated with them. Traditional quanvolutional layers, although beneficial for feature extraction, have largely been static, offering limited adaptability. Unlike state-of-the-art, our research overcomes this limitation by enabling training within these layers, significantly increasing the flexibility and potential of QuNNs. However, the introduction of multiple trainable quanvolutional layers induces complexities in gradient-based optimization, primarily due to the difficulty in accessing gradients across these layers. To resolve this, we propose a novel architecture, Residual Quanvolutional Neural Networks (ResQuNNs), leveraging the concept of residual learning, which facilitates the flow of gradients by adding skip connections between layers. By inserting residual blocks between quanvolutional layers, we ensure enhanced gradient access throughout the network, leading to improved training performance. Moreover, we provide empirical evidence on the strategic placement of these residual blocks within QuNNs. Through extensive experimentation, we identify an efficient configuration of residual blocks, which enables gradients across all the layers in the network that eventually results in efficient training. Our findings suggest that the precise location of residual blocks plays a crucial role in maximizing the performance gains in QuNNs. Our results mark a substantial step forward in the evolution of quantum deep learning, offering new avenues for both theoretical development and practical quantum computing applications.
Submitted
14 February 20242 years ago
Version
v1
License
CC-BY-4.0
DOI
10.48550/arXiv.2402.09146
Summary
Researchers introduced trainable quantum convolutional layers and used residual connections (skip links between layers) to fix gradient flow problems, enabling deeper quantum neural networks to train effectively.
- Traditional quantum convolutional layers were frozen; this work makes them learnable, giving quantum networks more flexibility—like upgrading from fixed filters to adaptive ones in classical deep learning.
- Stacking trainable quantum layers breaks gradient flow during training; residual blocks act as gradient highways, solving a key engineering challenge in quantum machine learning.
- Empirical testing shows where to place residual blocks matters significantly, suggesting quantum-classical hybrid architectures need different design rules than purely classical networks.
curious · generated by claude-haiku-4-5
Chat with this PDF
Ask questions, probe assumptions, request a plain-English summary. Answers cite sections from the preprint itself.
Community
Questions and answers about this paper from other readers. No formal peer review — just a place to think out loud.