AG-2024.06-2401·quant-ph
qLUE: A Quantum Clustering Algorithm for Multi- Dimensional Datasets
Authors
- Dhruv Gopalakrishnan
- Luca Dellantonio
- Antonio Di Pilato
- Wahid Redjeb
- Felice Pantaleo
- Michele Mosca
Abstract
Clustering algorithms are at the basis of several technological applications, and are fueling the development of rapidly evolving fields such as machine learning. In the recent past, however, it has become apparent that they face challenges stemming from datasets that span more spatial dimensions. In fact, the best-performing clustering algorithms scale linearly in the number of points, but quadratically with respect to the local density of points. In this work, we introduce qLUE, a quantum clustering algorithm that scales linearly in both the number of points and their density. qLUE is inspired by CLUE, an algorithm developed to address the challenging time and memory budgets of Event Reconstruction (ER) in future High-Energy Physics experiments. As such, qLUE marries decades of development with the quadratic speedup provided by quantum computers. We numerically test qLUE in several scenarios, demonstrating its effectiveness and proving it to be a promising route to handle complex data analysis tasks -- especially in high-dimensional datasets with high densities of points.
Submitted
29 June 20241 year ago
Version
v1
License
CC-BY-4.0
DOI
10.48550/arXiv.2407.00357
Summary
A new quantum clustering algorithm called qLUE scales linearly with both the number of data points and their density, potentially offering significant speedups over classical methods that struggle with high-dimensional, densely-packed datasets.
- Classical clustering algorithms face a scaling bottleneck: they run in linear time relative to points but quadratic time relative to local density, making them slow for crowded high-dimensional data.
- qLUE adapts a physics-inspired classical algorithm (CLUE) using quantum computing's natural quadratic speedup, achieving linear scaling in both dimensions—a theoretical advantage for analyzing complex datasets.
- The algorithm was designed for particle-detector data analysis in future physics experiments but could accelerate machine learning and data science tasks wherever high-density, high-dimensional clustering is needed.
curious · generated by claude-haiku-4-5
Chat with this PDF
Ask questions, probe assumptions, request a plain-English summary. Answers cite sections from the preprint itself.
Community
Questions and answers about this paper from other readers. No formal peer review — just a place to think out loud.