Exponential quantum advantage in processing massive classical data

Curator's Take

This research represents a potential breakthrough in quantum machine learning by demonstrating that quantum computers with as few as 60 logical qubits could outperform classical machines requiring exponentially more resources for processing massive datasets. The key innovation lies in "quantum oracle sketching," which allows quantum computers to access classical data in superposition without the typical bottleneck of loading entire datasets into quantum memory. What makes this particularly compelling is that the authors validated their theoretical advantages on real-world applications like genomics data and sentiment analysis, showing 4-6 orders of magnitude improvements in computational efficiency. If these results hold up to scrutiny, they could finally provide the elusive "quantum advantage" for classical machine learning tasks that has remained one of the field's most sought-after goals.

— Mark Eatherly

Summary

Broadly applicable quantum advantage, particularly in classical data processing and machine learning, has been a fundamental open problem. In this work, we prove that a small quantum computer of polylogarithmic size can perform large-scale classification and dimension reduction on massive classical data by processing samples on the fly, whereas any classical machine achieving the same prediction performance requires exponentially larger size. Furthermore, classical machines that are exponentially larger yet below the required size need superpolynomially more samples and time. We validate these quantum advantages in real-world applications, including single-cell RNA sequencing and movie review sentiment analysis, demonstrating four to six orders of magnitude reduction in size with fewer than 60 logical qubits. These quantum advantages are enabled by quantum oracle sketching, an algorithm for accessing the classical world in quantum superposition using only random classical data samples. Combined with classical shadows, our algorithm circumvents the data loading and readout bottleneck to construct succinct classical models from massive classical data, a task provably impossible for any classical machine that is not exponentially larger than the quantum machine. These quantum advantages persist even when classical machines are granted unlimited time or if BPP=BQP, and rely only on the correctness of quantum mechanics. Together, our results establish machine learning on classical data as a broad and natural domain of quantum advantage and a fundamental test of quantum mechanics at the complexity frontier.

Read Full Article at arXiv Quantum Physics →