Qubit-Scalable CVRP via Lagrangian Knapsack Decomposition and Noise-Aware Quantum Execution

Curator's Take

This work tackles one of quantum computing's most pressing practical challenges: how to solve real-world optimization problems when current quantum devices have severe limitations on qubit count and gate fidelity. The researchers developed a clever decomposition method that breaks down the complex Vehicle Routing Problem into smaller quantum-friendly subproblems, then uses machine learning to adaptively manage both the mathematical optimization process and the hardware execution across different quantum processors. What makes this particularly significant is the end-to-end approach that treats quantum hardware limitations not as obstacles to work around, but as constraints to actively manage through intelligent resource allocation and noise-aware execution strategies. This represents a mature step toward practical quantum advantage in logistics optimization, moving beyond proof-of-concept demonstrations to address the messy realities of deploying quantum algorithms on today's imperfect hardware.

— Mark Eatherly

Summary

Hybrid quantum optimization for vehicle routing faces a practical bottleneck: direct QUBO encodings of CVRP quickly exceed near-term qubit and gate budgets, while quantum evaluations are expensive, noise-limited, and sensitive to backend and circuit configuration. We address this gap with an end-to-end decomposition pipeline that converts CVRP into bounded-width quantum subproblems and treats quantum execution as a decision problem within the optimization loop. Starting from a Fisher--Jaikumar assignment linearization, we apply Lagrangian relaxation to dualize customer-assignment couplers, yielding independent per-vehicle knapsack subproblems that admit QUBO/Ising evaluation. To replace brittle subgradient tuning, we learn a multiplier-update controller using expert-guided pretraining followed by reinforcement-learning fine-tuning, with rewards based on execution-realized progress and route reconstruction. We also introduce a constrained contextual bandit as a hardware-aware execution layer that selects backend and circuit configuration with feasibility screening, enabling adaptation across heterogeneous noisy resources and parallel multi-QPU scheduling. Computational results on multiple CVRPLIB families show that the decomposition yields stable bounded-width subproblems across instance sizes, learned multiplier updates improve end-to-end routing quality relative to classical subgradient control under matched budgets, and hardware-mode configuration reduces median optimality gaps relative to static execution choices in our test set. We do not claim quantum advantage. Instead, the contribution is a practical end-to-end framework for scaling hybrid quantum CVRP optimization through OR decomposition, learning-augmented dual control, and adaptive hardware-aware execution.

Read Full Article at arXiv Quantum Physics →