Curator's Take
This research tackles a fundamental scalability challenge in machine learning by leveraging quantum computing's representational advantages. While classical multi-task heads can exhibit quadratic parameter growth as tasks multiply (under the paper's capacity-matched analysis, where the shared representation widens with the task count), this quantum approach achieves linear scaling by using variational quantum circuits as shared prediction heads that encode multiple tasks within high-dimensional quantum state spaces. The work demonstrates that quantum machine learning is not just about potential speedups: it can offer genuine architectural advantages in parameter efficiency, a critical consideration as AI models grow increasingly large and expensive to train. This is an important step toward practical quantum advantage in machine learning, showing how quantum resources can be deployed strategically to relieve real computational bottlenecks rather than simply replacing classical components wholesale.
— Mark Eatherly
Summary
Multi-task learning (MTL) improves generalization and data efficiency by jointly learning related tasks through shared representations. In the widely used hard-parameter-sharing setting, a shared backbone is combined with task-specific prediction heads, but the task-specific parameters can grow rapidly with the number of tasks. Designing multi-task heads that preserve task specialization while improving parameter efficiency therefore remains a key challenge. In Quantum Machine Learning (QML), variational quantum circuits (VQCs) provide a compact mechanism for mapping classical data to quantum states in high-dimensional Hilbert spaces, enabling expressive representations within constrained parameter budgets.

We propose a parameter-efficient quantum multi-task learning (QMTL) framework that replaces conventional task-specific linear heads with a fully quantum prediction head in a hybrid architecture. The model consists of a VQC with a shared, task-independent quantum encoding stage, followed by lightweight task-specific ansatz blocks that enable localized task adaptation while keeping the parameterization compact. Under a controlled, capacity-matched formulation in which the shared representation dimension grows with the number of tasks, our parameter-scaling analysis shows that a standard classical head exhibits quadratic growth, whereas the parameter cost of the proposed quantum head scales linearly.

We evaluate QMTL on three multi-task benchmarks spanning natural language processing, medical imaging, and multimodal sarcasm detection. QMTL achieves performance comparable to, and in some cases exceeding, classical hard-parameter-sharing baselines, while consistently outperforming existing hybrid quantum MTL models with substantially fewer head parameters. Finally, we demonstrate that QMTL can be executed on noisy simulators and on real quantum hardware, illustrating its practical feasibility.
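To make the scaling comparison concrete, here is a minimal back-of-the-envelope version of the argument. The specific assumptions are mine, not the paper's: each task head emits $k$ outputs, and each task-specific ansatz block has a fixed depth acting on a fixed set of qubits, so its parameter count $p_{\text{task}}$ does not grow with the number of tasks $T$. With the shared representation dimension capacity-matched as $d = \Theta(T)$:

$$
P_{\text{classical}} \;=\; \sum_{t=1}^{T} \bigl(d\,k + k\bigr) \;=\; \Theta(T\,d) \;=\; \Theta(T^{2}),
\qquad
P_{\text{quantum}} \;=\; p_{\text{enc}} \;+\; \sum_{t=1}^{T} p_{\text{task}} \;=\; \Theta(T),
$$

where $p_{\text{enc}}$ is the one-time cost of the shared encoding stage. Even if the per-task block width were allowed to grow with a qubit count of $n = \lceil \log_2 d \rceil$, the total would be $O(T \log T)$, still well below quadratic.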
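Below is a minimal sketch of how such a head could be wired up in PennyLane. The gate choices (AngleEmbedding for the shared encoding, BasicEntanglerLayers for the task blocks), the qubit count, and the single-qubit readout are illustrative assumptions; the paper's actual circuit may differ.

```python
# Illustrative sketch of a quantum multi-task prediction head.
# Assumptions (not from the paper): AngleEmbedding as the shared
# encoding stage, BasicEntanglerLayers as each task-specific block.
import pennylane as qml
import numpy as np

N_QUBITS = 4      # width of the quantum head
N_TASKS = 3       # tasks sharing the same encoding stage
TASK_LAYERS = 2   # depth of each lightweight task-specific block

dev = qml.device("default.qubit", wires=N_QUBITS)

@qml.qnode(dev)
def quantum_head(features, task_weights):
    # Shared, task-independent encoding stage: maps the classical
    # backbone representation onto a quantum state.
    qml.AngleEmbedding(features, wires=range(N_QUBITS))
    # Task-specific ansatz block: the only part that differs per task.
    qml.BasicEntanglerLayers(task_weights, wires=range(N_QUBITS))
    # Single-qubit expectation value as the task's scalar prediction.
    return qml.expval(qml.PauliZ(0))

# One small weight tensor per task, shape (TASK_LAYERS, N_QUBITS):
# each task adds only TASK_LAYERS * N_QUBITS parameters, independent
# of how many tasks share the head.
task_params = [
    np.random.uniform(0, 2 * np.pi, size=(TASK_LAYERS, N_QUBITS))
    for _ in range(N_TASKS)
]

backbone_output = np.random.uniform(0, np.pi, size=N_QUBITS)  # stand-in features
predictions = [quantum_head(backbone_output, w) for w in task_params]
```

The design point to notice is that the per-task cost here is TASK_LAYERS × N_QUBITS parameters, a constant in the number of tasks, which is what drives the linear total in the scaling sketch above.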