4.7 Article

Absence of Barren Plateaus in Quantum Convolutional Neural Networks

期刊

PHYSICAL REVIEW X
卷 11, 期 4, 页码 -

出版社

AMER PHYSICAL SOC
DOI: 10.1103/PhysRevX.11.041011

关键词

-

资金

  1. U.S. Department of Energy (DOE) through a quantum computing program - Los Alamos National Laboratory (LANL) Information Science and Technology Institute
  2. Samsung GRP grant
  3. Laboratory Directed Research and Development program of LANL [20200677PRD1, 20190065DR]
  4. U.S. DOE, Office of Science, Office of Advanced Scientific Computing Research under the Accelerated Research in Quantum Computing program

向作者/读者索取更多资源

Analyzing the gradient scaling in QCNN architecture shows that this type of network does not exhibit barren plateaus, indicating that QCNNs are trainable even with random initialization. This result provides an analytical guarantee for the trainability of quantum neural networks.
Quantum neural networks (QNNs) have generated excitement around the possibility of efficiently analyzing quantum data. But this excitement has been tempered by the existence of exponentially vanishing gradients, known as barren plateau landscapes, for many QNN architectures. Recently, quantum convolutional neural networks (QCNNs) have been proposed, involving a sequence of convolutional and pooling layers that reduce the number of qubits while preserving information about relevant data features. In this work, we rigorously analyze the gradient scaling for the parameters in the QCNN architecture. We find that the variance of the gradient vanishes no faster than polynomially, implying that QCNNs do not exhibit barren plateaus. This result provides an analytical guarantee for the trainability of randomly initialized QCNNs, which highlights QCNNs as being trainable under random initialization unlike many other QNN architectures. To derive our results, we introduce a novel graph-based method to analyze expectation values over Haar-distributed unitaries, which will likely be useful in other contexts. Finally, we perform numerical simulations to verify our analytical results.

作者

我是这篇论文的作者
点击您的名字以认领此论文并将其添加到您的个人资料中。

评论

主要评分

4.7
评分不足

次要评分

新颖性
-
重要性
-
科学严谨性
-
评价这篇论文

推荐

Article Multidisciplinary Sciences

Nickel isotopic evidence for late-stage accretion of Mercury-like differentiated planetary embryos

Shui-Jiong Wang, Wenzhong Wang, Jian-Ming Zhu, Zhongqing Wu, Jingao Liu, Guilin Han, Fang-Zhen Teng, Shichun Huang, Hongjie Wu, Yujian Wang, Guangliang Wu, Weihan Li

Summary: During Earth's late-stage accretion, impactors delivered most of the volatiles, with nickel serving as an important tracer. Research has found that the BSE has a lighter nickel isotopic composition compared to chondrites, suggesting that this sub-chondritic signature was established during the Moon-forming giant impact.

NATURE COMMUNICATIONS (2021)

Article Multidisciplinary Sciences

Cost function dependent barren plateaus in shallow parametrized quantum circuits

M. Cerezo, Akira Sone, Tyler Volkoff, Lukasz Cincio, Patrick J. Coles

Summary: In this study, the authors rigorously prove that defining cost functions with local observables can avoid the barren plateau problem, while defining them with global observables leads to exponentially vanishing gradients. The results indicate a connection between locality and trainability in variational quantum algorithms (VQAs).

NATURE COMMUNICATIONS (2021)

Article Physics, Multidisciplinary

Reformulation of the No-Free-Lunch Theorem for Entangled Datasets

Kunal Sharma, M. Cerezo, Zoe Holmes, Lukasz Cincio, Andrew Sornborger, Patrick J. Coles

Summary: The NFL theorem limits one's ability to learn a function with a training dataset. Researchers show that entangled datasets in the quantum environment can lead to a violation of the NFL theorem, and that entanglement can reduce the fundamental limit on the learnability of a unitary.

PHYSICAL REVIEW LETTERS (2022)

Article Physics, Multidisciplinary

Trainability of Dissipative Perceptron-Based Quantum Neural Networks

Kunal Sharma, M. Cerezo, Lukasz Cincio, Patrick J. Coles

Summary: This article analyzes the gradient scaling performance of a recently proposed architecture called dissipative quantum neural networks (DQNNs), and finds that DQNNs can exhibit gradient vanishing. Moreover, we quantitatively bound the scaling of the gradient for DQNNs under different conditions and demonstrate that trainability is not always guaranteed.

PHYSICAL REVIEW LETTERS (2022)

Article Quantum Science & Technology

Equivalence of quantum barren plateaus to cost concentration and narrow gorges

Andrew Arrasmith, Zoe Holmes, M. Cerezo, Patrick J. Coles

Summary: This research investigates the relationship between cost function landscapes of parameterized quantum circuits (PQCs). It is analytically proven that the phenomena of exponentially vanishing gradients, exponential cost concentration about the mean, and the exponential narrowness of minima occur together. The key implication of this result is that BPs can be diagnosed numerically through cost differences instead of computationally expensive gradients.

QUANTUM SCIENCE AND TECHNOLOGY (2022)

Article Physics, Multidisciplinary

Inference-Based Quantum Sensing

C. Huerta Alderete, Max Hunter Gordon, Frederic Sauvage, Akira Sone, Andrew T. Sornborger, Patrick J. Coles, M. Cerezo

Summary: This article presents an inference-based scheme for quantum sensing, which allows accurate estimation of unknown parameters and determination of the scheme's sensitivity through measurements of the system response. The scheme is applicable to arbitrary probe states and measurement schemes, and remains effective in the presence of quantum noise.

PHYSICAL REVIEW LETTERS (2022)

Article Quantum Science & Technology

Variational quantum state eigensolver

M. Cerezo, Kunal Sharma, Andrew Arrasmith, Patrick J. Coles

Summary: In this study, we introduce the variational quantum state eigensolver (VQSE) method for dealing with exponentially large matrices of density matrix. By exploiting the connection between diagonalization and majorization, VQSE can accurately calculate the largest eigenvalues of the density matrix rho and the corresponding eigenvectors gate sequence V, with lower computational complexity.

NPJ QUANTUM INFORMATION (2022)

Article Computer Science, Interdisciplinary Applications

Theory of overparametrization in quantum neural networks

Martin Larocca, Nathan Ju, Diego Garcia-Martin, Patrick J. Coles, Marco Cerezo

Summary: A theoretical framework for quantum neural network (QNN) overparametrization and its impact on QNN design is established. The prospect of achieving quantum advantage with QNNs is exciting. Understanding how QNN properties, such as the number of parameters, affect the loss landscape is crucial for designing scalable QNN architectures.

NATURE COMPUTATIONAL SCIENCE (2023)

Article Quantum Science & Technology

Diagnosing barren plateaus with tools from quantum optimal control

Martin Larocca, Piotr Czarnik, Kunal Sharma, Gopikrishnan Muraleedharan, Patrick J. Coles, M. Cerezo

Summary: Variational Quantum Algorithms (VQAs) have received attention for their potential quantum advantage, but more research is needed on their scalability. This study proposes a framework using quantum optimal control to diagnose the presence of barren plateaus in problem-inspired ansatzes and proves that avoiding barren plateaus is not guaranteed for these ansatzes. The results provide a framework for trainability-aware ansatz design strategies without extra quantum resources and establish a link between barren plateaus and the scaling of the dimension of g.

QUANTUM (2022)

Article Computer Science, Interdisciplinary Applications

Challenges and opportunities in quantum machine learning

M. Cerezo, Guillaume Verdon, Hsin-Yuan Huang, Lukasz Cincio, Patrick J. Coles

Summary: Quantum machine learning, positioned at the intersection of machine learning and quantum computing, shows great potential in accelerating data analysis for quantum data and has wide applications. Challenges still remain regarding the trainability of quantum machine learning models, but through continuous research and exploration, the development potential in this field is evident.

NATURE COMPUTATIONAL SCIENCE (2022)

Article Quantum Science & Technology

Non-trivial symmetries in quantum landscapes and their resilience to quantum noise

Enrico Fontana, M. Cerezo, Andrew Arrasmith, Ivan Rungger, Patrick J. Coles

Summary: This paper analyzes the cost landscape for Parametrized Quantum Circuits (PQCs) and proves the exponential symmetry and resilience of these symmetries under noise. Based on these findings, the paper introduces an optimization method called Symmetry-based Minima Hopping (SYMH) which improves the optimizer performance in the presence of non-unital noise. Numerical simulations show that SYMH achieves performance comparable to current hardware.

QUANTUM (2022)

Article Quantum Science & Technology

Group-Invariant Quantum Machine Learning

Martin Larocca, Frederic Sauvage, Faris M. Sbahi, Guillaume Verdon, Patrick J. Coles, M. Cerezo

Summary: This study proposes a framework for designing quantum machine learning models based on underlying invariances, which can create models that adhere to symmetries. The effectiveness of the framework is demonstrated through theoretical results and examples, as well as the discovery of new algorithms.

PRX QUANTUM (2022)

Article Quantum Science & Technology

Covariance Matrix Preparation for Quantum Principal Component Analysis

Max Hunter Gordon, M. Cerezo, Lukasz Cincio, Patrick J. Coles

Summary: Principal component analysis (PCA) is a dimensionality reduction method that involves diagonalizing the covariance matrix of a dataset. Recently, quantum algorithms for PCA based on diagonalizing a density matrix have been proposed. However, a concrete protocol for encoding the covariance matrix as a density matrix has been lacking. In this study, we address this gap by providing a simple means for preparing the covariance matrix for arbitrary quantum datasets or centered classical datasets. We also propose a method for uncentered classical datasets, which we interpret as PCA on a symmetrized dataset. We demonstrate the effectiveness of our method through numerical experiments on the MNIST handwritten digit dataset and molecular ground-state datasets.

PRX QUANTUM (2022)

Article Physics, Multidisciplinary

Variational quantum algorithm for estimating the quantum Fisher information

Jacob L. Beckey, M. Cerezo, Akira Sone, Patrick J. Coles

Summary: This paper presents a variational quantum algorithm, VQFIE, for estimating the quantum Fisher information (QFI) of a mixed state. By estimating the lower and upper bounds on the QFI, VQFIE outputs a range in which the actual QFI lies, and can be used to prepare the state that maximizes the QFI for quantum sensing applications. Unlike previous approaches, VQFIE does not require knowledge of the explicit form of the sensor dynamics.

PHYSICAL REVIEW RESEARCH (2022)

Article Quantum Science & Technology

Connecting Ansatz Expressibility to Gradient Magnitudes and Barren Plateaus

Zoe Holmes, Kunal Sharma, M. Cerezo, Patrick J. Coles

Summary: Parametrized quantum circuits are a flexible paradigm for solving variational problems and programming near-term quantum computers. By extending the barren plateau phenomenon to arbitrary ansatze, we establish a fundamental relationship between expressibility and trainability, showing that highly expressive ansatze are more difficult to train.

PRX QUANTUM (2022)

暂无数据