Anti-Concentration for the Unitary Haar Measure and Applications to Random Quantum Circuits
Abstract
We prove a Carbery-Wright style anti-concentration inequality for the unitary Haar measure, by showing that the probability of a polynomial in the entries of a random unitary falling into an range is at most a polynomial in . Using it, we show that the scrambling speed of a random quantum circuit is lower bounded: Namely, every input qubit has an influence that is at least inverse exponential in depth, on any output qubit touched by its lightcone. Our result on scrambling speed works with high probability over the choice of a circuit from an ensemble, as opposed to just working in expectation.
As an application, we give the first polynomial-time algorithm for learning log-depth random quantum circuits with Haar random gates up to polynomially small diamond distance, given oracle access to the circuit. Other applications of this new scrambling speed lower bound include:
-
An optimal depth lower bound for -approximate unitary designs on any circuit architecture;
-
A polynomial-time quantum algorithm that computes the depth of a bounded-depth circuit, given oracle access to the circuit.
Our learning and depth-testing algorithms apply to architectures defined over any geometric dimension, and can be generalized to a wide class of architectures with good lightcone properties.
Keywords and phrases:
Haar measure, anti-concentration, random quanytum circuit, learningCopyright and License:
2012 ACM Subject Classification:
Theory of computation Quantum computation theoryAcknowledgements:
The authors thank Anurag Anshu, Adam Bouland, Lijie Chen, Jonas Haferkamp, Robert Huang, Issac Kim, Yunchao Liu, Tony Metger, Marcus Michelen, Tommy Schuster and Kewen Wu for helpful comments and discussions, and thank Jacob Watkins for pointing out a mistake in our previous version of Section 5.3.2.Funding:
B.F., S.G., and W.Z. acknowledge support from AFOSR (FA9550-21-1-0008). This material is based upon work partially supported by the National Science Foundation under Grant CCF-2044923 (CAREER), by the U.S. Department of Energy, Office of Science, National Quantum Information Science Research Centers (Q-NEXT) and by the DOE QuantISED grant DE-SC0020360.Editor:
Shubhangi SarafSeries and Publisher:
Leibniz International Proceedings in Informatics, Schloss Dagstuhl – Leibniz-Zentrum für Informatik
1 Introduction
Random quantum circuits are one of the most popular paradigms of quantum computation in the near-term era. They are well-studied, both theoretically and experimentally, in the context of quantum advantage demonstrations: e.g., see [6, 8, 3, 7, 43, 44, 21]. They also have numerous applications in areas like benchmarking, like in [19, 37], in cryptography, as in e.g., [4, 2, 50], and in the modeling of physical objects like black holes, as in e.g., [30, 49, 52].
One reason random quantum circuits are extensively studied is because they are rapid “scramblers” of information. Intuitively, this means that the output state it generates has non-trivial correlations across spatially separated qubits. The rate of scrambling depends on the depth – the deeper the circuit is, the better it is at scrambling. However, even though previous works on scrambling have put upper bounds on the scrambling speed of random circuits – see, e.g., [28, 11, 46, 27, 33, 13, 14, 23, 42, 36, 16, 51, 17, 53] – lower bounds on the scrambling speed are not well studied.
In this work, we give a lower bound on the speed with which random quantum circuits can scramble information. Our main theorem is as follows:
Theorem 1.
Let be a random quantum circuit with a fixed architecture, where each gate is a -qubit independent Haar random unitary. Let and be a pair of input and output qubits connected by layer of gates. Arbitrarily fix the inputs to , except the qubit , and let be the channel that maps to .
Then for every , with probability at least over the following holds: For every two single-qubit states and ,
| (1) |
where denotes the Frobenius norm, and is a constant that depends only on .
In particular, our main result gives a lower bound on the influence that changing an input qubit has on a designated output qubit inside the lightcone of that input qubit and shows that it decays at most exponentially fast with depth. We articulate some points that make Theorem 1 useful for our applications:
-
The bound in Theorem 1 is uniform, in the sense that ratio does not depend on how the input qubits are chosen, and is applicable to any circuit architecture. Moreover, the probability quantifier over the circuit is stated before the choices of and , which is specifically important for our application on learning random quantum circuits (Theorem 5).
-
Although we stated the theorem using Frobenius norm, the norm is only over single-qubit states and thus is equivalent to any other matrix norm.
-
To obtain the strongest bound, we can choose to be the shortest path from to . When is outside the lightcone of , we think of , in which case the statement still holds.
-
Our bound in Theorem 1 is tight, in the sense that one could prove a matching upper bound
(2) for various architectures with a different constant , by calculating the moment . Such calculation can be done by analyzing the Markov chain of Pauli operators, using the results in [28, 11, 45]. However, the moment method could not yield our lower bound, which we will explain in details below.
Implications on proving typicality statements.
Observe that Theorem 1 is a statement that works with high probability over the choice of circuit from the ensemble , as opposed to just in expectation over the ensemble . To the best of our knowledge, this is the first typicality result regarding the scrambling behavior of random quantum circuits.
Most of the previous work studying random quantum circuits examines their properties by computing some specific quantities of interest in expectation, such as entanglement entropy, OTOC and collision chance, by relating the quantities with the lower moments (usually the second moment) of the quantum circuit. For non-negative quantities the expectation already provides an upper bound that holds with high-probability (like in (2)) by Markov’s inequality. But for proving a strong typicality result as in (1), one also needs a corresponding lower bound, which is commonly proved by calculating the variance of the desired quantity . In particular, we would want that for some appropriately small , and for some ,
| (3) |
However, for most architectures a relation like (3) is largely unknown. One reason is that when is already a second moment of the quantum circuit, and proving (3) requires computing the fourth or higher moments, which is extremely difficult even for standard brickwork circuits [9]. More importantly, in our case, can be heuristically approximated by the product of i.i.d random variables that follow some distribution with constant variance; because of this, we cannot expect (3) to hold as will be exponentially larger than .
In this work, we overcome this barrier by performing a more fine-grained analysis. We do not analyze the randomness present in the choice of gates all at once; Instead, we break up the randomness of the circuit into layers and lower bound the random variable with a product of random variables that are highly correlated. We then decouple these random variables using a new anti-concentration inequality on the unitary Haar measure, which will be stated in Theorem 6. This allows us to avoid calculating higher moments of the circuit, and still retrieve information about the distribution of that could be lost in the moments. Our result in Theorem 1 directly implies typicality guarantees on other quantities, such as the purity of a single output qubit, and we believe the method we use can be generalized to prove typicality results for many other useful quantities.
1.1 Applications
Intuitively, Theorem 1 means that the output state of random quantum circuits with logarithmic depth carries a signal pertaining to its input state that can be extracted from the output state using single qubit quantum state tomography. This allows us to show many applications.
1.1.1 Optimal 2-design lower bounds
First, as an immediate corollary of Theorem 1, we show a depth lower bound for random quantum circuits, defined on any architecture, forming approximate unitary designs; see Section 5.1.
Theorem 2.
Let be a random quantum circuit, defined on an arbitrary fixed architecture of minimum depth , where each gate is a -qubit independent Haar random unitary. If is an -approximate -design, then
Here the minimum depth means the smallest number of gates that any path from input to output goes through, resulting in a stronger requirement than the previous depth lower bounds that uses the lightcone size property. Theorem 2 shows that previous works on approximate -design constructions, e.g. [28, 32, 24, 15, 50], are optimal in , assuming the gates are Haar random unitaries. And combined with the depth lower bound in [18, 50], it means that the approximate -design construction of depth from [50] is optimal in both and .
1.1.2 Testing for depth
Our second application concerns testing the depth of a random circuit when the depth is promised to be at most logarithmic.
Theorem 3.
Let be a brickwork random quantum circuit on qubits of an unknown depth promised to be bounded as , where each gate is independently Haar random. Given oracle access to , there is a polynomial time algorithm that outputs with probability .
Although Theorem 3 is stated with brickwork circuits for simplicity, it is applicable to many other architectures satisfying the following: For a random circuit of depth , the lightcone size is strictly smaller than that of a random circuit of depth larger than . Hence, there will be a particular output qubit which will be insensitive to a change in a particular input qubit in the former case and will be sensitive in the latter case. For this difference is inverse polynomially large and can be tested by state tomography. The explicit algorithm is given in Section 5.2.
Note that our algorithm outputs the exact depth instead of obtaining an approximation, in contrast with the recently proposed depth test algorithm in [26]. This is partially due to the fact that our algorithm works with high probability with respect to the choice of a circuit from the ensemble, as opposed to working only in expectation, unlike [26].
1.1.3 Learning brickwork random circuits
Third, we show that Theorem 1 allows us to learn brickwork random circuits of logarithmic depth. We start by showing that the first layer of gates can be learned given oracle access to the circuit.
Theorem 4.
Let be a brickwork random quantum circuit on qubits of known depth , where each gate is independently Haar random. Given oracle access to , there is a polynomial time algorithm that with high probability outputs each gate in the first layer with polynomially small error.
Furthermore, in real life scenarios we can assume the distribution over the gates is a discrete approximation of the Haar measure (see Definition 28), and in this case we can actually learn the entire circuit:
Theorem 5.
Let be a brickwork random quantum circuit on qubits of known depth , where each gate is independently drawn from a discretized version of the Haar measure. Given oracle access to , there is a polynomial time algorithm that with high probability outputs with polynomially small error.
We will prove Theorem 4 and Theorem 5 in Section 5.3. Note that the learning algorithm in Theorem 5 is proper, that the learnt circuit has the exact same depth and architecture as the actual circuit .
Before this work, the state-of-art learning algorithm for brickwork quantum circuits is due to [31, 35], which runs in polynomial time for circuits of -dimensional geometry up to depth, and is not proper (that the outputted circuit has depth larger than the input circuit by at least a constant factor). Our algorithm works for all geometric dimensions, works up to depth, and still has polynomial runtime. There is evidence that the depth dependence of our learning algorithm is optimal, because random circuits form high enough designs beyond depth [50], and possibly also pseudorandom unitaries, which hints at learning hardness.
Here we provide an informal description of the learner:
-
For each gate in the first layer, the learner iteratively searches through the gate set for the correct gate. There’s only polynomially many gates in the gate set because we chose an appropriate -net over -qubit Haar random unitaries. During each round, the learner applies the inverse of a gate chosen from the gate set at the particular location it wishes to learn.
-
Select an input qubit to the gate we are trying to learn. The key observation is that, depending on whether we applied the correct inverse, the output lightcone will consist of different qubits. In particular, one qubit will lie outside the lightcone and be unaffected by a change in the selected input qubit if we hit the correct inverse, as opposed to if we didn’t. As a corollary of Theorem 1, in the former case, we can detect the influence of the input qubit by performing single qubit state tomography.
Figure 1: An illustration of the learning algorithm in Theorem 5, where we try to uncompute the gate by applying some two-qubit unitary before it. Left: When failed to uncompute , the lightcone of the input qubit touches an output qubit at distance . Right: When , the output qubit falls out of the lightcone and will not be affected by the change in .
Note that the learner of [31, 35] uses a very different strategy: it looks at each output qubit, and then employs a brute force search over all possible lightcones of that circuit to find the correct one. Thereafter, it stitches together all the different local lightcones of every qubit, and then corrects for the errors in the overlapping regions of the different lightcones. The fact that we do not do a brute force search over all possible lightcones makes our algorithm efficient for any dimension, and not just -dimensional brickwork circuits.
1.2 Main Technical Tool: Anti-Concentration for Haar Measure
The intuition behind the proof of Theorem 1 is the following: We consider the path of qubits in the circuit , where gate has as an input and as an output. Let be the corresponding qubits when the input is , and we want to bound the ratios and hence their product.
It turns out that we can prove the lower bound , where is a polynomial function over the entries of in its matrix representation. Therefore, we only need to show that is often not too small, when is a Haar random unitary. In other words, we need to show that the polynomial does not concentrate around zero. Our main technical contribution is the following theorem which proves this anti-concentration phenomenon:
Theorem 6.
Let be a Haar random unitary matrix, and let be a degree- polynomial on the entries of and . Then for every , it holds that
where and are constants that depend only on and .
The anti-concentration inequality of polynomials over Gaussian random variables was famously proved by Carbery and Wright [12], and their result actually applies to any log-concave distribution over . However, as the Haar measure does not even have a convex support, the proof techniques in [12] does not apply. The alternative inductive proof in [38] also fails to apply, as we are dealing with correlated input variables. We present a very different inductive proof in Section 3.
Note that if we consider as a complex random variable, and define the complex variance
then we obtain the form closer to the classical anti-concentration inequalities:
Corollary 7.
Let be a Haar random unitary matrix, and let be a degree- polynomial on the entries of and . Then for every and every , it holds that
However in this work we will not use the form in Corollary 7, as Theorem 6 suffices for our applications.
Remark 8.
Unlike the Carbery-Wright inequality which is dimension-free, meaning the right hand side is and does not depend on , we have and in our proof. In fact, using the concentration bounds it is not hard to show that must depends polynomially on . However, we conjecture that could be independent of , in which case the result would be applicable to random quantum circuits with gates of higher locality.
1.3 Related Works
Concentration phenomenon on unitary Haar measures has been extensively studied, and the readers can refer to [39] for a comprehensive review of the results. In comparison, much less has been shown for the reverse direction, namely the anti-concentration inequalities.
One common way to prove such inequalities is by calculating higher moments and apply the Paley-Zygmund inequality, which was indeed used for showing the anti-concentration property of output distributions of Haar random unitaries and random quantum circuits [1, 25, 18]. However, the inequality proved this way is not strong enough for our applications, while calculating moments of a random quantum circuit is also non-trivial and depends highly on the architecture [22, 9]. Instead, we resort to prove a general anti-concentration inequality for polynomials, whose theory has been well developed for Gaussian distributions [12] and product distributions (namely the Littlewood-Offord theory) and has found numerous applications in computational complexity theory [41, 40, 34]. Our inductive proof of Theorem 6 also shares a similar spirit with the elementary proof of Carbery-Wright inequality in [38].
Multiple notions of scrambling property of random quantum circuits has been previously studied. In particular, [11] showed that in a random circuit consists of sequential applications of Haar random gates on a complete graph of qubits, every subset of qubits is polynomially close to maximally mixed with high probability for some constant . Our Theorem 1 can be viewed as a result in the reverse direction which bounds the scrambling speed of such random circuits. Specifically, at least sequential gates are required, as otherwise with high probability a pair of input and output qubits are depth apart due to a generalization of the coupon-collector problem [48]. It is also reasonable to believe that our method of proving Theorem 1, via the anti-concentration inequality, is applicable to obtain lower bounds for other measures of scrambling such as entanglement and out-of-time-ordered correlation (OTOC) [45, 46, 5, 27]. Upper bounds in the above-mentioned works are obtained by calculating moments and analyzing the averaged Markov chain on Pauli operators, which are not sufficient to prove lower bounds in the typical case.
We also review some previous works related to our applications and clarify the connections. For approximate unitary designs, many previous constructions, for example [28, 10, 24, 29, 15], employed Haar random unitary gates and achieved the optimal dependence on . The construction of depth in the recent work of [50] also falls into this category. Meanwhile, they also proposed a construction of approximate 3-design with only depth. This does not contradict our lower bound Theorem 2 as the construction uses random Clifford unitaries, which are not independent between layers and also fail the anti-concentration property in Theorem 6. It is intriguing, however, to see if our argument can be extended to show a matching depth lower bound.
The depth test algorithm in [26] is based on the entanglement dynamics of random circuits and implemented with their Bell sampling framework. For brickwork circuits, as there is a constant gap between the upper and lower bounds for the entanglement entropy in the typical case, their algorithm gives constant approximation of the depth. For general architectures, the algorithm in [26] also requires knowledge of the entanglement velocity, while our algorithm for Theorem 3 only relies on a specific property of the architecture that the lightcone strictly expands with depth.
The learning algorithm for shallow quantum circuits in [31] is based on the idea of brute-force enumerating all possibilities in a light cone, and stitching the parts together. Therefore, their algorithm works any quantum circuit in worse case, and has complexity exponential in the lightcone size. This means that in order to have polynomial efficiency, the depth has to be for -dimensional geometrically local circuits and for general architectures. However, the learning algorithm is improper and the outputted circuit has a large polynomial blow-up in depth. The blow-up factor could be reduced to constant, at the cost of lowering the depth bound to , and thus could not perform in polynomial time for log-depth circuits even for 1-dimensional geometry. In comparison, our algorithm works for random quantum circuits, on the fixed brickwork or similar geometrically local architecture, where neighboring qubits can be distinguished by their lightcones. The strength of our algorithm Theorem 5 is that it works up to logarithmic depth in polynomial time regardless of the geometric dimension, and it learns properly in that it outputs a circuit with the exact same architecture. This makes our result incomparable with [31].
We also mention that, there is a different learning task where instead given oracle access to the circuit , the learner is only given copies of the state , and needs to learn a circuit that prepares the same state with error in trace distance. Our algorithm relies on having different inputs and hence does not work in this case, whereas [31] gave a quasi-polynomial efficiency algorithm for 2D and the algorithm was extended to higher dimensions in [35].
2 Preliminaries
We start with some basic notations. We use to denote the unitary group of dimension , and use to denote the Haar measure over . We use Greek letters such as to denote density matrices of quantum states. We use and for trace norm and Frobenius norm, and for diamond distance between unitary channels, defined as
with maximum taken over all density matrices .
A circuit architecture determines the positions of gates in the circuit. We define the depth of an architecture as follows.
Definition 9.
In a circuit architecture, a path in space-time between two qubits and is a sequence of qubits where for each , there is a gate in the circuit that has as an input and as an output.
We say the architecture has minimum depth , if there exists a pair of input and output qubits connected by a path of length , and every other path from input to output has depth at least .
A specific architecture of interest is the (1-dimensional) brickwork architecture:
Definition 10.
A brickwork quantum circuit on qubits of depth consists of layers of gates, where on layer , there is a two-qubit gate acting on the -th and -th qubit if and only if and have the same parity.
The brickwork architecture could be generalized to higher dimensional geometry, and our results still hold for any constant dimension. However, for simplicity we stick with the 1-dimensional architecture in this paper.
We will need the following statements about quantum state tomography on single qubits for our algorithms (see e.g. [47])
Proposition 11.
Given access to copies of a single-qubit state , one can output an estimation with in time.
The following simple lemma is particularly useful, which bounds the difference between states through a channel:
Lemma 12.
Let be a quantum channel that takes qubits as the input. For every input states and , we have
As we are frequently dealing with differences between quantum states, here we present some facts about the space of such differences. We start from the Bloch sphere presentation of a single-qubit state:
| (4) |
where
| (5) |
Then the difference between two single-qubit states can be written in the Pauli basis
| (6) |
Therefore, if we view under the coordinate system , then the set of all possible single-qubit differences can be identified with a ball of radius in . The Euclidean space is equipped with the standard trace inner product , so that the norm coincides with the Frobenius norm.
More generally, let be the set of all possible differences between two -qubit states and . The difference can be written as a real linear combination over the Pauli basis
where the identity is removed as . Since the Pauli basis are orthonormal, we can think of as a subset of the Euclidean space (although the set is much more complicated than a ball for ). The Euclidean space is also equipped with the standard trace inner product and the Frobenius norm. As a result, a quantum channel with -qubit input and -qubit output induces a linear map from to :
which is also a real linear map from to .
3 Anti-concentration Bound
In this section we prove Theorem 6. For simplicity, we introduce the notion of semi-polynomials: A function is a degree- semi-polynomial in complex variables , if it is a polynomial in of degree at most . Now Theorem 6 is implied by the following more general form:
Theorem 13.
Let , and be a degree- semi-polynomial. Suppose that takes as inputs the entries of the first columns of an unitary matrix, and that the value of is always a non-negative real number over this domain. Then for ,
holds for every , where and are constants that depend only on and .
To see that Theorem 13 implies Theorem 6, it suffices to take and notice that is a degree- semi-polynomial that is always non-negative. We prove Theorem 13 via induction, and the proof is divided into four stages.
3.1
We start with the simplest case when . In this case is a single-variable degree- semi-polynomial over the unit circle . Since over this domain, assuming we can write as
where is a degree polynomial in . Without loss of generality we can assume that , and therefore
| (7) |
Then we can bound the expectation of as
| (8) |
On the other hand, if for some , it is easy to show that
| (9) |
Thus if holds for all , then
| (10) |
That means, if we take , then only happens when falls into one of the -balls around some . Each -ball intersect with the unit circle as an arc of angle at most , and thus has measure at most under the Haar measure over the unit circle. Therefore we conclude that
| (11) |
and we can take and .
3.2 , Alternative Distribution
For the sake of later use, we also need a version where follows the distribution of the first coordinate of a Haar-random unit vector , . In this case no longer holds, and we need an alternative method.
For every , we define
where the expectation is over a Haar-random with . Notice that a monomial in has expectation unless , and when we have . That means is a degree- polynomial in .
Notice that follows the Beta distribution , with the density function
and under this distribution coincides with .
With the analysis in the previous section which also works on , we can show that if we take , then only happens when falls into one of the -balls around complex roots of , which are intervals of length at most on the real line. Since the density function of has a maximum of , we have
| (12) |
On the other hand, applying Equation 11 from the previous section on for every fixed directly provides
| (13) |
Thus by a union bound we have
| (14) |
We note that not only this result will be used in the next stage, the technique itself will also be reapplied multiple times in the later proofs.
3.3
In this stage we consider with general , and thus the inputs to is a Haar-random unit vector . The strategy is to use induction on , and show that with high probability over the choice of ,
is not too small conditioned on the fixed . To handle this, we need the following lemma.
Lemma 14.
If is a degree- semi-polynomial, and is a Haar-random unit vector, then is a degree- semi-polynomial on .
Proof.
Omitted in this version.
Since is an expectation over , it is also always non-negative and has the same expectation as . Thus Equation 14 from the previous section gives
| (15) |
Now we fix some , and consider the degree- semi-polynomial
which is always non-negative. Since for some , and follows the Haar measure over the unit sphere in , we can apply the induction hypothesis for on to get
| (16) |
Therefore we conclude that, for every ,
| (17) |
We can take
and .
3.4
Now we handle the general case when the inputs to the semi-polynomial consist of columns of a Haar random unitary. The proof is similar to the last stage, using the fact that the input distribution can be viewed as a unitary-invariant distribution over orthonormal vectors in . In particular, let the vectors be , and we consider the function
Here is a Haar-random vector over the unit sphere in the -dimensional orthogonal subspace of . We would like to show that is a semi-polynomial in order to use induction, and we first show it with replaced with a Gaussian distribution.
Lemma 15.
Let be a set of orthonormal vectors in , and let distributed as a standard -dimensional complex Gaussian in the orthogonal subspace of .
Let be a degree- semi-polynomial in . Then
is a degree- semi-polynomial in the entries of .
Proof.
Omitted in this version.
Corollary 16.
is a degree- semi-polynomial in the entries of .
Proof.
Notice that is equidistributed as , where is the Gaussian in Lemma 15. In other words, where follows a fixed distribution independent of .
Now consider each monomial in , and let be the part of monomial over entries of , then it suffices to show that is a degree- semi-polynomial in . Lemma 15 already showed this for , and since is a monomial of degree , we have
| (18) |
where is a non-zero constant irrelevant to the choice of . Therefore is also a degree- semi-polynomial in .
Since is an expectation over , it is also always non-negative and has the same expectation as . Thus the induction hypothesis gives
| (19) |
Now we fix some and consider the degree- semi-polynomial
which is always non-negative. Since there exists a linear map such that , where is a Haar-random unit vector in , we can apply Equation 17 from the previous stage for on to get
| (20) |
Therefore we conclude that, for every ,
| (21) |
Similar to the previous stage, we can take
and . This completes the proof of Theorem 13.
4 Mixing Bound for Random Circuits
In this section we prove Theorem 1. Since the output qubit is in the lightcone of the input qubit , there exists gates in the circuit that connect the input and output qubits. That is, there are qubits with and , such that the gate has as an input and as an output. Each gate is independently drawn from the Haar measure .
Note that even when gate is given, we cannot directly claim any relationship between and , as the other input qubits to are correlated and possibly entangled with . To handle this, we let be the composite state of the input qubits to , and by Lemma 12 we have
| (22) |
where and are the corresponding states when the input state is . Therefore we only need to bridge the remaining gaps by proving
| (23) |
where is some function of , and then bound the distribution of by applying the anti-concentration bound from Theorem 6.
At a first glance, this looks impossible as is a -qubit state while is single-qubit, which means that as long as , whatever is, there will be input states to with the output qubits . The key observation is that the states cannot be arbitrary -qubit states: Since all the input qubits to the circuit are fixed except , when the circuit is given, each is the result of through a fixed quantum channel. As the difference ranges within the 3-dimensional Euclidean space , the difference could also only range within a 3-dimensional subspace of .
This allows us to define and bound the distribution of that uniformly holds for every such subspace as follows:
Lemma 17.
For every , there exists a distribution over that satisfies:
-
There exists such that , where we use as shorthand for .
-
For every fixing of the input qubits other than , and layers of the circuit , there exists a function at layer such that
holds for all input states and , and when .
We will prove Lemma 17 in Section 4.1. For now let us show how Lemma 17 would imply Theorem 1.
Proof of Theorem 1.
From Lemma 17 and Equation 22 we get
| (24) |
for every . Here each function depends on the previous layers, but as random variables they are independent, since follows the same distribution no matter how are fixed.
Since and , we have
| (25) |
To bound the product of we use Markov’s inequality, which states that for every ,
| (26) |
where we take to be the constant in Lemma 17. Take such that , we conclude that with probability at least ,
4.1 Proof of Lemma 17
The basic idea of the proof is to lower bound the ratio with a polynomial function on the entries of and apply Theorem 6.
The quantum channel defined by that maps to induces the linear map
The domain of is a 3-dimensional subspace of which we denote as , while the range of is . Notice that the map is completely determined by the domain and the gate , while depends only on the fixed inputs to the circuit and the gates of in layers .
Since both the domain and the range are Euclidean spaces, the absolute determinant is independent of the choices of bases when writing as matrix in . We will show that is basically the lower bound that we seek for, via the following propositions whose proof can be found in the full version.
Proposition 18.
It always holds that
Proposition 19.
For each fixed domain , is a degree-6 semi-polynomial in the entries of .
Proposition 20.
There exists a constant such that for every possible domain ,
Now we are ready to prove Lemma 17.
Proof of Lemma 17.
Applying Theorem 6 with Propositions 19 and 20 gives
| (27) |
which holds for every domain and every . We define as the distribution over with the following cumulative function:
Take and let , then we have
| (28) |
Now suppose the input qubits other than are fixed, and the layers of the circuit is given. This fixes the domain for , and provides the cumulative function
Notice that is continuous and non-decreasing, and by defining its inverse on , we can show that for every ,
We then define the function as follows: For each , let be the smallest such that
When , we have since
| (29) |
We also have since holds for all . Combined with Proposition 18 we get
5 Applications
5.1 Depth Lower Bound for Approximate Designs
In this section we prove Theorem 2. We first recall the definition of an approximate unitary design.
Definition 21.
For a distribution over and , define the moment superoperator as the following channel:
The distribution is an -approximate unitary -design if
Note that there are several other definitions of the approximate design (see e.g. [10]), and we choose the weaker one so that our Theorem 2 is still compatible with the stronger definitions of designs.
Proof of Theorem 2.
Without loss of generality, let us assume that the first input qubit and the first output qubit are depth apart. We fix all other input qubits to be maximally mixed, so that when is also maximally mixed, the entire output is maximally mixed regardless of the circuit , and thus . On the other hand, when , we know the following about the output qubit via Theorem 1 that with probability over the circuit ,
| (30) |
We can expand the left hand side of Equation 30 as
| (31) |
Since always holds, we get
| (32) |
Now imagine feeding two copies of the input state to the superoperators and , and apply a swap test on the first output qubits in the two copies. The output probability is determined by . We know already from [20] that when going through an -qubit Haar random unitary, we have . Therefore, we conclude that the difference which means that .
5.2 Depth Test
In this section we prove Theorem 3, where we learn the exact depth of a brickwork random circuit .
The processed is described in Algorithm 1. Here is the constant in Theorem 1 with for brickwork circuits, and is the target error probability. Notice that in a brickwork circuit of depth , the -th output qubit lies outside the lightcone of the first input qubit. Therefore, when the algorithm iterates to the correct depth , we have and must be returned even when both and are estimated with error.
On the other hand, when is smaller than the actual depth, the -th output qubit lies inside the lightcone of the first input qubit. By Theorem 1, with probability over we have
| (33) |
This means when both and are estimated with error, we still have . By a union bound over , with probability all those smaller than the actual depth will be skipped, and thus the outputted depth is correct.
Note that the efficiency of the algorithm depends on the single-qubit tomography process, which by Proposition 11 is . As a conclusion, we obtain the following more general statement which implies Theorem 3:
Theorem 22.
Let be a brickwork random quantum circuit of an unknown depth , where each gate is independently Haar random.. Given oracle access to , for any , Algorithm 1 outputs with probability at least in time .
Remark 23.
The only property of the brickwork architecture we used here is that the set of qubits within the lightcone of an input qubit is strictly expanding when the depth grows, which allows us to distinguish between different depths. Therefore the algorithm can be easily modified, with the same efficiency, to work with higher dimensional brickwork circuits and other architectures.
5.3 Learning Brickwork Random Circuits
5.3.1 Learning the First Gate
Here we prove Theorem 4, where we learn the gate , namely the gate in the first layer acting on the first and second qubit, in a brickwork circuit of depth with Haar random gates. The same arguments also work for other gates in the first layer.
To learn , we try to uncompute by first apply some two-qubit unitary and then apply the circuit . We distinguish whether is close to or not using the similar idea as in Section 5.2. Specifically, if so that perfectly uncomputes the gate , then the -th output qubit will lie outside the lightcone of the first input qubit. Note that this is also true when cancels into unentangled single-qubit gates, that is when there exists that . In contrast, when does not cancel into unentangled single-qubit gates, the first two qubits will be entangled after the first layer of gates, and thus the -th output qubit will be affected when the first input qubit changes.
To put the above intuition more formally, we first define the distance between two-qubit gates when taking the quotient over unentangled single-qubit gates:
Definition 24.
For , we define the distance
We note that , like the diamond distance , is a pseudometric on . That is, a metric except that two distinct unitaries could have distance zero. However, if and only if for some .
The learning algorithm is described in Algorithm 2, with being the target error rate. Notice that within each loop of , the second input qubit is fixed, while the first one changes with a difference that goes through the Pauli basis. The input qubits and are Pauli eigenstates except when , which can be obtained by randomly choosing or .
To prove the correctness of the algorithm, we need the following lemmas that connects the distance with the behavior of over the Pauli basis.
Lemma 25.
For , if then for every and ,
where is the partial trace that traces out the first qubit.
Lemma 26.
For , if for every and we have
then .
The proof of Lemma 26 is rather technical and thus is omitted in this version. For now, let us show how Lemmas 25 and 26 would imply the completeness and the soundness of Algorithm 2.
We apply Theorem 1 to the circuit minus its first layer, which is a circuit of depth . Notice that among the input qubits to the second layer of , the third to -th qubits only depends on the gates and thus can be viewed as fixed. The first qubit does not affect the -th output qubit and thus its entire lightcone can be removed from the picture. That leaves us the second qubit, which is
Therefore, with changed with a difference and unchanged, Theorem 1 implies that with probability , the following holds for every .
| (34) |
Meanwhile, Lemma 12 implies that
| (35) |
For completeness, since we take from an -net, the algorithm must have tested some with . Hence by Lemma 25 and Equation 35, for such it always holds that , and thus will not be rejected even with tomography errors in and .
For soundness, assume that , then by Lemma 26 we know that for some and ,
| (36) |
As , that means there exists some such that
| (37) |
Thus by Equation 34 we have , and will be rejected when and are estimated with error.
The efficiency of the algorithm depends on the size of the -net and the state tomography process, which are both . Notice that the algorithm similarly works for every gate in the first layer, and as a result, we obtained the following formal statement of Theorem 4:
Theorem 27.
Let be a brickwork random quantum circuit of depth , where each gate is independently Haar random. Let be the gate in the first layer of that acts on the first and second qubit. Given oracle access to , for any , with probability at least over , Algorithm 2 outputs some such that in time .
5.3.2 Learning the Circuit with Discretized Distribution
It is tempting to use Theorem 27 to learn the entire circuit . Indeed, if the statement is errorless that , we could view the single-qubit gates as part of the second layer. That means after learning the first layer, we can perfectly uncompute it and hence use Algorithm 2 to learn the second layer, and proceed until we are only left with single qubit gates, which are easily learnable.
However, when there are learning errors, which are inevitable for a continuous gate distribution like , the above framework runs into a problem. Since we cannot perfectly uncompute the first layer, the inputs to the rest of the circuit are not clean enough: In particular, the error in could have much larger influence on the output difference than itself. To control the influence of the errors, we need to reduce the learn error in the previous layer to be polynomially smaller than the target error in the current layer, and therefore a circuit of depth would require error as small as and incur quasi-polynomial running time.
Here we present a bypass to the problem which allows us to prove an errorless version of Theorem 27 and thus make the proposed framework work. The idea is to change the distribution from into a discrete one that approximates . Intuitively, any -net where the elements are distributed according to would be a good approximation. We formalize this intuition as the following:
Definition 28.
An -net of a distribution over a pseudometric space is a distribution over with a finite support, such that for some (possibly randomized) map , with the following properties:
-
For every , .
-
For every , either or .
Notice that under Definition 28, the set is indeed an -net in the normal sense. Actually, the definition is general enough so that we can first choose any -net as the support, remove the redundant elements with zero distances, and then take to be an arbitrary rounding scheme into the support. We show that approximates via the following lemma.
Lemma 29.
If is -Lipschitz, that is for all ,
then for every we have
From now on, for each we fix some -net of the Haar measure under the distance, and denote it by . We will show that when the gates in the brickwork circuit are drawn the net, we can actually use the framework at the start of this section to learn the circuit. To do so, we first prove an errorless version of Theorem 27 as follows.
Theorem 30.
Let be a brickwork random quantum circuit of depth , where each gate is independently drawn from . Let be the gate in the first layer of that acts on the first and second qubit. Given oracle access to , for every there is an algorithm that with probability at least over outputs in time .
Proof.
The algorithm is basically the same as Algorithm 2, except that we now iterate through the support of . The proof is also mostly the same: When , we have and thus will not be rejected; Otherwise , and the proof goes through as long as we have the corresponding version of Theorem 1.
Since Theorem 1 is proved via Lemma 17, it suffices to prove Lemma 17 where is replaced with . The crux is to prove the inequality Equation 27, that is for some , it holds for all that
| (38) |
where is the matrix form of the linear map , for in a certain fixed 3-dimensional subspace of .
Note that by Proposition 18, each entry of is in , while by Proposition 19, each entry of is a Lipschitz function of under distance . This is because when that corresponds to matrices and , for any and with we have
| (39) |
As consists of monomials of degree in the entries of , we conclude that is -Lipschitz in under . But when we have which means , and thus is also 144-Lipschitz in under .
As a result, since we already know that Equation 38 holds when , by Lemma 29 we have
| (40) |
However, this will not yield Lemma 17 as the bound is even non-zero when . Fortunately, we can simply use the union bound to ignore the cases when for any gate on the path, which will only add to the error probability . And conditioned on , for every we have
| (41) |
which allows us to prove Lemma 17 on , albeit with different constants.
As a result, we can use Theorem 30 to exactly learn the circuit layer by layer, and obtain Theorem 5 formally as follows.
Theorem 31.
Let be a brickwork random quantum circuit on qubits of depth , where each gate is independently drawn from . Given oracle access to , for every there is an algorithm that with probability at least outputs in time .
References
- [1] Scott Aaronson and Alex Arkhipov. The computational complexity of linear optics. In Proceedings of the 43rd ACM Symposium on Theory of Computing, STOC 2011, San Jose, CA, USA, 6-8 June 2011, STOC ’11, pages 333–342, New York, NY, USA, 2011. Association for Computing Machinery. doi:10.1145/1993636.1993682.
- [2] Scott Aaronson and Shih-Han Hung. Certified randomness from quantum supremacy. In Proceedings of the 55th Annual ACM Symposium on Theory of Computing, STOC 2023, pages 933–944, New York, NY, USA, 2023. Association for Computing Machinery. doi:10.1145/3564246.3585145.
- [3] Frank Arute et al. Quantum supremacy using a programmable superconducting processor. Nature, 574(7779):505–510, October 2019. doi:10.1038/s41586-019-1666-5.
- [4] Roozbeh Bassirian, Adam Bouland, Bill Fefferman, Sam Gunn, and Avishay Tal. On certified randomness from fourier sampling or random circuit sampling, 2024. arXiv:2111.14846.
- [5] Bruno Bertini and Lorenzo Piroli. Scrambling in random unitary circuits: Exact results. Phys. Rev. B, 102:064305, August 2020. doi:10.1103/PhysRevB.102.064305.
- [6] Sergio Boixo, Sergei V. Isakov, Vadim N. Smelyanskiy, Ryan Babbush, Nan Ding, Zhang Jiang, Michael J. Bremner, John M. Martinis, and Hartmut Neven. Characterizing quantum supremacy in near-term devices. Nature Physics, 14(6):595–600, April 2018. doi:10.1038/s41567-018-0124-x.
- [7] Adam Bouland, Bill Fefferman, Zeph Landau, and Yunchao Liu. Noise and the frontier of quantum supremacy. In 2021 IEEE 62nd Annual Symposium on Foundations of Computer Science (FOCS). IEEE, February 2022. doi:10.1109/focs52979.2021.00127.
- [8] Adam Bouland, Bill Fefferman, Chinmay Nirkhe, and Umesh Vazirani. On the complexity and verification of quantum random circuit sampling. Nature Physics, 15(2):159–163, October 2018. doi:10.1038/s41567-018-0318-2.
- [9] Paolo Braccia, Pablo Bermejo, Lukasz Cincio, and M. Cerezo. Computing exact moments of local random quantum circuits via tensor networks, 2024. arXiv:2403.01706.
- [10] Fernando G.S.L. Brandão, Aram W. Harrow, and Michał Horodecki. Local random quantum circuits are approximate polynomial-designs. Commun. Math. Phys., 346(2):397–434, 2016. doi:10.1007/s00220-016-2706-8.
- [11] Winton Brown and Omar Fawzi. Scrambling speed of random quantum circuits, 2013. arXiv:1210.6644.
- [12] Anthony Carbery and James Wright. Distributional and norm inequalities for polynomials over convex bodies in . Mathematical Research Letters, 8(3):233–248, 2001. doi:10.4310/mrl.2001.v8.n3.a1.
- [13] Chi-Fang Chen, Adam Bouland, Fernando G. S. L. Brandão, Jordan Docter, Patrick Hayden, and Michelle Xu. Efficient unitary designs and pseudorandom unitaries from permutations, 2024. doi:10.48550/arXiv.2404.16751.
- [14] Chi-Fang Chen, Jordan Docter, Michelle Xu, Adam Bouland, and Patrick Hayden. Efficient unitary t-designs from random sums, 2024. doi:10.48550/arXiv.2402.09335.
- [15] Chi-Fang Chen, Jeongwan Haah, Jonas Haferkamp, Yunchao Liu, Tony Metger, and Xinyu Tan. Incompressibility and spectral gaps of random circuits, 2024. doi:10.48550/arXiv.2406.07478.
- [16] Chi-Fang Chen and Andrew Lucas. Operator growth bounds from graph theory. Communications in Mathematical Physics, 385(3):1273–1323, July 2021. doi:10.1007/s00220-021-04151-6.
- [17] Chi-Fang Chen, Andrew Lucas, and Chao Yin. Speed limits and locality in many-body quantum dynamics. Reports on Progress in Physics, 86(11):116001, September 2023. doi:10.1088/1361-6633/acfaae.
- [18] Alexander M. Dalzell, Nicholas Hunter-Jones, and Fernando G. S. L. Brandão. Random quantum circuits anticoncentrate in log depth. PRX Quantum, 3(1), March 2022. doi:10.1103/prxquantum.3.010333.
- [19] Christoph Dankert, Richard Cleve, Joseph Emerson, and Etera Livine. Exact and approximate unitary 2-designs and their application to fidelity estimation. Phys. Rev. A, 80:012304, July 2009. doi:10.1103/PhysRevA.80.012304.
- [20] Joseph Emerson, Robert Alicki, and Karol Zyczkowski. Scalable noise estimation with random unitary operators. Journal of Optics B: Quantum and Semiclassical Optics, 7, March 2005. doi:10.1088/1464-4266/7/10/021.
- [21] Bill Fefferman, Soumik Ghosh, Michael Gullans, Kohdai Kuroiwa, and Kunal Sharma. Effect of non-unital noise on random circuit sampling, 2023. doi:10.48550/arXiv.2306.16659.
- [22] Matthew P.A. Fisher, Vedika Khemani, Adam Nahum, and Sagar Vijay. Random quantum circuits. Annual Review of Condensed Matter Physics, 14(1):335–379, 2023. doi:10.1146/annurev-conmatphys-031720-030658.
- [23] Jeongwan Haah, Yunchao Liu, and Xinyu Tan. Efficient approximate unitary designs from random pauli rotations, 2024. doi:10.48550/arXiv.2402.05239.
- [24] Jonas Haferkamp. Random quantum circuits are approximate unitary -designs in depth . Quantum, 6:795, September 2022. doi:10.22331/q-2022-09-08-795.
- [25] Dominik Hangleiter, Juan Bermejo-Vega, Martin Schwarz, and Jens Eisert. Anticoncentration theorems for schemes showing a quantum speedup. Quantum, 2:65, May 2018. doi:10.22331/q-2018-05-22-65.
- [26] Dominik Hangleiter and Michael J. Gullans. Bell sampling from quantum circuits. Phys. Rev. Lett., 133:020601, July 2024. doi:10.1103/PhysRevLett.133.020601.
- [27] Aram W. Harrow, Linghang Kong, Zi-Wen Liu, Saeed Mehraban, and Peter W. Shor. Separation of out-of-time-ordered correlation and entanglement. PRX Quantum, 2:020339, June 2021. doi:10.1103/PRXQuantum.2.020339.
- [28] Aram W. Harrow and Richard A. Low. Random quantum circuits are approximate 2-designs. Communications in Mathematical Physics, 291(1):257–302, July 2009. doi:10.1007/s00220-009-0873-6.
- [29] Aram W. Harrow and Saeed Mehraban. Approximate unitary -designs by short random quantum circuits using nearest-neighbor and long-range gates. Commun. Math. Phys., 401(2):1531–1626, 2023. doi:10.1007/s00220-023-04675-z.
- [30] Patrick Hayden and John Preskill. Black holes as mirrors: quantum information in random subsystems. Journal of High Energy Physics, 2007(09):120–120, September 2007. doi:10.1088/1126-6708/2007/09/120.
- [31] Hsin-Yuan Huang, Yunchao Liu, Michael Broughton, Isaac Kim, Anurag Anshu, Zeph Landau, and Jarrod R. McClean. Learning shallow quantum circuits. In Proceedings of the 56th Annual ACM Symposium on Theory of Computing, STOC 2024, pages 1343–1351, New York, NY, USA, 2024. Association for Computing Machinery. doi:10.1145/3618260.3649722.
- [32] Nicholas Hunter-Jones. Unitary designs from statistical mechanics in random quantum circuits, 2019. arXiv:1905.12053.
- [33] Shao-Kai Jian, Gregory Bentsen, and Brian Swingle. Linear growth of circuit complexity from brownian dynamics, 2022. arXiv:2206.14205.
- [34] Daniel Kane. A structure theorem for poorly anticoncentrated polynomials of gaussians and applications to the study of polynomial threshold functions. The Annals of Probability, 45(3):1612–1679, 2017. doi:10.1214/16-AOP1097.
- [35] Zeph Landau and Yunchao Liu. Learning quantum states prepared by shallow circuits in polynomial time, 2024. doi:10.48550/arXiv.2410.23618.
- [36] Elliott H. Lieb and Derek W. Robinson. The finite group velocity of quantum spin systems. Communications In Mathematical Physics, 28(3):251–257, September 1972. doi:10.1007/BF01645779.
- [37] Yunchao Liu, Matthew Otten, Roozbeh Bassirianjahromi, Liang Jiang, and Bill Fefferman. Benchmarking near-term quantum computers via random circuit sampling, 2022. arXiv:2105.05232.
- [38] Shachar Lovett. An elementary proof of anti-concentration of polynomials in gaussian variables. Electronic Colloquium on Computational Complexity (ECCC), 2010. URL: https://eccc.weizmann.ac.il/report/2010/182.
- [39] Elizabeth S Meckes. The random matrix theory of the classical compact groups, volume 218. Cambridge University Press, 2019. doi:10.1017/9781108303453.
- [40] Raghu Meka, Oanh Nguyen, and Van Vu. Anti-concentration for polynomials of independent random variables. Theory of Computing, 12(11):1–17, 2016. doi:10.4086/toc.2016.v012a011.
- [41] Raghu Meka and David Zuckerman. Pseudorandom generators for polynomial threshold functions. SIAM Journal on Computing, 42(3):1275–1301, 2013. doi:10.1137/100811623.
- [42] Tony Metger, Alexander Poremba, Makrand Sinha, and Henry Yuen. Simple constructions of linear-depth t-designs and pseudorandom unitaries, 2024. doi:10.48550/arXiv.2404.12647.
- [43] A. Morvan et al. Phase transition in random circuit sampling, 2023. arXiv:2304.11119.
- [44] Ramis Movassagh. The hardness of random quantum circuits. Nature Physics, 19(11):1719–1724, November 2023. doi:10.1038/s41567-023-02131-2.
- [45] Adam Nahum, Jonathan Ruhman, Sagar Vijay, and Jeongwan Haah. Quantum entanglement growth under random unitary dynamics. Physical Review X, 7(3), July 2017. doi:10.1103/physrevx.7.031016.
- [46] Adam Nahum, Sagar Vijay, and Jeongwan Haah. Operator spreading in random unitary circuits. Phys. Rev. X, 8:021014, April 2018. doi:10.1103/PhysRevX.8.021014.
- [47] Michael A. Nielsen and Isaac L. Chuang. Quantum Computation and Quantum Information: 10th Anniversary Edition. Cambridge University Press, 2010.
- [48] Erdős Paul and Alfréd Rényi. On a classical problem of probability theory. Magyar Tud. Akad. Mat. Kutató Int. Közl, 6(1):215–220, 1961.
- [49] Lorenzo Piroli, Christoph Sünderhauf, and Xiao-Liang Qi. A random unitary circuit model for black hole evaporation. Journal of High Energy Physics, 2020(4), April 2020. doi:10.1007/jhep04(2020)063.
- [50] Thomas Schuster, Jonas Haferkamp, and Hsin-Yuan Huang. Random unitaries in extremely low depth, 2024. doi:10.48550/arXiv.2407.07754.
- [51] Henrik Wilming and Albert H. Werner. Lieb-robinson bounds imply locality of interactions. Phys. Rev. B, 105:125101, March 2022. doi:10.1103/PhysRevB.105.125101.
- [52] Lisa Yang and Netta Engelhardt. The complexity of learning (pseudo)random dynamics of black holes and other chaotic systems, 2023. arXiv:2302.11013.
- [53] Tianci Zhou, Shenglong Xu, Xiao Chen, Andrew Guo, and Brian Swingle. Operator lévy flight: Light cones in chaotic long-range interacting systems. Phys. Rev. Lett., 124:180601, May 2020. doi:10.1103/PhysRevLett.124.180601.
