This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Quantum Error Mitigation by
Pauli Check Sandwiching

Alvin Gonzales1, Ruslan Shaydulin2, Zain H. Saleem3, and Martin Suchara34 Email: 1agonza@siu.edu 0000-0003-1635-106X 0000-0002-8657-2848 1Intelligence Community Postdoctoral Research Fellowship Program, Argonne National Laboratory, Lemont, IL, USA 3Mathematics and Computer Science Division, Argonne National Laboratory, Lemont, IL, USA 2Global Technology Applied Research, JPMorgan Chase, New York, NY, USA 4Amazon Web Services, Amazon, Seattle, WS, USA
Abstract

We describe and analyze an error mitigation technique that uses multiple pairs of parity checks to detect the presence of errors. Each pair of checks uses one ancilla qubit to detect a component of the error operator and represents one layer of the technique. We build on the results on extended flag gadgets and put it on a firm theoretical foundation. We prove that this technique can recover the noiseless state under the assumption of noise not affecting the checks. The method does not incur any encoding overhead and instead chooses the checks based on the input circuit. We provide an algorithm for obtaining such checks for an arbitrary target circuit. Since the method applies to any circuit and input state, it can be easily combined with other error mitigation techniques. We evaluate the performance of the proposed methods using extensive numerical simulations on 1,850 random input circuits composed of Clifford gates and non-Clifford single-qubit rotations, a class of circuits encompassing most commonly considered variational algorithm circuits. We observe average improvements in fidelity of 34 percentage points with six layers of checks.

I Introduction

Hardware errors or noise arising from qubit imperfections such as unwanted interactions with the environment limit the power of near-term quantum technologies. Since these devices lack the necessary number of qubits and error rates to perform quantum error correction [1, 2, 3], error mitigation is required in order to increase the fidelity of computations. In this work we investigate error mitigation that uses a small number of ancillas to suppress the effect of errors. Various error mitigation techniques have been developed, such as zero-noise extrapolation  [4, 5, 6], which uses different error rates to reduce the error in the measurement of an observable; probabilistic error cancellation [4], which uses an ensemble of known noisy circuits to approach the correct expectation value; dynamical decoupling [7, 8, 9], which uses timed control sequences to suppress interactions of the target quantum system with its environment; readout error mitigation [10], which uses classical postprocessing techniques to mitigate measurement errors; and symmetry verification [11, 12, 13, 14], which verifies symmetries in computational problems of interest and discards erroneous computations.

Protocols that improve measurements of an observable have applications in problems such as the estimation of the ground state energy of a given Hamiltonian [15]. In contrast, protocols that improve fidelity generally apply to any problem. A main feature of many techniques aimed at improving measurements of an observable or reducing readout error is that they have no quantum overhead; in other words, they require no extra qubits or quantum operations (gates). Thus, these techniques are ideal for the noisy intermediate-scale quantum (NISQ) era [16] because current state-of-the-art NISQ devices contain few qubits, typically fewer than 50, and a limited number of gate operations because of fast decoherence times.

As quantum technology develops, error mitigation schemes must adapt and take advantage of improvements in qubit count and quality [17, 18]. Qubit count and error rates vary widely depending on the underlying qubit technology. Additionally, many of the current error mitigation techniques such as dynamical decoupling and probabilistic error cancellation require intricate tailoring of the protocol to the noise. Thus, they typically require the added overhead in costly quantum tomography [19].

In this work we theoretically and numerically study a quantum error mitigation technique inspired by stabilizer codes, that aims at improvement in quantum state fidelity. We build on the results of [20], where they first explored the scheme of sandwiching a circuit between pairs of parity checks. Note that they refer to the pairs of checks as extended flag gadgets, inspired by the work in [21, 22]. Our research puts this parity check scheme on a firm theoretical foundation and numerically demonstrates its efficacy on a wide variety of quantum circuits. The main contributions of our work are: (1) extending the analysis to greater than two layers of checks, (2) establishing the theoretical limits of the technique, which culminates in the unit fidelity result of Theorem 1, (3) providing parity checks in Propositions 1 and 2 that saturate this fidelity bound and hence answers an open question in [20] regarding optimal checks to use, (4) providing a protocol that efficiently determines Pauli parity check pairs that can be used for a given input circuit, and (5) providing numerical simulations for a wide variety of random input circuits consisting of varying qubit count, cnot count, non-Clifford gate count, and layer count.

The error mitigation scheme that we study in this paper at its basic level of one layer uses one ancilla and two controlled unitary operations, which we refer to as checks. The parity checks sandwich the input circuit. Consequently, the error operator is conjugated between two controlled parity matrices. We measure the ancilla and postselect the state on the measurement outcomes. The net effect of the checks and the postselection is a transformed error map, where terms of the error map that anticommute with the checks are eliminated in the postselected state. The performance of this technique, measured by the improvement in quantum state fidelity, improves with the depth of the input circuit. Furthermore, this scheme is tunable, meaning that the number of layers and ancillary qubits used can be set by the user.

This protocol shares some similarity to symmetry verification, which also uses stabilizer-style parity checks to improve the fidelity of the quantum state and requires no knowledge of the noise. However, unlike symmetry verification, which requires input states to be restricted to a specific eigenspace, this scheme places no restriction on the input state. Thus, the technique applies to subcircuits directly and can be easily combined with other error mitigation methods.

In Theorem 1, we prove that in a restricted scenario where the noise does not affect the checks (see Figs. 4(a) and 4(b)) there exist checks such that the postselected state is noiseless and the fidelity reaches unity. We provide an example of a randomly generated Clifford circuit with added checks that saturates this fidelity bound.

We also investigate the performance of the scheme with numerical simulations in a more realistic setting where the checks are also noisy. The numerical simulations consist of 1,8501,850 (unmitigated) randomly generated five- and ten-qubit circuits composed of Clifford + arbitrary diagonal unitary gates. Our technique shows an average fidelity gain of 34 percentage points for random input circuits consisting of 4040 CNOTs with six layers of (noisy) checks; see Figs. 10a and 10b. The increase in fidelity comes at a cost of a lower probability of postselecting on the ancillas’ measurement outcomes. We also provide Clifford simulations that give intuition that this technique will perform well for deep circuits.

This paper is organized as follows. In Section II we review relevant background and provide definitions that are used in the paper. In Section III-A we provide the single-layer protocol. In Section III-B we describe the theoretical foundation of the technique. In Section III-C we provide the full multilayer scheme. In Section III-D we prove Theorem 1 and provide bounds on the number of layers required to reach unit fidelity for the restricted scenario where the noise does not affect the checks. In Section III-E we discuss how our results apply in general settings. In Section III-F we introduce techniques for finding checks quickly by using a precalculated table of commutation rules that eliminates the need to perform matrix multiplication. In Section III-G we give the results of our numerical simulations. In Section IV we discuss our results and possible areas for future work.

II Background

We begin with definitions and notation. For a detailed introduction to modeling of noise in quantum computation the reader is referred to Chapter 8 of [23].

The most general evolution of an open quantum system is given by a dynamical map [24]

:ρSρS,\displaystyle\mathcal{E}:\rho_{S}\rightarrow\rho^{\prime}_{S}, (1)

where ρS\rho_{S} and ρS\rho_{S}^{\prime} are elements of the system Hilbert space S\mathcal{H}_{S}. S\mathcal{H}_{S} is a subspace of the system and environment Hilbert space SE\mathcal{H}_{SE}, where SS is the system and EE is the environment.

In the case of an initially unentangled system and environment, the map is completely positive and trace preserving. It can be derived by taking the partial trace of the global unitary evolution and yields the operator sum representation

(ρS)\displaystyle\mathcal{E}(\rho_{S}) =trE(USEρSρEUSE)\displaystyle=\tr_{E}(U_{SE}\rho_{S}\otimes\rho_{E}U_{SE}^{\dagger})
=iEiρSEi,\displaystyle=\sum_{i}E_{i}\rho_{S}E_{i}^{\dagger}, (2)

where USEU_{SE} is a unitary acting across the system and environment [25, 26]. The operators EiE_{i} in Eq. (II) are commonly called Kraus operators [26]. A map is completely positive (CP) if it maps all positive operators to positive operators when extended by the identity map to arbitrary higher dimensions [25], namely,

n(ρ)0n,ρ,\displaystyle\mathcal{E}\otimes\mathcal{I}_{n}(\rho)\geq 0\quad\forall n,\rho, (3)

where n\mathcal{I}_{n} is the nn-dimensional identity map and ρ\rho is a density matrix. This extension to higher dimensions is required to ensure that when the input state is part of a higher-dimensional state, the output of the map is still positive. A map is trace preserving if

iEiEi=𝕀.\displaystyle\sum_{i}E_{i}^{\dagger}E_{i}=\mathbb{I}. (4)

Maps that do not satisfy Eq. (3) are called not completely positive (NCP) maps. NCP maps play a significant role in non-Markovian evolutions [27], where the evolution of the state is often not decomposable into a sequence of completely positive maps. NCP maps have the form

(ρ)=iηiEiρEi,\displaystyle\mathcal{E}(\rho)=\sum_{i}\eta_{i}E_{i}\rho E_{i}^{\dagger}, (5)

where ηi=±1\eta_{i}=\pm 1 and at least one ηi=1\eta_{i}=-1 exists [25].

In this paper we use fidelity as a figure of merit. For two quantum states ρ\rho and ω\omega, fidelity is defined as

F(ρ,ω)=(trρωρ)2.\displaystyle F(\rho,\omega)=\left(\tr\sqrt{\sqrt{\rho}\omega\sqrt{\rho}}\right)^{2}. (6)

Fidelity is symmetric with regard to its inputs.

Let UU denote the unitary operation implemented by the target circuit that we want to error mitigate. Let ρ\rho^{\prime} be the density matrix representing the ideal, noiseless output of the target circuit, ρn\rho_{n} be the density matrix of the noisy quantum state produced by the target circuit, and ρm\rho_{m} be the density matrix of the noisy error-mitigated state produced by the error mitigated target circuit.

We denote the fidelity of the noisy state before error mitigation as Fn=F(ρn,ρ)F_{n}=F(\rho_{n},\rho^{\prime}), and the fidelity of the state after application of error mitigation as Fm=F(ρm,ρ)F_{m}=F(\rho_{m},\rho^{\prime}).

We define the fidelity gain (improvement due to the technique) as

FmFn.\displaystyle F_{m}-F_{n}. (7)

We say that the method “detects all errors" when Fm=1F_{m}=1 or equivalently when the error map \mathcal{E} on the postselected state is identity, in other words, when all the Kraus operators of \mathcal{E} are proportional to identity.

Next we describe the noise model used in our numerical simulations. The depolarizing channel for dimension dd is

𝒟p(ρ)=(1p)ρ+p𝕀d,\displaystyle\mathcal{D}_{p}(\rho)=(1-p)\rho+p\dfrac{\mathbb{I}}{d}, (8)

where 0p10\leq p\leq 1 [23]. For the numerical simulations of noisy circuits the only noiseless gates are a measurement gate or the input state, which is generated by a random circuit. Otherwise, we apply the single-qubit depolarizing channel after each single-qubit gate and the two-qubit depolarizing channel after every two-qubit gate. Throughout this paper, we set the two-qubit error rate to ten times the single-qubit error rate, an assumption that roughly corresponds to noise observed in current NISQ systems [28, 29].

III Methods

In Sections III-A and III-B we describe the single-layer Pauli Check Sandwiching (PCS) technique and show that this protocol leads to a transformation of the error map. In Sections III-C and III-D we describe the multilayer protocol and prove that we can reach a fidelity of one between a noisy-mitigated circuit and a noiseless circuit when the error map is restricted to a subset of qubits. We also provide a small number of checks that achieve this fidelity. In Section III-E we investigate how our techniques apply in a general setting. In Sections III-F and III-G we give the results of numerical experiments across 1,8501,850 unmitigated random circuits.

Refer to caption
Figure 1: Overview of the one-layer version of the PCS scheme. UU represents the gates of the computation and acts across nn compute qubits. UU is sandwiched between two controlled unitaries comprising C~1\tilde{C}_{1} and C~2\tilde{C}_{2} that satisfy Eq. (11). The ancilla is the bottom qubit. The measurement is performed in the {|0,|1}\{\ket{0},\ket{1}\} basis. The measurement outcome one is discarded, and zero is kept.

III-A Pauli Sandwich Error Mitigation Protocol: Single Layer

We begin by describing the simplest version of the Pauli Check Sandwiching technique that consists of a single pair of parity checks sandwiching the computation (one “layer”). Figure 1 shows a graphical view of the protocol. The unitary operation UU represents the gates of the computation. The bottom qubit is the single ancilla introduced by this scheme, and we commonly refer to the nn qubits above as the compute or computation qubits.

Let C2C_{2} (C1C_{1}) be a controlled unitary with control on the ancilla that applies C~2\tilde{C}_{2} (C~1\tilde{C}_{1}) on the compute target qubits. Mathematically,

C1\displaystyle C_{1} =C~1|11|+𝕀|00|\displaystyle=\tilde{C}_{1}\otimes\outerproduct{1}{1}+\mathbb{I}\otimes\outerproduct{0}{0} (9)
C2\displaystyle C_{2} =C~2|11|+𝕀|00|.\displaystyle=\tilde{C}_{2}\otimes\outerproduct{1}{1}+\mathbb{I}\otimes\outerproduct{0}{0}. (10)

This scheme also requires that

C~2UC~1=U.\displaystyle\tilde{C}_{2}U\tilde{C}_{1}=U. (11)

Before continuing, we make an important distinction between two protocols: the efficient PCS protocol and the general PCS protocol. For the efficient PCS protocol, we restrict C~1\tilde{C}_{1} and C~2\tilde{C}_{2} to be elements of the nn-qubit Pauli group 𝒫n\mathcal{P}_{n}, where

𝒫n={𝕀,x,y,z}n×{±1,±i}.\displaystyle\mathcal{P}_{n}=\{\mathbb{I},\textsc{x},\textsc{y},\textsc{z}\}^{\otimes n}\times\{\pm 1,\pm i\}. (12)

These added conditions are partly due to the difficult problem of determining the optimal circuit that implements C~1\tilde{C}_{1} from a given C~2\tilde{C}_{2} and UU. Note that C~2\tilde{C}_{2} and C~1\tilde{C}_{1} can be much more general and still satisfy Eq. (11). Thus, there are no additional constraints on the checks in the general PCS protocol.

In the general PCS protocol and for a given UU, any unitary C~2\tilde{C}_{2} can be used because in Eq. (11) we can always pick C~2\tilde{C}_{2} and solve for C~1\tilde{C}_{1}. We note in the text if a result holds for a specific case. If no statement is made, then the result holds for both scenarios.

The single-layer protocol is as follows.

  1. 1.

    Initialize the ancilla to |0\ket{0} and apply a Hadamard gate. Perform C1C_{1} with the control on the ancilla qubit and target on the compute qubits.

  2. 2.

    Perform UU on the compute qubits.

  3. 3.

    Perform C2C_{2} with the control on the ancilla qubit and target on the compute qubits.

  4. 4.

    Apply a Hadamard gate to the ancilla. Measure the ancilla in the {P0=|00|,P1=|11|}\{P_{0}=\outerproduct{0}{0},P_{1}=\outerproduct{1}{1}\} basis, and discard the results where the outcome is P1P_{1}. We keep the result where the outcome is P0P_{0}.

III-B Errors Detected by the Pauli Sandwich

Refer to caption
(a) \mathcal{E} is a noise map.
Refer to caption
(b) Equivalent noisy circuit using Eq. (11). The gates after UU along with the postselection on the ancilla can be seen as the transformed error map.
Figure 2: Noisy single-layer scheme

We now consider the effect of this scheme on an error map \mathcal{E} acting on the compute qubits after UU, as shown in Fig. 2(a). Let (ρ)=iEiρEi\mathcal{E}(\rho)=\sum_{i}E_{i}\rho E_{i}^{\dagger}. Then the postselected output state of the protocol is

ρm=\displaystyle\rho_{m}=
i[(C~2EiC~2+Ei)UρU(C~2EiC~2+Ei)]tr(i[(C~2EiC~2+Ei)UρU(C~2EiC~2+Ei)]).\displaystyle\dfrac{\sum_{i}[(\tilde{C}_{2}E_{i}\tilde{C}_{2}^{\dagger}+E_{i})U\rho U^{\dagger}(\tilde{C}_{2}E^{\dagger}_{i}\tilde{C}_{2}^{\dagger}+E^{\dagger}_{i})]}{\tr(\sum_{i}[(\tilde{C}_{2}E_{i}\tilde{C}_{2}^{\dagger}+E_{i})U\rho U^{\dagger}(\tilde{C}_{2}E^{\dagger}_{i}\tilde{C}_{2}^{\dagger}+E^{\dagger}_{i})])}. (13)

As shown in Fig. 2(b), this protocol transforms the error map. We can write the postselected state given in Eq. (III-B) in terms of a new error map \mathcal{E}^{\prime},

ρm=(UρU)tr[(UρU)],\displaystyle\rho_{m}=\dfrac{\mathcal{E}^{\prime}(U\rho U^{\dagger})}{\tr[\mathcal{E}^{\prime}(U\rho U^{\dagger})]}, (14)

where \mathcal{E}^{\prime} has Kraus operators

Ei=C~2EiC~2+Ei2\displaystyle E^{\prime}_{i}=\dfrac{\tilde{C}_{2}E_{i}\tilde{C}_{2}^{\dagger}+E_{i}}{2} (15)

and the factor of 1/21/2 comes from multiplying Eq. (III-B) by a convenient form of one, namely, (1/4)/(1/4)(1/4)/(1/4). We can now observe the power of this error mitigation technique. The error operators EiE_{i} can be expanded in the Pauli basis. Thus, let

Ei=σ~j𝒫nαijσ~j,\displaystyle E_{i}=\sum_{\tilde{\sigma}_{j}\in\mathcal{P}_{n}}\alpha_{ij}\tilde{\sigma}_{j}, (16)

where σ~j\tilde{\sigma}_{j} is an element of the Pauli group and αij=tr(Eiσ~j)/(2n)\alpha_{ij}=\tr(E_{i}\tilde{\sigma}_{j})/(2^{n}) is a complex constant. Let C~2𝒫n\tilde{C}_{2}\in\mathcal{P}_{n}. Each σ~j\tilde{\sigma}_{j} term in the expansion of EiE_{i} either commutes or anticommutes with C~2\tilde{C}_{2}. Substituting Eq. (16) into Eq. (15), we see that the σ~j\tilde{\sigma}_{j} elements that anticommute with C~2\tilde{C}_{2} are eliminated and

Ei=σ~j𝒫nαijσ~j,\displaystyle E^{\prime}_{i}=\sum_{\tilde{\sigma}_{j}\in\mathcal{P}^{\prime}_{n}}\alpha_{ij}\tilde{\sigma}_{j}, (17)

where 𝒫n\mathcal{P}^{\prime}_{n} is the Pauli group excluding the elements that anticommute with C~2\tilde{C}_{2}.

The effect of the protocol on the error map shares some similarities to that of twirling [30, 31, 32]. In twirling, the twirling set TT is used to conjugate the error map:

1|T|VTV(VρV)V\displaystyle\dfrac{1}{\absolutevalue{T}}\sum_{V\in T}V\mathcal{E}(V^{\dagger}\rho V)V^{\dagger}
=1|T|i,(VT)VEiVρVEiV.\displaystyle=\dfrac{1}{\absolutevalue{T}}\sum_{i,(V\in T)}VE_{i}V^{\dagger}\rho VE_{i}^{\dagger}V^{\dagger}. (18)

Usually, twirling is performed by using the Pauli or Clifford group as the twirling set. When twirling is performed with a suitable set, it transforms the noise into a Pauli channel. However, the PCS scheme is in some sense more powerful since it completely eliminates the contribution of anticommuting Pauli terms.

Refer to caption
Figure 3: Multilayer scheme. There are nn compute qubits, mm layers, and mm ancillas. The second index in the controlled unitaries represents the layer. Each layer uses one ancilla and two checks. The checks sandwich the input circuit.

III-C Pauli Sandwich Error Mitigation Protocol: Multiple Layers

The suppression of errors from anticommuting Pauli terms in the postselected state can be enhanced by introducing multiple layers of the single-layer error mitigation technique. A graphical view of how this scheme works is given in Fig. 3. There are mm layers with each layer consisting of controlled operations C1,kC_{1,k} and C2,kC_{2,k}, where the second index represents the layer, and one ancilla corresponding to each layer. Each pair of C1,kC_{1,k} and C2,kC_{2,k} satisfies

C~2,kUC~1,k=U.\displaystyle\tilde{C}_{2,k}U\tilde{C}_{1,k}=U. (19)

The multilayer scheme generalizes the single-layer scheme and is performed as follows.

  1. 1.

    Initialize the ancillas to |0\ket{0}, and perform Hadamard gates on the ancillas. Perform C1,kC_{1,k} with control on the kthk^{\text{th}} ancilla qubit and target on the compute qubits.

  2. 2.

    Perform UU on the compute qubits.

  3. 3.

    Perform C2,kC_{2,k} with control on the kthk^{\text{th}} ancilla qubit and target on the compute qubits.

  4. 4.

    Perform Hadamard gates on the ancillas. Measure all the ancillas in the {P0=|00|,P1=|11|}\{P_{0}=\outerproduct{0}{0},P_{1}=\outerproduct{1}{1}\} basis, and discard the results where at least one of the outcomes is P1P_{1}. We keep the result where all the outcomes are P0P_{0}.

III-D Upper Bounds on Fidelity and Required Number of Checks

Refer to caption
(a) \mathcal{E} is an arbitrary noise map.
Refer to caption
(b) Equivalent circuit using Eq. (19). Reminiscent of the single-layer scheme, the gates after UU along with the postselection on the ancillas can be seen as the transformed error map.
Figure 4: Noisy multilayer scheme

Now let us consider a noise map (ρ)=iEiρEi\mathcal{E}(\rho)=\sum_{i}E_{i}\rho E_{i}^{\dagger} acting after UU on a subset of qubits as shown in Fig. 4(a). From Eq. (15), in the expansion of EiE_{i} in the Pauli basis, we know that the kkth layer eliminates Pauli terms that anticommute with C~2,k\tilde{C}_{2,k}. This immediately leads to the observation that we can detect all errors under the noise model shown in Fig. 4(a), which we prove in the following theorem. Theorem 1 holds in general for the general PCS protocol and it holds for the efficient PCS protocol when the checks are in the Pauli group, in other words, when UU is Clifford. Note that we discuss why these results hold for NCP errors as well later at the start of Section III-E.

Theorem 1 (Unit Fidelity).

If errors are restricted to act only on the compute qubits, for any noisy unitary quantum circuit UU acting on nn compute qubits, there exist checks (see Fig. 4(a)) such that the fidelity between the post selected state and a noiseless run (noiseless execution of U only) reaches one.

Proof.

First, note that if the error map (ρ)=iEiρEi\mathcal{E}(\rho)=\sum_{i}E_{i}\rho E_{i}^{\dagger} is the identity map, then the fidelity between the output ρm\rho_{m} of the error-mitigated circuit and the output ρ\rho^{\prime} of a circuit with only UU (a noiseless run) is

F(ρm,ρ)=1.\displaystyle F(\rho_{m},\rho^{\prime})=1. (20)

This directly follows from Eq. (19). Thus, if we can transform all the Kraus operators EiE_{i} of the error \mathcal{E} to identity in the postselected state, then we have the result.

Notice that from Eq. (19), Fig. 4(a) is equivalent to Fig. 4(b) and the error map is conjugated by multiple layers of checks. Expanding the error in the Pauli basis, we have

Ei=σ~j𝒫nαijσ~j,\displaystyle E_{i}=\sum_{\tilde{\sigma}_{j}\in\mathcal{P}_{n}}\alpha_{ij}\tilde{\sigma}_{j}, (21)

where σ~j\tilde{\sigma}_{j} is an element of the Pauli group 𝒫n\mathcal{P}_{n} and αij=tr(Eiσ~j)/(2n)\alpha_{ij}=\tr(E_{i}\tilde{\sigma}_{j})/(2^{n}) is a complex constant. Let C~2,i𝒫n,i\tilde{C}_{2,i}\in\mathcal{P}_{n},\forall i.

We now make the results given in Section III-B recursive. First, we label the check layers from 1 to mm starting with the innermost layer. Then, Eq. (15) can be written recursively as

Ei(k)=C~2,kEi(k1)C~2,k+Ei(k1)2,\displaystyle E_{i}^{(k)}=\dfrac{\tilde{C}_{2,k}E_{i}^{(k-1)}\tilde{C}_{2,k}^{\dagger}+E_{i}^{(k-1)}}{2}, (22)

where (k)(k) represents the layer and Ei(0)E_{i}^{(0)} is the initial error Kraus operation. This leads to the recursive form of Eq. (17),

Ei(k)=σjGn(k)αijσ~j,\displaystyle E^{(k)}_{i}=\sum_{\sigma_{j}\in G^{(k)}_{n}}\alpha_{ij}\tilde{\sigma}_{j}, (23)

where Gn(k)G^{(k)}_{n} is the Pauli group excluding the elements that anticommute with {C~2,1,C~2,2,,C~2,k}\{\tilde{C}_{2,1},\tilde{C}_{2,2},\cdots,\tilde{C}_{2,k}\}. Letting kk equal the size of 𝒫n\mathcal{P}_{n} (excluding global phases), namely, 4n4^{n}, we get Ei(k)=αiIE_{i}^{(k)}=\alpha_{i}I. The αi\alpha_{i} is a constant that cancels out under renormalization, and the result follows. ∎

Before proceeding, we need to clarify the implications of Theorem 1. In that theorem, if we satisfy the conditions, we will have unit fidelity in the postselected state. However, the probability of postselecting is

P(0¯)=tr((m)(UρU)),\displaystyle P(\overline{0})=\tr(\mathcal{E}^{(m)}(U\rho U^{\dagger})), (24)

where the Kraus operators of (m)\mathcal{E}^{(m)} are given by Eq. (23). In Eq. (23) the checks will eliminate all the Pauli terms that are nonidentity. Thus, we see that if all the Kraus operators of the error map are traceless, in other words, contain no identity term in their expansion in the Pauli basis, all the Kraus operators for the error map in the postselected state will be the zero matrix, and the probability of postselecting is zero. This makes sense because we are not correcting errors, but mitigating errors by post selecting outcomes. The theorem holds trivially in this scenario because there is no post selected state.

Moreover, from Fig. 4(a) and Theorem 1, it seems that we can set (ρ)=UρU\mathcal{E}(\rho)=U^{\dagger}\rho U, which eliminates UU, and use only the checks for the implementation of the circuit. While this is certainly true, we must consider the postselection probability. If UU is traceless, the probability of postselecting is zero.

Next we provide a small number of C2C_{2} checks that can reach unit fidelity in the setting of Theorem 1. The following results given in Propositions 1 and 2 are for the general PCS protocol. Propositions 1 and 2 hold for the efficient PCS protocol given that the C~1\tilde{C}_{1} checks are in the Pauli group, in other words, UU is Clifford. Propositions 1 and 2 hold for the noise model give in Fig. 4(a). For arbitrary weight-one Kraus errors, that is, the Kraus error operators, EiE_{i} act only on a single qubit; there exist two layers, where C~2,1\tilde{C}_{2,1} and C~2,2\tilde{C}_{2,2} are max weight, and we reach unit fidelity in the postselected state.

Proposition 1 (Weight-One Kraus Operators: Two layers of max weight checks are sufficient).

For the noise model given in Fig. 4(a) and for all \mathcal{E} consisting of only weight-one EiE_{i}, there exist two layers of checks such that we have unit fidelity in the postselected state. The C2C_{2} part of the checks requires a total of 2n2n cnot gates, where nn is the number of compute qubits.

Proof.

Each of the single-qubit errors can be expanded in terms of the single-qubit Pauli gates. Thus,

Ei,k=jαi,jσj,k,\displaystyle E_{i,k}=\sum_{j}\alpha_{i,j}\sigma_{j,k}, (25)

where kk is the qubit it is acting on, σj\sigma_{j} is a Pauli matrix or identity, and αi,j\alpha_{i,j} is a complex constant. Let our checks be

C~2,1=xn\displaystyle\tilde{C}_{2,1}=\textsc{x}^{\otimes{n}} (26)

and

C~2,2=zn.\displaystyle\tilde{C}_{2,2}=\textsc{z}^{\otimes{n}}. (27)

These checks are inspired by the parity checks used in Shor’s code [33]. The C~2,1\tilde{C}_{2,1} consist of tensors of Pauli X and anticommutes with Pauli Y and Pauli Z errors in Eq. (25). The C~2,2\tilde{C}_{2,2} consist of tensors of Pauli Z and anticommutes with Pauli X errors in Eq. (25). From Theorem 1, the anticommuting terms in the error operators are suppressed. Thus, these two layers of checks are sufficient to reach fidelity one. ∎

The checks given in Proposition 1 can detect all errors \mathcal{E} that consist of weight-one Kraus operators EiE_{i}. This class of errors contains error maps that are more general than just single-qubit error maps. For example, E1E_{1} can act on qubit one, and E2E_{2} can act on qubit two. E1E_{1} and E2E_{2} are weight-one errors, but the overall map affects multiple qubits.

Remark.

At least two layers are necessary to reach fidelity one in Proposition 1. To see this, we need only show that a single layer is insufficient for arbitrary weight-one errors. Consider a circuit with only one compute qubit. For an arbitrary single-layer scheme, let C~2=W\tilde{C}_{2}=W be the check. Then let the error map be E=WE=W. The check and the error do not anticommute so the error map in the postselected state is not identity. Thus, a single layer is insufficient to detect all weight-one errors; at least two layers are necessary. Proposition 1 shows that we can always saturate this lower bound on the number of required checks.

We can also reach fidelity one for arbitrary weight errors for the error model given in Fig. 4(a) with a small number of weight-one C~2,k\tilde{C}_{2,k}. These checks are generators of the Pauli group and require 2n2n layers, but the C2C_{2} components of the checks require the same number of cnot gates as in Proposition 1. Thus, generally at the cost of more ancillas, we can detect all errors on the postselected state. Consider two weight-one C~2\tilde{C}_{2} checks of σ1\sigma_{1} and σ3\sigma_{3} on the kkth compute qubit. All Pauli group elements that are nonidentity on the kkth qubit anticommute with either σ1\sigma_{1} or σ3\sigma_{3}. This leads to the following small set that can reach fidelity one.

Proposition 2 (Any Error: 2n2n number of weight-one checks are sufficient).

For the noise model given in Fig. 4(a) and arbitrary errors, let nn be the number of compute qubits. Then there exist 2n2n number of distinct (ignoring the global phase) weight-one C~2,k\tilde{C}_{2,k} such that we have unit fidelity in the postselected state.

Proof.

Let the kkth compute qubit have two layers acting on it with C~2,r(k)=σ1(k)\tilde{C}^{(k)}_{2,r}=\sigma_{1}^{(k)} and C~2,l(k)=σ3(k)\tilde{C}^{(k)}_{2,l}=\sigma_{3}^{(k)}. All Pauli group elements that are nonidentity on the kkth qubit anticommute with at least one of the checks. Thus, this eliminates all Pauli terms in the expansion of the error Kraus operators that do not have identity on the kkth qubit for the postselected state. We repeat these checks for the other compute qubits. The same argument holds in general for {σi(k),σj(k)|ij}\{\sigma_{i}^{(k)},\sigma_{j}^{(k)}|i\neq j\} and the result follows. ∎

Refer to caption
Figure 5: Example of checks that detect all errors. The upper bound on fidelity is saturated at four layers for this randomly generated Clifford input circuit consisting of two qubits and 30 cnot gates. We use depolarizing noise for the given noise model in Fig. 4(a). The two-qubit error rate is ten times the single-qubit error rate. The single qubit-error rate ranges from 10510^{-5} to 10110^{-1}. At 10110^{-1}, each cnot gate (acting on the compute qubits only) is followed by a two-qubit maximal depolarizing channel. Regardless, the postselected state is noiseless, as predicted by Theorem 1.

Figure 5 shows an example of a random Clifford circuit consisting of two compute qubits and 30 cnot gates that gives unit fidelity for the postselected state. This matches the prediction of Theorem 1. We use a Clifford circuit to guarantee that we can get the desired checks with the efficient PCS protocol. We use the checks provided in Proposition 2. The two checks on each compute qubit are σ1(k)\sigma_{1}^{(k)} and σ2(k)\sigma_{2}^{(k)}, and we vary the number of layers from zero to four. Interestingly, at the single-qubit error of 0.10.1, the two-qubit depolarizing channel is maximally depolarizing, but the fidelity remains at one for the postselected state (as predicted).

The gain in fidelity comes at the cost of a lower probability of measuring all zeros for the ancillas. This trade-off is demonstrated in Fig. 5b. The probability of measuring all zeros p(0¯)p(\overline{0}) drops to around 7% for this circuit at the high single-qubit error of 0.10.1. Note that the overhead in the number of runs is 1p(0¯)\frac{1}{p(\overline{0})}.

III-E General Errors and Hardware Considerations

Refer to caption
(a) \mathcal{E} is a general noise map that acts across all qubits.
Refer to caption
(b) Using Eq. (19), the error map is still conjugated between the checks.
Figure 6: Noisy multilayer scheme

In the preceding sections, we restricted \mathcal{E} to CP maps, but our results hold also for general linear Hermitian maps, which includes NCP maps. As previously mentioned, NCP maps play a major role in non-Markovian evolutions, where the maps tend to be non-CP divisible. NCP maps have a similar form to CP maps and are written as (ρ)=iηiEiρEi\mathcal{E}(\rho)=\sum_{i}\eta_{i}E_{i}\rho E_{i}^{\dagger}, where ηi=±1\eta_{i}=\pm 1 and there exists at least one ηi=1\eta_{i}=-1. The coefficients ηi\eta_{i} are not used in any of our proofs, and thus the results hold.

Also, we restricted \mathcal{E} to act only on the compute qubits. Obviously this is a restricted case, and in physical systems the checks are noisy and the error map would generally act across all the qubits, as shown in Fig. 6(a). In this situation, the checks still conjugate the error, as shown in Fig. 6(b). Consequently, the technique is effective when \mathcal{E} is dominated by Kraus operators that mainly affect the compute qubits; that is, the majority of the noise is from UU.

On non fully connected quantum computers, the parity checks may be difficult to perform with resulting minimal noise on the ancillas due to the need for swapping qubits. Thus, applications of this technique likely need to carefully map the circuit to the hardware to minimize the swaps between ancillas and compute qubits or execute the circuits on a fully connected device.

Since single-qubit gates introduce less noise than nonlocal gates, the Pauli group is a good candidate for the C~2\tilde{C}_{2} part of the checks. Furthermore, when UU is a deep circuit, the noise it induces will generally act across multiple qubits. In this scenario, low-weight C~2\tilde{C}_{2} will act nontrivially on these errors. Thus, in general it is better to use low-weight checks in order to avoid introducing too many errors.

Moreover, for some executions of this scheme, the postselection probability may be smaller than desired. The postselection probability can be increased by reducing the number of check layers.

III-F Protocol for Finding Checks Quickly

Refer to caption
(a) We assign zyzz to C~2\tilde{C}_{2}
Refer to caption
(b) We propagate one layer to the left and determine the intermediate check gate zyzz.
Refer to caption
(c) We propagate the intermediate check gate one layer to the left and determine the intermediate check gate zyzz. Notice that the z gate commutes with Rz.
Refer to caption
(d) We propagate through the last layer and assign the result zyzx-\textsc{z}\textsc{y}\textsc{z}\textsc{x} to the C~1\tilde{C}_{1} gate.
Figure 7: Visual example of “pushing" the checks through the circuit. We start with C~2\tilde{C}_{2} and find C~1\tilde{C}_{1}. No multiplication is performed since the propagation of the C~2\tilde{C}_{2} gate is determined through lookups of predetermined commutation relations.
Refer to caption
Figure 8: Final error-mitigated circuit for the example described in Fig. 7

While checks always exist for a given UU, in practice it is difficult to directly compute C~1\tilde{C}_{1} from Eq. (19) for a given C~2\tilde{C}_{2}. Here we introduce our searching protocol for the efficient PCS protocol for determining the check pairs quickly and without matrix multiplication. Note that this protocol can fail to find any checks or may not find the desired number of checks. This can happen when the circuit contains many non-Clifford gates. We refer to the checks searching protocol as the finding checks protocol. For our implementation, we constrained C~1\tilde{C}_{1} and C~2\tilde{C}_{2} to be in the Pauli group. We leave the potential searching protocol of a non-Pauli C~1\tilde{C}_{1} for future work.

The goal is to determine the gates comprising C~1\tilde{C}_{1} from a given C~2𝒫n\tilde{C}_{2}\in\mathcal{P}_{n} and a given UU. Instead of performing matrix multiplication, we transpile the input circuit to an equivalent circuit that uses the gate set {x,y,Rz,s,h,cnot}\{\textsc{x},\textsc{y},\textsc{Rz},\textsc{s},\textsc{h},\textsc{cnot}\} and perform lookups of the commutation relations. This method applies to circuits consisting of Clifford ++ arbitrary diagonal gates, which is a universal gate set since diagonal gates contain the gate TT.

To determine the checks, we use the equality U1U2=U2(U2U1U2)=U2U1U_{1}U_{2}=U_{2}(U_{2}^{\dagger}U_{1}U_{2})=U_{2}U^{\prime}_{1}, where U1=U2U1U2U^{\prime}_{1}=U_{2}^{\dagger}U_{1}U_{2} and U1U_{1} and U2U_{2} are unitary. We refer to this technique as “pushing" U1U_{1} through U2U_{2}. Figure 7 gives a visual example of the pushing of the C~2\tilde{C}_{2} gates to determine C~1\tilde{C}_{1}. Figure 8 is the completed error-mitigated circuit. This process is efficient since the cost of each lookup call is constant O(1)O(1).

Algorithm 1 is the pseudocode for the main script for finding a desired number of Pauli check pairs. It iterates through the minimum weight Pauli checks first and terminates when a sufficient number of layers of checks have been found. The protocol focuses on using low weight checks to minimize the noise introduced by the checks as discussed previously in Section III-E. The main script calls on Alg. 3 to see whether it is possible to push the current gate through. In Alg. 2 the lookup call is a preset table that has commutation relations. This symbolic “pushing" of Pauli gates through UU works for all gates in the basis set except for Rz. For Rz, if the gate being pushed is not in {z,i}\{\textsc{z},\textsc{i}\}, which are operators that commute with an arbitrary diagonal gate, then we skip that Pauli group element.

Note that mathematically, any C~2𝒫n\tilde{C}_{2}\in\mathcal{P}_{n} can be used because C~1\tilde{C}_{1} can be determined from Eq. (11). Thus, one should be able to expand the current algorithm to allow for finding of general C~1\tilde{C}_{1}. This problem is nontrivial.

Algorithm 1 Main script: find pairs of Pauli checks
1:circcirc\leftarrow quantum circuit
2:paulispaulis\leftarrow +1 phase Pauli group for N qubits. paulispaulis is sorted by weight from smallest to largest
3:c1c_{1} is initialized to None
4:c2c_{2} is initialized to None
5:layersFound0layersFound\leftarrow 0
6:numberLayersnumberLayers\leftarrow number of layers to find for circcirc.
7:for paulipauli in paulispaulis do
8:     if layersFoundlayersFound equals numberLayersnumberLayers then
9:         BREAK \triangleright Found the necessary number of layers.
10:     end if
11:     op2pauliop2\leftarrow pauli
12:     for op1op1 and indexindex in circcirc do
13:         if CanContinue(op1op1, op2op2then
14:              op2op2\leftarrow Push(op1, op2op2)
15:         else
16:              BREAK \triangleright This check attempt didn’t work so break out and try the next Pauli.
17:         end if
18:         if indexindex is the last index then
19:              layersFoundlayersFound+1layersFound\leftarrow layersFound+1
20:              c1c_{1}\leftarrow append op2op2
21:              c2c_{2}\leftarrow append paulipauli
22:         end if
23:     end for
24:end for
Algorithm 2 Push Pauli gate left
1:Input:
2:op1: The current gate that we need to pass through.
3:op2: The gate being pushed through gate.
4:Output:
5:The gate after it is pushed through op1. Should be equivalent to op1op2op1op1^{\dagger}*op2*op1
6:function push(op1op1, op2op2)
7:     if op1op1 is X then
8:         return lookup X(op2)XX(op2)X
9:     else if op1op1 is Y then
10:         return lookup Y(op2)YY(op2)Y
11:     else if op1op1 is H then
12:         return lookup H(op2)HH(op2)H
13:     else if op1op1 is S then
14:         return lookup S(op2)SS^{\dagger}(op2)S
15:     else if op1op1 is Rz then
16:         return op1op1 \triangleright op1op1 will commute with Rz from the previous check performed in the function CanContinue
17:     else if op1op1 is cnot then
18:         return lookup CX(op2)CXCX(op2)CX
19:     end if
20:end function
Algorithm 3 Checks if it is possible to pass the gate through
1:Input:
2:op1: The current gate that we need to pass through.
3:op2: The gate being pushed through gate.
4:Output:
5:True if the gate can be pushed through and false if not.
6:function CanContinue(op1op1, op2op2)
7:     return op1op1 is not Rz or (op2op2 is I or Z)
8:end function

III-G Numerical Results

The analytical results presented above assume perfect checks. Here we numerically investigate the scheme in a more realistic setting where most gates are noisy, including those involved in the parity checks (the only gates that are not noisy are measurements and the circuit that generates the random input state).

Intuitively, given a Clifford circuit and using the efficient PCS protocol, we should be able to perform long computations with high fidelity. For a given Clifford circuit, we can keep the C2C_{2} checks constant and independent of the depth of UU. Thus, the noise induced by our C2C_{2} checks should be relatively constant.

The C1C_{1} checks depend on UU, but they are elements of the Pauli group and hence limited in size and complexity. Therefore, the noise induced by the C1C_{1} checks should also be limited and independent of the depth of UU.

We demonstrate this intuition on simulations consisting of 550 randomly generated Clifford circuits with two compute qubits. Note that these Clifford simulations provide only intuition that the protocol is suitable for deep circuits because Clifford circuits can in general be optimized to use O(n2/log(n))O(n^{2}/log(n)) cnot gates, where nn is the number of qubits [34]. Thus, two-qubit Clifford circuits can be optimized to be shallow. It may be possible to prove this performance on Clifford circuits with higher qubit counts.

We considered random circuits with cnot counts that varied from 1,2,4,,1,0241,2,4,\cdots,1,024. For each cnot count we generated 5050 random circuits, and we used single-qubit depolarizing noise of 0.00126 (0.0126 two-qubit noise). This lies within the range of current noise levels found in state of the art quantum computers [35, 29]. We used four layers of checks; the form of the checks was provided in Proposition 2. As shown in Fig. 9a, we maintained an average fidelity FmF_{m} for the postselected state of greater than 90% for circuits consisting of up to 1,024 cnot gates. The average fidelity of the unmitigated circuits drops to 25% at 256 cnots. Note that this comes at the cost of a lower postselection rate of 6.25% as shown in Fig. 9b.

For optimized Clifford circuits, we would likely not want to use all the checks from Proposition 2 because we would probably exceed the cnot count of the input circuit. Still, as shown in Fig. 9a, fewer layers can produce significant fidelity improvement.

Refer to caption
Figure 9: Two-qubit Clifford simulation with single-qubit error of 0.00126 (0.0126 two-qubit error). Note that while these input circuits can be optimized to use O(n2/log(n))O(n^{2}/log(n)) cnot gates (i.e., O(2) cnot gates), these simulations provide intuition that the protocol is suitable for deep circuits.

These simulations establish the general trend that fidelity is positively correlated with the number of layers up to some value. We suspect that these results also hold for general (non-Clifford) circuits.

We also randomly generated 1,850 input circuits consisting of Clifford + arbitrary diagonal unitary gates. Of these, 1,350 input circuits consist of five qubits with cnot counts of {1,5,,40}\{1,5,\cdots,40\}; 500 input circuits consist of ten qubits with cnot counts of {1,5,,40,80}\{1,5,\cdots,40,80\}. We varied the single-qubit error from 10510^{-5} to 10210^{-2} with 21 equally spaced points in log scale.

For the ten-qubit circuits we also generated circuits with cnot gate counts of 80 to match the max cnot count to qubit ratio of the five qubit case. Each random circuit was generated first as a random Clifford gate, which we truncated to reach the desired cnot count. Next, we inserted Rz gates with random rotation angles and random locations in the circuit. We used Rz gate counts of {5, 10, 15}. Each Rz value for five qubits consists of 450 circuits. This covers a large class of variational quantum eigensolver and quantum approximate optimization algorithm circuits [36].

We achieved an average peak fidelity gain FmFnF_{m}-F_{n} of 3434 percentage points for five-qubit circuits with a cnot gate count of 40, five Rz gates, and six layers of checks, as shown in Fig. 10a. For input circuits with a low cnot count, the fidelity gain is negative because the checks introduce more errors than they eliminate in the post selected state. The average postselection probability is given in Fig. 10b.

Refer to caption
Figure 10: Six layers. (a) The peak average fidelity gain of 34 percentage points occurred at a single-qubit error of approximately 0.00251 (0.0251 two-qubit error) and 40 CNOTs. (b) The probability of postselecting decreases with increasing error rate and increasing cnot count.

We also give in Fig. 11a a plot that breaks down this peak fidelity gain. At the peak fidelity gain, the nonmitigated circuit has about 33% fidelity, and the six-layer mitigated circuit has about 67% fidelity.

Refer to caption
Figure 11: (a) Average fidelity for layers zero to six. At the peak fidelity gain, the nonmitigated circuit has about 33% fidelity, and the six-layer mitigated circuit has about 67% fidelity. (b) Probability of postselecting.

As shown in Fig. 11a, the mitigated circuits perform significantly and consistently better than the unmitigated circuits. Even for lower-layer counts such as two, the average fidelity gain reached 20 percentage points. Fig. 11b gives the corresponding post selection probabilities and demonstrates that we have significant control over the probabilities by changing the number of layers.

Each additional layer increased the average fidelity provided enough circuit depth. We show this in more detail in Fig. 12a, where we fixed the single-qubit error rate to 0.00251 (0.0251 two-qubit error) the value that gave the peak fidelity gain in Fig. 10a.

Refer to caption
Figure 12: (a) Average fidelity gain vs number of layers. (b) Probability of postselecting vs number of layers. The single-qubit error is fixed at approximately 0.00251 (0.0251 two-qubit error)

Circuits with more than six layers may result in even better performance, but the amount of fidelity gained decreases with subsequent layers. Fig. 12b shows the corresponding post selection probabilities and the minimum post selection probability is about 16%.

As the number of Rz (non-Clifford) gates increases, the number of possible low weight C~2\tilde{C}_{2} checks for the efficient PCS protocol decreases, and consequently the fidelity gain decreases. As shown in Fig. 13a, at an Rz gate count of 10, the peak fidelity gain is about 25%.

Refer to caption
Figure 13: Five-qubit circuits with 10 Rz gates. (a) The max fidelity gain is about 25 percentage points. (b) Probability of postselecting vs single-qubit error.

As shown in Fig. 14a, at an Rz gate count of 15, we cannot find six layers of checks for random circuits with 20 cnot gates or higher. Interestingly, as shown in Figs. 10b, 13b and 14b, the post selection curves are relatively unchanged.

Refer to caption
Figure 14: Five-qubit circuits with 15 Rz gates. After 10 cnot gates, we cannot find circuits with six layers.

Using one layer of checks, we have a peak fidelity gain of about 10 percentage points at 40 cnot gates, as shown in Fig. 15a. Fig. 15b shows the corresponding post selection probabilities.

Refer to caption
Figure 15: Five-qubit circuits with 15 Rz gates. (a) At one layer of checks, the peak fidelity gain is about 10 percentage points.

For the ten-qubit case, as shown in Fig. 16a, we achieved a fidelity gain of about ten percentage points. This occurred at a cnot count to qubit ratio of eight, which matches the scenario of the peak fidelity gain in the five-qubit case. The peak fidelity gain occurred at a single-qubit error of about 0.000891 (0.00891 two-qubit error). Fig. 16b shows the corresponding post selection probabilities.

Refer to caption
Figure 16: Ten-qubit circuits with 5 Rz gates. We used a single layer of low weight checks. The 80 cnot count case matches the cnot count to qubit ratio of the five-qubit case with 40 cnot gates. (a) The peak fidelity gain is about ten percentage points. It occurs at a single-qubit error of about 0.000891 (0.00891 two-qubit error)

The preceding simulations focus on using low-weight checks first. We now analyze the performance of high-weight checks. As shown in Figs. 17(a) and 17(b), while the high-weight checks do give a boost in fidelity, they introduce significant amounts of noise compared to the low-weight checks.

Refer to caption
(a) One layer using max weight checks for five compute qubits.
Refer to caption
(b) One layer using max weight checks for ten compute qubits.
Figure 17: Max weight checks. The high-weight checks introduce a lot of noise compared with the low-weight method, as shown in the large negative fidelity at a high single-qubit error rate.

IV Conclusions

The quantum error mitigation technique we have studied in this work is novel because (1) it has an adjustable quantum overhead for any input circuit, (2) by adjusting the number of layers of check operators, the technique allows controlling of the post-selection probability and the error from the error mitigation protocol, (3) the method can be applied repeatedly and at any location in the circuit, and works for arbitrary input states, and (4) in the setting of Theorem 1, we prove that we can achieve unit fidelity provided that we use a sufficient number of layers.

We prove in Theorem 1 that if the error is restricted to the compute qubits (see Fig. 4(a)), there exist checks such that the fidelity for the postselected state reaches unity. We also give a small number of C~2\tilde{C}_{2} checks that reach unit fidelity in this scenario in Propositions 1 and 2.

In Eq. (19), C~2\tilde{C}_{2} is chosen and C~1\tilde{C}_{1} can be directly determined through our finding checks protocol given in Section III-F. This algorithm determines the pairs of checks without matrix multiplication. Instead, we perform lookups of predetermined commutation relations. One limitation of our finding checks protocol is that we are able to find only C~1\tilde{C}_{1} that are in the Pauli group. This limitation does not exist for the general PCS protocol.

The main limitation of the proposed approach is the need to obtain the checks C~1\tilde{C}_{1} and C~2\tilde{C}_{2}, with cost exponential in the number of qubits in the subcircuit. This cost can be reduced to exponential in the number of non-Clifford gates (and only polynomial in the number of qubits) by leveraging the extended stabilizer formalism [37].

The performance of the protocol is tested through extensive numerical simulations on random circuits consisting of 550 Clifford and 1,850 non-Clifford circuits. We used the Clifford simulations to provide intuition that the technique is suitable for deep circuits.

For the non-Clifford circuits, we used five- and ten-qubit circuits. We use the difference between the fidelity of the mitigated circuit and the fidelity of the unmitigated circuit as a figure of merit. Under depolarizing noise, the simulations reached an average fidelity gain of 3434 percentage points for circuits consisting of five qubits, 40 CNOTs, and six low-weight C~2\tilde{C}_{2} checks (see Figs. 10a and 10b). It is possible that more layers will provide further boosts in fidelity. The single-qubit noise ranged from 10510^{-5} to 10110^{-1}. This coincides with current noise levels found in superconducting quantum computers [35].

In [38], the authors derive an error mitigation scheme based on symmetry verification, which they call the spatio-temporal stabilizer (STS) technique. The STS technique shares many similarities with the PCS scheme, as first introduced in [20], and when there is only one pair of checks, STS is the PCS scheme. An important difference is that when there are multiple pairs of checks, layers are allowed to be partly nested in the STS technique. For example, a possible STS execution is layer one and layer two act on the same compute qubits, but layer two begins before layer one has ended and layer one ends before layer two. Since the STS method also allows the standard layering of checks in PCS, our results also hold for the STS technique.

We also note that while the results of this research are presented in the context of quantum computing, the theoretical results hold in general for settings where the user intends to implement an ideal known unitary UU on a quantum state. This follows because we placed no restrictions on the unitary. The performance of the scheme in other settings needs to be investigated. Also, since the protocol places no restriction on the input state, one can apply the mitigation technique on subcircuits and easily combine it with other methods. Splitting a large circuit into subcircuits for finding checks or combining the protocol with other techniques have not been studied. Determining the optimal number of check layers also needs to be further investigated.

Moreover, the best type of checks to use may be non-Pauli in the general PCS protocol. This is likely true given some knowledge of the dominant noise. One potential line of investigation is to use the controls in dynamical decoupling protocols as the C~2\tilde{C}_{2} parity checks [39, 9].

V Data Availability

The data presented in this paper is available online at https://github.com/alvinquantum/noise_mitigation_symmetry.

VI Code Availability

The code used for numerical experiments in this work is available online at https://github.com/alvinquantum/noise_mitigation_symmetry.

Acknowledgments

This research was supported in part by an appointment to the Intelligence Community Postdoctoral Research Fellowship Program at Argonne National Laboratory, administered by Oak Ridge Institute for Science and Education through an interagency agreement between the U.S. Department of Energy and the Office of the Director of National Intelligence. This work was supported in part by the U.S. Department of Energy (DOE), Office of Science, National Quantum Information Science Research Centers, Office of Advanced Scientific Computing Research AIDE-QC and FAR-QC projects, and by the Argonne LDRD program under contract number DE-AC02-06CH11357. Z.S. thanks Q-NEXT for supporting this project. We gratefully acknowledge the computing resources provided on Bebop, a high-performance computing cluster operated by the Laboratory Computing Resource Center at Argonne National Laboratory.

Author Contributions

AG performed the theoretical analysis with feedback from RS and ZS. RS developed the “finding checks quickly" protocol. ZS suggested using the Pauli expansion of Kraus operators in the theory. AG wrote the code, performed the numerical simulations, and wrote the manuscript with input from RS, ZS, and MS. RS structured multiple sections of the manuscript. RS, ZS, and MS developed the numerical experiments with input from AG. RS, MS, AG, and ZS found numerical optimizations.

Competing Interests

The authors declare no competing interests.

Disclaimer

This paper was prepared for information purposes with contributions from the Global Technology Applied Research group of JPMorgan Chase. This paper is not a product of the Research Department of JPMorgan Chase. or its affiliates. Neither JPMorgan Chase nor any of its affiliates make any explicit or implied representation or warranty and none of them accept any liability in connection with this paper, including, but not limited to, the completeness, accuracy, reliability of information contained herein and the potential legal, compliance, tax or accounting effects thereof. This document is not intended as investment research or investment advice, or a recommendation, offer or solicitation for the purchase or sale of any security, financial instrument, financial product or service, or to be used in any way for evaluating the merits of participating in any transaction.

References

  • [1] Emanuel Knill, Raymond Laflamme, and Wojciech H. Zurek. Resilient quantum computation: error models and thresholds. Proceedings of the Royal Society of London. Series A: Mathematical, Physical and Engineering Sciences, 454(1969):365–384, Jan 1998.
  • [2] Dorit Aharonov and Michael Ben-Or. Fault-tolerant quantum computation with constant error rate. arXiv preprint arXiv:quant-ph/9906129, 1999.
  • [3] A. Yu. Kitaev. Quantum Error Correction with Imperfect Gates, pages 181–188. Springer US, Boston, MA, 1997.
  • [4] Kristan Temme, Sergey Bravyi, and Jay M. Gambetta. Error mitigation for short-depth quantum circuits. Physical Review Letters, 119:180509, November 2017.
  • [5] Abhinav Kandala, Kristan Temme, Antonio D. Córcoles, Antonio Mezzacapo, Jerry M. Chow, and Jay M. Gambetta. Error mitigation extends the computational reach of a noisy quantum processor. Nature, 567(7749):491–495, 2019.
  • [6] Tudor Giurgica-Tiron, Yousef Hindy, Ryan LaRose, Andrea Mari, and William J. Zeng. Digital zero noise extrapolation for quantum error mitigation. In 2020 IEEE International Conference on Quantum Computing and Engineering (QCE), pages 306–316, 2020.
  • [7] Lorenza Viola and Seth Lloyd. Dynamical suppression of decoherence in two-state quantum systems. Phys. Rev. A, 58:2733–2744, Oct 1998.
  • [8] Mark S. Byrd and Daniel A. Lidar. Empirical determination of dynamical decoupling operations. Phys. Rev. A, 67:012324, Jan. 2003.
  • [9] Vinay Tripathi, Huo Chen, Mostafa Khezri, Ka-Wa Yip, E. M. Levenson-Falk, and Daniel A. Lidar. Suppression of crosstalk in superconducting qubits using dynamical decoupling. arXiv preprint arXiv:2108.04530, 2021.
  • [10] Benjamin Nachman, Miroslav Urbanek, Wibe A. de Jong, and Christian W. Bauer. Unfolding quantum computer readout noise. npj Quantum Information, 6(1):84, Sept. 2020.
  • [11] X. Bonet-Monroig, R. Sagastizabal, M. Singh, and T. E. O’Brien. Low-cost error mitigation by symmetry verification. Phys. Rev. A, 98:062339, Dec 2018.
  • [12] Sam McArdle, Xiao Yuan, and Simon Benjamin. Error-mitigated digital quantum simulation. Physical Review Letters, 122(18):180501, 2019.
  • [13] Ruslan Shaydulin and Alexey Galda. Error mitigation for deep quantum optimization circuits by leveraging problem symmetries. arXiv preprint arXiv:2106.04410, 2021.
  • [14] Zhenyu Cai. Quantum error mitigation using symmetry expansion. Quantum, 5:548, September 2021.
  • [15] Alberto Peruzzo, Jarrod McClean, Peter Shadbolt, Man-Hong Yung, Xiao-Qi Zhou, Peter J. Love, Alán Aspuru-Guzik, and Jeremy L. O’Brien. A variational eigenvalue solver on a photonic quantum processor. Nature Communications, 5(1):4213, Jul 2014.
  • [16] John Preskill. Quantum Computing in the NISQ era and beyond. Quantum, 2:79, Aug 2018.
  • [17] Jay Gambetta, Ishmael Faro, and Karl Wehden. IBM’s roadmap for building an open quantum software ecosystem. https://research.ibm.com/blog/quantum-development-roadmap, Feb 2021. Accessed: 4-18-2022.
  • [18] Peter Chapman. Scaling IonQ’s quantum computers: The roadmap. https://ionq.com/posts/december-09-2020-scaling-quantum-computer-roadmap, Dec 2020. Accessed: 4-18-2022.
  • [19] Isaac L. Chuang and M. A. Nielsen. Prescription for experimental determination of the dynamics of a quantum black box. Journal of Modern Optics, 44(11-12):2455–2467, Nov 1997.
  • [20] Dripto M. Debroy and Kenneth R. Brown. Extended flag gadgets for low-overhead circuit verification. Physical Review A, 102(5), nov 2020.
  • [21] Rui Chao and Ben W. Reichardt. Quantum error correction with only two extra qubits. Phys. Rev. Lett., 121:050502, Aug. 2018.
  • [22] Huiyang Zhou and Gregory T. Byrd. Quantum circuits for dynamic runtime assertions in quantum computation. IEEE Computer Architecture Letters, 18(2):111–114, jul 2019.
  • [23] Michael A Nielsen and Isaac L Chuang. Quantum Computation and Quantum Information. Cambridge University Press, 2011.
  • [24] E. C. G. Sudarshan, P. M. Mathews, and Jayaseetha Rau. Stochastic dynamics of quantum-mechanical systems. Phys. Rev., 121:920–924, Feb 1961.
  • [25] Man-Duen Choi. Completely positive linear maps on complex matrices. Linear Algebra and its Applications, 10(3):285–290, 1975.
  • [26] K. Kraus, A. Böhm, J.D. Dollard, and W.H. Wootters. States, Effects, and Operations: Fundamental Notions of Quantum Theory. Lecture Notes in Physics. Springer Berlin Heidelberg, 1983.
  • [27] Simon Milz, M. S. Kim, Felix A. Pollock, and Kavan Modi. Completely positive divisibility does not mean Markovianity. Phys. Rev. Lett., 123:040401, July 2019.
  • [28] Tirthak Patel, Abhay Potharaju, Baolin Li, Rohan Basu Roy, and Devesh Tiwari. Experimental evaluation of nisq quantum computers: Error measurement, characterization, and implications. In SC20: International Conference for High Performance Computing, Networking, Storage and Analysis, pages 1–15, 2020.
  • [29] Shuaining Zhang, Yao Lu, Kuan Zhang, Wentao Chen, Ying Li, Jing-Ning Zhang, and Kihwan Kim. Error-mitigated quantum gates exceeding physical fidelities in a trapped-ion system. Nature Communications, 11(1):587, Jan 2020.
  • [30] Zhenyu Cai and Simon C. Benjamin. Constructing smaller Pauli twirling sets for arbitrary error channels. Scientific Reports, 9(1):11281, Aug 2019.
  • [31] Charles H. Bennett, David P. DiVincenzo, John A. Smolin, and William K. Wootters. Mixed-state entanglement and quantum error correction. Phys. Rev. A, 54:3824–3851, Nov 1996.
  • [32] Charles H. Bennett, Gilles Brassard, Sandu Popescu, Benjamin Schumacher, John A. Smolin, and William K. Wootters. Purification of noisy entanglement and faithful teleportation via noisy channels. Phys. Rev. Lett., 76:722–725, Jan 1996.
  • [33] Peter W. Shor. Scheme for reducing decoherence in quantum computer memory. Phys. Rev. A, 52:R2493–R2496, Oct 1995.
  • [34] Ketan N. Patel, Igor L. Markov, and John P. Hayes. Optimal synthesis of linear reversible circuits. Quantum Info. Comput., 8(3):282–294, Mar 2008.
  • [35] Thomas J. Maldonado, Johannes Flick, Stefan Krastanov, and Alexey Galda. Error rate reduction of single-qubit gates via noise-aware decomposition into native gates. Scientific Reports, 12(1):6379, Apr 2022.
  • [36] Edward Farhi, Jeffrey Goldstone, and Sam Gutmann. A quantum approximate optimization algorithm. arXiv preprint arXiv:1411.4028, 2014.
  • [37] Sergey Bravyi, Dan Browne, Padraic Calpin, Earl Campbell, David Gosset, and Mark Howard. Simulation of quantum circuits by low-rank stabilizer decompositions. Quantum, 3:181, September 2019.
  • [38] Yifeng Xiong, Daryus Chandra, Soon Xin Ng, and Lajos Hanzo. Circuit symmetry verification mitigates quantum-domain impairments. arXiv preprint arXiv:2112.13904, 2021.
  • [39] Kaveh Khodjasteh and Daniel A. Lidar. Performance of deterministic dynamical decoupling schemes: Concatenated and periodic pulse sequences. Physical Review A, 75(6), Jun 2007.

The submitted manuscript has been created by UChicago Argonne, LLC, Operator of Argonne National Laboratory (“Argonne”). Argonne, a U.S. Department of Energy Office of Science laboratory, is operated under Contract No. DE-AC02-06CH11357. The U.S. Government retains for itself, and others acting on its behalf, a paid-up nonexclusive, irrevocable worldwide license in said article to reproduce, prepare derivative works, distribute copies to the public, and perform publicly and display publicly, by or on behalf of the Government. The Department of Energy will provide public access to these results of federally sponsored research in accordance with the DOE Public Access Plan. http://energy.gov/downloads/doe-public-access-plan.