Contextual quantum metrology

Jeongwoo Jae Department of Physics, Hanyang University, Seoul, 04763, Republic of Korea Jiwon Lee Department of Physics, Hanyang University, Seoul, 04763, Republic of Korea M. S. Kim QOLS, Blackett Laboratory, Imperial College London, London SW7 2AZ, United Kingdom Kwang-Geol Lee kglee@hanyang.ac.kr Department of Physics, Hanyang University, Seoul, 04763, Republic of Korea Jinhyoung Lee hyoung@hanyang.ac.kr Department of Physics, Hanyang University, Seoul, 04763, Republic of Korea

Abstract

Quantum metrology promises higher precision measurements than classical methods. Entanglement has been identified as one of quantum resources to enhance metrological precision. However, generating entangled states with high fidelity presents considerable challenges, and thus attaining metrological enhancement through entanglement is generally difficult. Here, we show that contextuality of measurement selection can enhance metrological precision, and this enhancement is attainable with a simple linear optical experiment. We call our methodology “contextual quantum metrology” (coQM). Contextuality is a nonclassical property known as a resource for various quantum information processing tasks. Until now, it has remained an open question whether contextuality can be a resource for quantum metrology. We answer this question in the affirmative by showing that the coQM can elevate precision of an optical polarimetry by a factor of $1.4$ to $6.0$ , much higher than the one by quantum Fisher information, known as the limit of conventional quantum metrology. We achieve the contextuality-enabled enhancement with two polarization measurements which are mutually complementary, whereas, in the conventional method, some optimal measurements to achieve the precision limit are either theoretically difficult to find or experimentally infeasible. These results highlight that the contextuality of measurement selection is applicable in practice for quantum metrology.

Introduction

Precision measurement has played a crucial role in the development of natural science and engineering since measurement is a means for observing nature. As a technology for precision measurement, quantum metrology has recently drawn attention with a wide range of applications such as microscopy [1], imaging [2, 3], patterning [4, 5], gravitational wave detection [6, 7, 8], and time keeping [9, 10]. Quantum metrology enables measurements going beyond precision of the standard quantum limit which can be obtained from the most-classical state in quantum physics. One of the resources for the precision enhancement is entanglement, a nonclassical property of quantum states [11, 12, 13, 14]. However, an entangled state can easily lose its property through interaction with other objects, while the interaction is inevitable in metrology. This makes it challenging to generate and manipulate an entangled state. Due to the limitations, it is difficult in practice to attain the entanglement-enabled enhancement of precision. If easy-to-implement resources for metrology are found, the performance of quantum metrology can be greatly enhanced, as well as its practicality. In this work, we demonstrate that contextuality of measurement selection [15], another nonclassical property, is an easy-to-implement resource for quantum metrology.

Specifically, contextuality here refers to dependency of quantum systems on measurement context [16]. Unlike classical predictions, quantum predictions for a given measurement can change depending on whether another measurement is performed simultaneously or not. Bell-Kochen-Specker theorem first showed that quantum physics is contextual [17, 18], and this has been experimentally proved on various quantum systems [19, 20, 21, 22]. Also, it has been revealed that the contextuality can be a resource for quantum information processing tasks such as quantum key distribution [23, 24], universal quantum computing [25], quantum state discrimination [26], and quantum machine learning [27, 28]. Yet, whether the contextuality can be a resource for quantum metrology remains a question.

To demonstrate the precision enhancement from the contextuality of measurement selection, we propose a method which we call contextual quantum metrology (coQM). Unlike conventional quantum metrology, the coQM utilizes two measurement settings and their contextuality. In our experiment, we adopt an optical polarimetry devised to measure concentration of sucrose solution [29], and modify its scheme for the coQM. Our experiment employs two polarization measurements in mutually unbiased (or complementary) bases, and their selection context is implemented by toggling a polarizing beam splitter ‘in’ and ‘out’ from its optical path. Our setup is scalable in that the size of the experiment does not increase along with the increase of the number of probe photons. Also, the enhanced precision is attainable without error correction or mitigation which requires overhead [30, 31]. We finally show that the precision of coQM can go beyond the precision limit of conventional quantum metrology [32, 33, 34] by a factor of $1.4$ to $6.0$ .

Refer to caption — Figure 1: An experimental schematic for coQM. Our probe state is a polarization state of a heralded single-photon source (see Methods). The probe polarization rotates by angle $\alpha lc$ when it propagates through the sucrose solution, where $\alpha\approx 34.1$ deg ml dm^-1 g^-1 is the specific rotation of the sucrose solution, $l=0.1$ dm is traveling length of light in the solution, and $c$ g ml^-1 is the concentration of the solution. We estimate the concentration $c$ by measuring the polarization change. Here, we consider two measurement settings $A$ and $B$ , where $A$ measures polarization in $H/V$ basis and $B$ does in a $D/A$ basis tilted $45^{\circ}$ from the basis of $A$ by half-wave plate (HWP). For triggered event, the probabilities of the polarization bases are determined from relative counts of four avalanche photodiodes (APDs), $D_{00}$ , $D_{01}$ , $D_{10}$ , and $D_{11}$ . When polarizing beam splitter PBS $1$ is ‘out’, the measurement setup corresponds to the $B$ measurement, and the counts on $D_{00}$ and $D_{01}$ determine probabilities of $|{D}\rangle$ and $|{A}\rangle$ , respectively. When the PBS $1$ is ‘in’, the measurement setup corresponds to the consecutive measurement performing $A$ first and $B$ later, and the counts of $D_{00}$ , $D_{01}$ , $D_{10}$ , and $D_{11}$ represent joint probabilities of $|{HD}\rangle$ , $|{HA}\rangle$ , $|{VA}\rangle$ , and $|{VD}\rangle$ , respectively. We combine measuring data to construct operational quasiprobabiltiy (OQ) in Eq. (1), and an estimate is calculated with maximum likelihood estimator of OQ in Eq. (2). The coQM utilizes the context of selecting the measurement $A$ to enhance its precision. Experimental results in Fig. 2 demonstrate the enhancement.

Contextual quantum metrology

Fig. 1 shows an experimental schematic for the coQM. Here, the coQM estimates the concentration of sucrose solution by following four steps: preparing a polarized single photon as a probe light (see Methods), interacting the photon with the sucrose solution, measuring the polarization, and calculating an estimate via maximum likelihood estimator (MLE) using the operational quasiprobability which will be discussed later as in Eq. (1). The photon interacts with the sucrose solution as it propagates through the solution. Afterwards, the photon polarization rotates by angle $\alpha cl$ , where $\alpha$ , $c$ , and $l$ are the specific rotation of the sucrose solution, the concentration of the solution, and traveling length of light in the solution, respectively [29]. We can decide the concentration by measuring the polarization change for a given specific rotation and traveling length. All the procedures here look similar to those of the conventional quantum metrology [35], but the measurement and the estimation are steps that differentiate the coQM from the conventional approach.

In the measurement step, the coQM employs two different polarization measurement settings $A$ and $B$ to utilize the contextuality of measurement selection therefrom [15]. $A$ measures the polarization in a specific basis $\{|{H}\rangle,|{V}\rangle\}$ and $B$ does in a basis $\{|{D}\rangle,|{A}\rangle\}$ tilted $45^{\circ}$ from the basis of $A$ by half-wave plate. Our basic setup is the measurement by $B$ , and we consider a context of whether measurement $A$ is selected to be performed or not, prior to performing $B$ . If $A$ is not selected, the measurement $B$ only is performed, and probability is given by $p(b|B)$ for a binary value $b$ . If $A$ is selected, the experimental setup runs the consecutive measurement performing $A$ first and $B$ later. In this case, probability is given by $p(a,b|AB)$ for an outcome pair $(a,b)$ , where $a$ is a binary outcome of $A$ . In our experiment (Fig. 1), the context of selecting $A$ is implemented by toggling ‘in’ and ’out’ the state of polarizing beam splitter PBS $1$ . Depending on the context, the quantum prediction of measurement $B$ changes, so the two probabilities $p(b|B)$ by $B$ only and $p(b|AB)=\sum_{a}p(a,b|AB)$ by the consecutive differ in general, i.e., $p(b|B)\neq p(b|AB)$ [36]. (We will discuss that this difference stems from the incompatibility of measurements [37].) Our metrology utilizes this effect to enhance precision of the polarimetry.

In the estimation step, we employ a noncontextual (or context-free) model, so-called operational quasiprobability [38, 39], which is given by

\displaystyle w(a,b)

\displaystyle=

\displaystyle p(a,b|AB)+\frac{1}{2}\left(p(b|B)-p(b|AB)\right).

(1)

The context-free condition in our measurement setup is to assume that the prediction of measurement $B$ is invariant under the context of selecting the measurement $A$ . This is called the condition of no-signaling in time represented by $p(b|B)=p(b|AB)$ , $\forall b$ [36]. The crucial property of the operational quasiprobability is that, for the context-free condition, $w$ is reduced to the probability by the consecutive measurement, $w(a,b)=p(a,b|AB)$ , $\forall a,b$ . To the contrary, the quantum predictions violate the condition in general, $w(a,b)\neq p(a,b|AB)$ , and $w(a,b)$ can even be negative-valued [38, 39].

The conventional quantum metrology estimates a physical parameter $\theta$ with an estimator based on a conditional probability $p(x|\theta)$ for a data set $\{x_{i}\}_{i=1}^{N_{s}}$ (see Supplementary Information). In the coQM, the operational quasiprobability plays a role of the conditional probability for the two data sets ${\bf x}_{B}=\{b_{i}\}_{i=1}^{N_{s}}$ and ${\bf x}_{AB}=\{(a_{j},b_{j})\}_{j=1}^{N_{s}}$ . In other words, the coQM calculates an estimate of polarization $\check{\theta}$ with a maximum likelihood estimator given by

\check{\theta}~{}~{}\text{s.t.}~{}~{}\partial_{\theta}l_{W}(\theta|{\bf x}_{B},{\bf x}_{AB})=0,

(2)

where $l_{W}(\theta|{\bf x}_{B},{\bf x}_{AB})$ is a log-likelihood function for $w$ (see Methods). The possible problem caused by this replacement is that $w$ can be negative unlike the conditional probability, so that the log-likelihood function diverges. However, we find that $w$ remains positive for some range of parameters. We focus on the case for $w$ to be applicable to the log-likelihood function without the divergence problem. Finally, for an initial polarization $\theta_{0}$ , we derive the estimate of concentration $\check{c}$ from the polarization change as $\check{c}=(\check{\theta}-\theta_{0})/\alpha l$ .

Results

Our goal is to demonstrate outperformance of the coQM over the conventional quantum metrology. We employ error of estimate $\Delta\theta$ , the standard deviation that the estimate differs from the actual value, to quantify the performance of estimation. The smaller the error is, the more precise the estimate is.

As a baseline of performance, we take the conventional quantum metrology and its error given by quantum Fisher information (QFI) $F_{q}$ , $\Delta\theta_{q}={1}/{\sqrt{N_{s}F_{q}}}$ , where $N_{s}$ is the number of samples. This is known as the lower bound of error in the conventional method. In the coQM, we propose contextual Fisher information (coFI) to quantify the performance of the coQM,

\displaystyle F_{\mathrm{co}}:=\sum_{ab}w(a,b|\theta)\left(\frac{\partial\log w(a,b|\theta)}{\partial\theta}\right)^{2}.

(3)

In the asymptotic limit of $N_{s}\rightarrow\infty$ , the error of the coQM $\Delta\theta_{\mathrm{co}}$ approaches to $1/\sqrt{N_{s}F_{\mathrm{co}}}$ (see Supplementary Information for the asymptotic property and estimator of coFI).

Contextuality-enabled enhancement The coQM gains precision enhancement over the conventional quantum metrology if

\displaystyle\Delta\theta_{\mathrm{co}}<\frac{\Delta\theta_{q}}{\sqrt{2}}.

(4)

Our method uses the two data sets ${\bf x}_{B}$ and ${\bf x}_{AB}$ . If each data set collects $N_{s}$ samples, the total number of samples is $2N_{s}$ in our method. Reduction factor $\sqrt{2}$ in the error by the conventional is introduced, assuming the conventional takes $2N_{s}$ samples (which is equivalent to comparing $F_{\mathrm{co}}$ to $2F_{q}$ ).

We here suggest specific cases satisfying the criterion (4). Instead of a theoretical proof, we briefly summarize the theory behind the enhancement of precision by following arguments: For the noncontextual model, the operational quasiprobability $w(a,b|\theta)$ becomes the joint probability of the consecutive measurement $p(a,b|AB)$ . In this case, the coQM is reduced and equivalent to the conventional quantum metrology using the consecutive measurement so that the $\Delta\theta_{\mathrm{co}}$ equals or larger than $\Delta\theta_{q}$ (see Supplementary Information). For the contextual model, conversely, $\Delta\theta_{\mathrm{co}}$ can be smaller than $\Delta\theta_{q}$ (see Ref. [40] for rigorous discussions). Fig. 2 a shows simulation results of the contextuality-enabled enhancement on the Bloch sphere.

We perform polarization estimation of $\theta$ with probe states prepared in $|{\psi}\rangle=\cos(\theta/2)|{H}\rangle+e^{i\phi}\sin(\theta/2)|{V}\rangle$ for $0.46\pi\leq\theta\leq 0.55\pi$ and $\phi=0.15\pi$ . For the probe states, the operational quasiprobability is given by $w(a,b|\theta)=\left(1+(-1)^{a}\cos\theta+(-1)^{b}\sin\theta\cos\phi\right)/4$ . We draw $N_{s}=10^{5}$ samples for each data set to construct the operational quasiprobability, and calculate an estimate with the estimator (2). For polarization estimation of $\theta$ , QFI $F_{q}=1$ , so the coQM gains the contextuality-enabled enhancement if $\Delta\theta_{\mathrm{co}}<\Delta\theta_{q}/\sqrt{2}=1/\sqrt{2N_{s}}\approx 2.24\times 10^{-3}$ . The errors of the coQM are smaller than the error limit of the conventional quantum metrology for the whole selected range of $\theta$ (Fig. 2 $\bf b$ ). The worst case in our results has $\Delta\theta_{\mathrm{co}}\approx 1.53\times 10^{-3}$ around $\theta=\pi/2$ , and the best case has $\Delta\theta_{\mathrm{co}}\approx 3.7\times 10^{-4}$ around each end of the range of $\theta$ . This demonstrates that our method elevates the precision of polarimetry by factor of $1.4$ to $6.0$ from the limit of conventional quantum metrology.

We estimate sucrose solutions of three different concentrations $c=0.1$ , $0.3$ and $0.5$ g ml^-1. We prepare the probe state with initial parameters $\theta_{0}=0.5\pi$ and $\phi=0.15\pi$ . For each concentration, we repeat the estimation $10$ times. The results (Fig. 2 d) show that the errors of estimates by the coQM, $\Delta c_{\mathrm{co}}$ , are smaller than the minimum error by the conventional quantum metrology ( $\approx 5.9\times 10^{-2}$ ); For $c=0.1$ , $0.3$ and $0.5$ g ml^-1, the mean errors are $\approx 3.7\times 10^{-2}$ , $\approx 3.3\times 10^{-2}$ , and $\approx 2.8\times 10^{-2}$ , respectively. This illustrates that the coQM exceeds the conventional quantum metrology by a wide margin.

Measurement Incompatibility

The contextuality of measurement selection stems from the incompatibility of quantum measurements [37]. In the scenario of the consecutive measurement, if the two measurements $A$ and $B$ commute, the consecutive measurement is de facto a single measurement; the probabilities $p(b|B)$ and $p(b|AB)$ are equal and the prediction for $B$ is noncontextual. Otherwise, the prediction for the measurement $B$ depends on whether performing the first measurement $A$ and it is contextual except for a case when the initial state is prepared in an eigenstate of the $A$ . Thus, the measurement incompatibility is necessary for the contextuality of measurement selection.

The noncommutation of observable operators defines the incompatibility among measurements, represented by projection-valued measures (PVM), which we assume in the present work. The notion of incompatibility needs to be generalized if the representation of measurement is generalized to positive operator-valued measure (POVM). This generalization is required, for example, if one considers an open quantum system in a noisy environment. Non-joint measurability (non-JM) is one of the generalizations [37]. The non-JM plays an important role in a contextuality [41, 42], as does the noncommutativity [43]. In fact, non-JM and the contextuality of measurement selection are also closely related as the negativity of operational quasiprobability is the necessary and sufficient condition for non-JM [44, 40].

Recently, there were studies in a similar vein to the present work [45, 46], showing that the noncommutativity can be a resource for quantum metrology. However, their schemes employ a post-selection to discard unwanted measurement outcomes, so there is a tradeoff between success probability and Fisher information; success probability becomes small if Fisher information is large [47]. Quantum post-selected metrology such as weak value amplification methods share this matter [48, 49]. On the contrary, our method utilizes all of measurement outcomes for the estimation [38, 39], implying that the coQM is free from such tradeoff.

Outlook

This work demonstrates that utilizing the contextuality of measurement selection can enhance the precision measurement. The experiment attains precision beyond the limit of conventional quantum metrology [33, 34]. The coQM has advantages over the conventional method (see Supplementary Information): it can enhance the precision without optimizing the measurements if they are incompatible, and it works even without any entangled state of probe that has been regarded difficult to generate and manipulate. We use the heralded single-photon source to clearly show the performance of coQM per a unit particle of probe. We expect that a multi-photon source can also be adopted for the coQM with the similar settings of measurements [15]. Our method is expected to be applicable to a quantum sensor [50] if the context of measurement selection can be implemented within the sensor’s system. In addition, the approaches employed to demonstrate the contextuality-enabled enhancements can be utilized to characterize quantum devices (see Supplementary Information), which is a fundamental task required to implement quantum technologies.

References

Casacio et al. [2021] C. A. Casacio, L. S. Madsen, A. Terrasson, M. Waleed, K. Barnscheidt, B. Hage, M. A. Taylor, and W. P. Bowen, Quantum-enhanced nonlinear microscopy, Nature 594, 201 (2021).
Treps et al. [2002] N. Treps, U. Andersen, B. Buchler, P. K. Lam, A. Maître, H.-A. Bachor, and C. Fabre, Surpassing the standard quantum limit for optical imaging using nonclassical multimode light, Phys. Rev. Lett. 88, 203601 (2002).
Brida et al. [2010] G. Brida, M. Genovese, and I. Ruo Berchera, Experimental realization of sub-shot-noise quantum imaging, Nature Photonics 4, 227 (2010).
Boto et al. [2000] A. N. Boto, P. Kok, D. S. Abrams, S. L. Braunstein, C. P. Williams, and J. P. Dowling, Quantum interferometric optical lithography: Exploiting entanglement to beat the diffraction limit, Phys. Rev. Lett. 85, 2733 (2000).
Parniak et al. [2018] M. Parniak, S. Borówka, K. Boroszko, W. Wasilewski, K. Banaszek, and R. Demkowicz-Dobrzański, Beating the rayleigh limit using two-photon interference, Phys. Rev. Lett. 121, 250503 (2018).
Abramovici et al. [1992] A. Abramovici, W. E. Althouse, R. W. P. Drever, Y. Gürsel, S. Kawamura, F. J. Raab, D. Shoemaker, L. Sievers, R. E. Spero, K. S. Thorne, R. E. Vogt, R. Weiss, S. E. Whitcomb, and M. E. Zucker, Ligo: The laser interferometer gravitational-wave observatory, Science 256, 325 (1992).
Collaboration [2011] T. L. S. Collaboration, A gravitational wave observatory operating beyond the quantum shot-noise limit, Nature Physics 7, 962 (2011).
Aasi et al. [2013] J. Aasi, J. Abadie, B. P. Abbott, R. Abbott, T. D. Abbott, M. R. Abernathy, C. Adams, T. Adams, P. Addesso, R. X. Adhikari, C. Affeldt, O. D. Aguiar, P. Ajith, B. Allen, E. Amador Ceron, D. Amariutei, S. B. Anderson, W. G. Anderson, K. Arai, M. C. Araya, C. Arceneaux, S. Ast, S. M. Aston, D. Atkinson, P. Aufmuth, C. Aulbert, L. Austin, B. E. Aylott, S. Babak, P. T. Baker, S. Ballmer, Y. Bao, J. C. Barayoga, D. Barker, B. Barr, L. Barsotti, M. A. Barton, I. Bartos, R. Bassiri, J. Batch, J. Bauchrowitz, B. Behnke, A. S. Bell, C. Bell, G. Bergmann, J. M. Berliner, A. Bertolini, J. Betzwieser, N. Beveridge, P. T. Beyersdorf, T. Bhadbhade, I. A. Bilenko, G. Billingsley, J. Birch, S. Biscans, E. Black, J. K. Blackburn, L. Blackburn, D. Blair, B. Bland, O. Bock, T. P. Bodiya, C. Bogan, C. Bond, R. Bork, M. Born, S. Bose, J. Bowers, P. R. Brady, V. B. Braginsky, J. E. Brau, J. Breyer, D. O. Bridges, M. Brinkmann, M. Britzger, A. F. Brooks, D. A. Brown, D. D. Brown, K. Buckland, F. Brückner, B. C. Buchler, A. Buonanno, J. Burguet-Castell, R. L. Byer, L. Cadonati, J. B. Camp, P. Campsie, K. Cannon, J. Cao, C. D. Capano, L. Carbone, S. Caride, A. D. Castiglia, S. Caudill, M. Cavaglià, C. Cepeda, T. Chalermsongsak, S. Chao, P. Charlton, X. Chen, Y. Chen, H.-S. Cho, J. H. Chow, N. Christensen, Q. Chu, S. S. Y. Chua, C. T. Y. Chung, G. Ciani, F. Clara, D. E. Clark, J. A. Clark, M. Constancio Junior, D. Cook, T. R. Corbitt, M. Cordier, N. Cornish, A. Corsi, C. A. Costa, M. W. Coughlin, S. Countryman, P. Couvares, D. M. Coward, M. Cowart, D. C. Coyne, K. Craig, J. D. E. Creighton, T. D. Creighton, A. Cumming, L. Cunningham, K. Dahl, M. Damjanic, S. L. Danilishin, K. Danzmann, B. Daudert, H. Daveloza, G. S. Davies, E. J. Daw, T. Dayanga, E. Deleeuw, T. Denker, T. Dent, V. Dergachev, R. DeRosa, R. DeSalvo, S. Dhurandhar, I. Di Palma, M. Díaz, A. Dietz, F. Donovan, K. L. Dooley, S. Doravari, S. Drasco, R. W. P. Drever, J. C. Driggers, Z. Du, J.-C. Dumas, S. Dwyer, T. Eberle, M. Edwards, A. Effler, P. Ehrens, S. S. Eikenberry, R. Engel, R. Essick, T. Etzel, K. Evans, M. Evans, T. Evans, M. Factourovich, S. Fairhurst, Q. Fang, B. F. Farr, W. Farr, M. Favata, D. Fazi, H. Fehrmann, D. Feldbaum, L. S. Finn, R. P. Fisher, S. Foley, E. Forsi, N. Fotopoulos, M. Frede, M. A. Frei, Z. Frei, A. Freise, R. Frey, T. T. Fricke, D. Friedrich, P. Fritschel, V. V. Frolov, M.-K. Fujimoto, P. J. Fulda, M. Fyffe, J. Gair, J. Garcia, N. Gehrels, G. Gelencser, L. Á. Gergely, S. Ghosh, J. A. Giaime, S. Giampanis, K. D. Giardina, S. Gil-Casanova, C. Gill, J. Gleason, E. Goetz, G. González, N. Gordon, M. L. Gorodetsky, S. Gossan, S. Goßler, C. Graef, P. B. Graff, A. Grant, S. Gras, C. Gray, R. J. S. Greenhalgh, A. M. Gretarsson, C. Griffo, H. Grote, K. Grover, S. Grunewald, C. Guido, E. K. Gustafson, R. Gustafson, D. Hammer, G. Hammond, J. Hanks, C. Hanna, J. Hanson, K. Haris, J. Harms, G. M. Harry, I. W. Harry, E. D. Harstad, M. T. Hartman, K. Haughian, K. Hayama, J. Heefner, M. C. Heintze, M. A. Hendry, I. S. Heng, A. W. Heptonstall, M. Heurs, M. Hewitson, S. Hild, D. Hoak, K. A. Hodge, K. Holt, M. Holtrop, T. Hong, S. Hooper, J. Hough, E. J. Howell, V. Huang, E. A. Huerta, B. Hughey, S. H. Huttner, M. Huynh, T. Huynh-Dinh, D. R. Ingram, R. Inta, T. Isogai, A. Ivanov, B. R. Iyer, K. Izumi, M. Jacobson, E. James, H. Jang, Y. J. Jang, E. Jesse, W. W. Johnson, D. Jones, D. I. Jones, R. Jones, L. Ju, P. Kalmus, V. Kalogera, S. Kandhasamy, G. Kang, J. B. Kanner, R. Kasturi, E. Katsavounidis, W. Katzman, H. Kaufer, K. Kawabe, S. Kawamura, F. Kawazoe, D. Keitel, D. B. Kelley, W. Kells, D. G. Keppel, A. Khalaidovski, F. Y. Khalili, E. A. Khazanov, B. K. Kim, C. Kim, K. Kim, and N. Kim, Enhanced sensitivity of the ligo gravitational wave detector by using squeezed states of light, Nature Photonics 7, 613 (2013).
Giovannetti et al. [2001] V. Giovannetti, S. Lloyd, and L. Maccone, Quantum-enhanced positioning and clock synchronization, Nature 412, 417 (2001).
Pedrozo-Peñafiel et al. [2020] E. Pedrozo-Peñafiel, S. Colombo, C. Shu, A. F. Adiyatullin, Z. Li, E. Mendez, B. Braverman, A. Kawasaki, D. Akamatsu, Y. Xiao, and V. Vuletić, Entanglement on an optical atomic-clock transition, Nature 588, 414 (2020).
Giovannetti et al. [2004] V. Giovannetti, S. Lloyd, and L. Maccone, Quantum-enhanced measurements: Beating the standard quantum limit, Science 306, 1330 (2004), https://www.science.org/doi/pdf/10.1126/science.1104149 .
Giovannetti et al. [2006] V. Giovannetti, S. Lloyd, and L. Maccone, Quantum metrology, Phys. Rev. Lett. 96, 010401 (2006).
Giovannetti et al. [2011] V. Giovannetti, S. Lloyd, and L. Maccone, Advances in quantum metrology, Nature Photonics 5, 222 (2011).
Tan and Jeong [2019] K. C. Tan and H. Jeong, Nonclassical light and metrological power: An introductory review, AVS Quantum Science 1, 014701 (2019).
Ryu et al. [2019] J. Ryu, S. Hong, J.-S. Lee, K. H. Seol, J. Jae, J. Lim, J. Lee, K.-G. Lee, and J. Lee, Optical experiment to test negative probability in context of quantum-measurement selection, Scientific Reports 9, 10.1038/s41598-019-53121-5 (2019).
Spekkens [2005] R. W. Spekkens, Contextuality for preparations, transformations, and unsharp measurements, Phys. Rev. A 71, 052108 (2005).
Bell [1964] J. S. Bell, On the Einstein-Podolsky-Rosen paradox, Physics 1, 195 (1964).
Kochen and Specker [1967] S. Kochen and E. P. Specker, The problem of hidden variables in quantum mechanics, Journal of Mathematics and Mechanics 17, 59 (1967).
Hasegawa et al. [2006] Y. Hasegawa, R. Loidl, G. Badurek, M. Baron, and H. Rauch, Quantum contextuality in a single-neutron optical experiment, Phys. Rev. Lett. 97, 230401 (2006).
Kirchmair et al. [2009] G. Kirchmair, F. Zähringer, R. Gerritsma, M. Kleinmann, O. Gühne, A. Cabello, R. Blatt, and C. F. Roos, State-independent experimental test of quantum contextuality, Nature 460, 494 (2009).
Jerger et al. [2016] M. Jerger, Y. Reshitnyk, M. Oppliger, A. Potočnik, M. Mondal, A. Wallraff, K. Goodenough, S. Wehner, K. Juliusson, N. K. Langford, and A. Fedorov, Contextuality without nonlocality in a superconducting quantum system, Nature Communications 7, 12930 (2016).
Zhang et al. [2019] A. Zhang, H. Xu, J. Xie, H. Zhang, B. J. Smith, M. S. Kim, and L. Zhang, Experimental test of contextuality in quantum and classical systems, Phys. Rev. Lett. 122, 080401 (2019).
Acín et al. [2007] A. Acín, N. Brunner, N. Gisin, S. Massar, S. Pironio, and V. Scarani, Device-independent security of quantum cryptography against collective attacks, Phys. Rev. Lett. 98, 230501 (2007).
Reichardt et al. [2013] B. W. Reichardt, F. Unger, and U. Vazirani, Classical command of quantum systems, Nature 496, 456 (2013).
Howard et al. [2014] M. Howard, J. Wallman, V. Veitch, and J. Emerson, Contextuality supplies the ‘magic’ for quantum computation, Nature 510, 351 (2014).
Schmid and Spekkens [2018] D. Schmid and R. W. Spekkens, Contextual advantage for state discrimination, Phys. Rev. X 8, 011015 (2018).
Gao et al. [2022] X. Gao, E. R. Anschuetz, S.-T. Wang, J. I. Cirac, and M. D. Lukin, Enhancing generative models via quantum correlations, Phys. Rev. X 12, 021037 (2022).
Anschuetz et al. [2023] E. R. Anschuetz, H.-Y. Hu, J.-L. Huang, and X. Gao, Interpretable quantum advantage in neural sequence learning, PRX Quantum 4, 020338 (2023).
Yoon et al. [2020] S.-J. Yoon, J.-S. Lee, C. Rockstuhl, C. Lee, and K.-G. Lee, Experimental quantum polarimetry using heralded single photons, Metrologia 57, 045008 (2020).
Zhou et al. [2018] S. Zhou, M. Zhang, J. Preskill, and L. Jiang, Achieving the heisenberg limit in quantum metrology using quantum error correction, Nature Communications 9, 78 (2018).
Maciejewski et al. [2020] F. B. Maciejewski, Z. Zimborás, and M. Oszmaniec, Mitigation of readout noise in near-term quantum devices by classical post-processing based on detector tomography, Quantum 4, 257 (2020).
Helstrom [1969] C. W. Helstrom, Quantum detection and estimation theory, Journal of Statistical Physics 1, 231 (1969).
Holevo [2011] A. S. Holevo, Probabilistic and statistical aspects of quantum theory, Vol. 1 (Springer Science & Business Media, 2011).
Braunstein and Caves [1994] S. L. Braunstein and C. M. Caves, Statistical distance and the geometry of quantum states, Phys. Rev. Lett. 72, 3439 (1994).
Liu et al. [2019] J. Liu, H. Yuan, X.-M. Lu, and X. Wang, Quantum fisher information matrix and multiparameter estimation, Journal of Physics A: Mathematical and Theoretical 53, 023001 (2019).
Leggett and Garg [1985] A. J. Leggett and A. Garg, Quantum mechanics versus macroscopic realism: Is the flux there when nobody looks?, Phys. Rev. Lett. 54, 857 (1985).
Busch [1986] P. Busch, Unsharp reality and joint measurements for spin observables, Phys. Rev. D 33, 2253 (1986).
Ryu et al. [2013] J. Ryu, J. Lim, S. Hong, and J. Lee, Operational quasiprobabilities for qudits, Physical Review A 88, 052123 (2013).
Jae et al. [2017] J. Jae, J. Ryu, and J. Lee, Operational quasiprobabilities for continuous variables, Phys. Rev. A 96, 042121 (2017).
Jae et al. [2023] J. Jae, J. Lee, K.-G. Lee, M. S. Kim, and J. Lee, Metrological power of incompatible measurements (2023), arXiv:2311.11785 [quant-ph] .
Tavakoli and Uola [2020] A. Tavakoli and R. Uola, Measurement incompatibility and steering are necessary and sufficient for operational contextuality, Phys. Rev. Res. 2, 013011 (2020).
Gühne et al. [2023] O. Gühne, E. Haapasalo, T. Kraft, J.-P. Pellonpää, and R. Uola, Colloquium: Incompatible measurements in quantum information science, Rev. Mod. Phys. 95, 011003 (2023).
Budroni et al. [2022] C. Budroni, A. Cabello, O. Gühne, M. Kleinmann, and J.-A. Larsson, Kochen-specker contextuality, Rev. Mod. Phys. 94, 045007 (2022).
Jae et al. [2019] J. Jae, K. Baek, J. Ryu, and J. Lee, Necessary and sufficient condition for joint measurability, Phys. Rev. A 100, 032113 (2019).
Arvidsson-Shukur et al. [2020] D. R. M. Arvidsson-Shukur, N. Yunger Halpern, H. V. Lepage, A. A. Lasek, C. H. W. Barnes, and S. Lloyd, Quantum advantage in postselected metrology, Nature Communications 11, 3775 (2020).
Lupu-Gladstein et al. [2022] N. Lupu-Gladstein, Y. B. Yilmaz, D. R. M. Arvidsson-Shukur, A. Brodutch, A. O. T. Pang, A. M. Steinberg, and N. Y. Halpern, Negative quasiprobabilities enhance phase estimation in quantum-optics experiment, Phys. Rev. Lett. 128, 220504 (2022).
Combes et al. [2014] J. Combes, C. Ferrie, Z. Jiang, and C. M. Caves, Quantum limits on postselected, probabilistic quantum metrology, Phys. Rev. A 89, 052117 (2014).
Ferrie and Combes [2014] C. Ferrie and J. Combes, Weak value amplification is suboptimal for estimation and detection, Phys. Rev. Lett. 112, 040406 (2014).
Knee and Gauger [2014] G. C. Knee and E. M. Gauger, When amplification with weak values fails to suppress technical noise, Phys. Rev. X 4, 011032 (2014).
Degen et al. [2017] C. L. Degen, F. Reinhard, and P. Cappellaro, Quantum sensing, Rev. Mod. Phys. 89, 035002 (2017).
Cramér [1946] H. Cramér, Mathematical methods of statistics (Princeton University Press, 1946).
C.R. [1992] R. C.R., Information and the accuracy attainable in the estimation of statistical parameters. In breakthroughs in statistics. (Springer Series in Statistics (Perspectives in Statistics). Springer, New York, NY., 1992).
Lehmann and Casella [2006] E. L. Lehmann and G. Casella, Theory of point estimation (Springer Science & Business Media, 2006).
Efron and Hinkley [1978] B. Efron and D. V. Hinkley, Assessing the accuracy of the maximum likelihood estimator: Observed versus expected fisher information, Biometrika 65, 457 (1978).
Lindsay and Li [1997] B. G. Lindsay and B. Li, On second-order optimality of the observed fisher information, The Annals of Statistics 25, 2172 (1997).
Yu et al. [2010] S. Yu, N.-l. Liu, L. Li, and C. H. Oh, Joint measurement of two unsharp observables of a qubit, Phys. Rev. A 81, 062116 (2010).

Methods

M1 Heralded single photons

We generate the heralded single photon as following. High energy pump photons ( $p=405.7$ nm) from a continuous wave (CW) single mode laser (TOPMODE $405$ , TOPTICA) are sent to a periodically poled KTP (PPKTP) crystal. PPKTP splits the input photons into photon pairs (signal and idler photons) through type-II spontaneous parametric down conversion (SPDC) process. The polarizations of signal and idler photons are orthogonal each other, so that polarizing beam splitter (PBS) can separate them into two different optical paths. The idler photon is sent to an avalanche photodiode (APD) for triggering. The signal photon is sent to one of the four APDs (SPCM-QC $4$ , Perkin Elmer). If the trigger APD is clicked, we count clicks on the four APDs. We control the count rate of the trigger APD to be $~{}2\times 10^{5}$ cps to sufficiently suppress multi photon events, i.e., $|{\text{SPDC}}\rangle\approx|{00}\rangle+\epsilon|{11}\rangle$ for $\epsilon\ll 1$ . The click signals are post-processed by a field programmable gate array (FPGA) with the time bin size of $25$ ns.

M2 Input state preparation

We prepare an initial probe state by using a series of three wave plates (Fig. 1), one of half-wave plate (HWP) and two of quarter-wave plates (QWP). After passing QWP₁, HWP, and QWP₂ sequentially, a horizontally polarized state $|{H}\rangle$ becomes an initial state

	$\displaystyle\|{\psi}\rangle_{\text{in}}$	$\displaystyle=$	$\displaystyle\text{QWP}_{2}\left(\frac{\pi}{4}\right)\text{HWP}\left(p\right)\text{QWP}_{1}\left(q\right)\|{H}\rangle$		(M1)
		$\displaystyle=$	$\displaystyle e^{i\left(-2p+q+\pi/4\right)}\begin{pmatrix}\cos\left(\frac{\pi}{4}-q\right)\\ e^{i\left(4p-2q-\frac{\pi}{2}\right)}\sin\left(\frac{\pi}{4}-q\right)\end{pmatrix},$		(M1)

where $p$ $(q)$ is the angle of the fast axis of the half (quarter)-wave plate from the horizontal axis. The $q$ value of QWP₂ is fixed at $\pi/4$ . By adjusting the control parameters $p$ and $q$ to satisfy $\theta=\pi/2-2q$ and $\phi=4p-2q-\pi/2$ , we finally obtain the parameterized state $|{\psi}\rangle_{\text{in}}=\cos(\theta/2)|{H}\rangle+e^{i\phi}\sin(\theta/2)|{V}\rangle$ .

M3 Maximum likelihood estimator using operational quasiprobability

Maximum likelihood estimation is a method to find a parameter of a probability model which best describes observed data. This method assumes a likelihood function of the model, and maximizes the function to determine the most likely value in the parameter space as an estimate. In this work, we take the operational quasiprobability as the model depending on the phase $\theta$ .

For the two data sets $\mathbf{x}_{B}$ and $\mathbf{x}_{AB}$ , we define the log-likelihood function as

\displaystyle l_{W}(\theta|\mathbf{x}_{B},\mathbf{x}_{AB})

\displaystyle:=\frac{1}{N_{s}}\sum_{a,b=0}^{1}N_{W}(a,b)\log w(a,b|\theta),

(M2)

where $N_{W}(a,b)=N_{AB}(a,b)+\left(N_{B}(b)-N_{AB}(b)\right)/2$ . $N_{B}(b)$ is the number of counts for outcome $b$ in the data set $\mathbf{x}_{B}$ , and $N_{AB}(a,b)$ is the number of counts for outcome pair $(a,b)$ in the data set $\mathbf{x}_{AB}$ . $N_{AB}(b)$ is obtained by the marginal number of counts as $N_{AB}(b)=\sum_{a}N_{AB}(a,b)$ . For a small number of samples, $N_{w}(a,b)$ can be negative by statistical fluctuations. We test whether the number count $N_{W}$ is positive, and neglect cases where the count is negative.

In a broader sense, the coQM proposes an approach of integrating the two different ensembles for single-parameter estimation. To show that our estimator $\check{\theta}$ is unbiased and error of the estimate achieves Cramér-Rao bound [51, 52], we propose a theory that describes the operational quasiprobability as an ensemble mixture model (see Supplementary Information).

Acknowledgements.

MSK acknowledges the EPSRC grant (EP/T00097X/1) and AppQInfo MSCA ITN from the European Unions Horizon 2020. KGL was supported by the National Research Foundation of Korea (NRF) grant funded by the Korea government (MSIT) (No. 2023M3K5A109481311) and Institute of Information and Communications Technology Planning & Evaluation (IITP) grant funded by the Korea government (MSIT) (No. 2022-0-01026). JL was supported by the National Research Foundation of Korea (NRF) grant funded by the Korea government (MSIT) (No. 2022M3E4A1077369).

Author contributions

Jeongwoo Jae, M. S. Kim, and Jinhyoung Lee contributed to the theoretical formulation of the contextual quantum metrology. Jiwon Lee and Kwang-Geol Lee conducted the optical experiments and the data analysis. All authors contributed to discussions in this work. Jeongwoo Jae wrote the manuscript with assistance of other authors. Jeongwoo Jae and Jiwon Lee contributed equally to this work.

Competing interests

The authors declare no competing interests.

Supplementary Information

Supplementary Information is available for this paper.

Supplementary Information for contextual quantum metrology

We provide Supplementary Information (SI) including details of the present work: contextual quantum metrology. SI is organized into three parts. First, we briefly review the conventional quantum estimation theory. Second, we recall asymptotic normality of maximum likelihood estimator (MLE), and suggest ensemble mixing theory to prove that MLE using operational quasiprobability has the asymptotic normality. Based on the proof, we suggest an estimator of the estimation error. By using this estimator, we experimentally show that there is the contextuality-enabled enhancement also for estimation of the phase $\phi$ . Third, we characterize systematic errors, and investigate effects of noise caused by depolarization of probe state on the performance of contextual quantum metrology.

S1 quantum estimation theory

Estimation or metrology theory employs two models. One is a conditional probability function $p({\mathbf{x}}|\theta)$ of an observing data set $\mathbf{x}$ , conditioned on parameter $\theta$ to estimate. The other is an estimator $T({\mathbf{x}})$ , which estimates the value of parameter $\theta$ , based on the observation. Both types of models are verified whether appropriate for experiments. To find the two types of models with small estimation errors is at the heart of estimation. Once a conditional probability function is assumed, the estimator determines the estimation error.

The estimation error $\Delta\theta$ is known to have a lower bound over all estimators,

\displaystyle\Delta\theta\geq\frac{1}{\sqrt{N_{s}F}}.

(S1)

This inequality holds when samples are drawn from i.i.d. The lower bound is called Cramér-Rao bound (CRB). CRB contains Fisher information $F$ , which is defined by the conditional probability and its dependence of parameter $\theta$ :

\displaystyle F=F(\theta)=\sum_{x}p(x|\theta)\left(\frac{\partial\ln p(x|\theta)}{\partial\theta}\right)^{2}.

(S2)

Note that Fisher information is positive semidefinite. If the conditional probability is parameter-independent, $F$ has the minimum of zero, $F=0$ . If it is highly dependent, $F$ is large. As $F$ increases, CRB decreases and results in more precise estimation. An estimator attaining CRB is said optimal. It is known that a maximum likelihood estimator is (approximately) an optimal estimator in limit of large samples [53].

In quantum theory, the conditional probability $p(x|\theta)$ is decomposed into a quantum state $\hat{\varrho}$ and a measurement of positive operator-valued measure (POVM) $\{\hat{m}(x)\}$ . Assuming the quantum state is a functional of parameter $\theta$ , the conditional probability is given by $p(x|\theta)=\text{Tr}\left(\hat{m}(x)\hat{\varrho}(\theta)\right)$ . In this case, CRB can be further optimized over measurements to quantum CRB (QCRB). QCRB contains quantum Fisher information (QFI) defined by

F_{q}=\text{Tr}(\hat{L}^{2}\hat{\varrho}(\theta)),

(S3)

where $\hat{L}:=({\hat{L}\hat{\varrho}(\theta)+\hat{\varrho}(\theta)\hat{L}})/{2}$ is the symmetric logarithmic operator to define derivative of a quantum state $\hat{\varrho}(\theta)$ [34]. For a given probe state, QFI is the maximum value of Fisher information over all measurements, so it is never less than Fisher information $F$ , i.e., $F_{q}\geq F$ . Equivalently, the error given by quantum Fisher information is never larger than the error given by Fisher information. In that sense, the error given by QFI is the minimum error of quantum metrology using measurements which can be represented by a POVM.

S2 Maximum likelihood estimator using operational quasiprobability

S2-1 Maximum likelihood estimator

We recall the significant properties of maximum likelihood estimator (MLE) to show asymptotic normality of MLE using operational quasiprobability (MLEOQ). The MLE estimates a parameter $\theta$ by using a data set ${\bf x}=\{x_{1},x_{2},\cdots,x_{N}\}$ drawn from a single ensemble $X$ represented by a probability $p(x|X,\theta)$ . An estimate $\check{\theta}$ is a value that gives an extremum of log-likelihood function, which is defined by

l_{X}(\theta|{\bf x}):=\frac{1}{N}\sum_{x_{i}\in{\bf x}}\log p(x_{i}|W,\theta).

(S4)

The log-likelihood function satisfies following three conditions in the asymptotic limit of $N\rightarrow\infty$ [53]:

	$\displaystyle(\text{C}.1)$		$\displaystyle~{}~{}-\partial^{2}_{\theta}l_{X}(\theta\|{\bf x})=F_{X}(\theta)+{\cal O}\left({N}^{-1/2}\right)$
	$\displaystyle(\text{C}.2)$		$\displaystyle~{}~{}~{}{\bf{E}}\left[\partial_{\theta}l_{X}(\theta\|{\bf x})\right]=0.$
	$\displaystyle(\text{C}.3)$		$\displaystyle~{}~{}~{}{\bf{E}}\left[\left(\partial_{\theta}l_{X}(\theta\|{\bf x})\right)^{2}\right]=F_{X}(\theta)/N,$

where $F_{X}(\theta)$ is the Fisher information of probability $p(x|X,\theta)$ .

If we assume that an estimate $\check{\theta}$ is close to a true parameter value $\theta_{0}$ , the difference between the estimate and the true value can approximate to

\displaystyle\check{\theta}-\theta_{0}\approx-\frac{\partial_{\theta}l(\theta_{0}|{\bf x})}{\partial^{2}_{\theta}l(\theta_{0}|{\bf x})}.

(S5)

The first order moment of the difference determines biasedness, quantifying how much the estimate is shifted from the true value. The second order moment of the difference corresponds to estimation error.

The conditions (C. $1$ ) and (C. $2$ ) imply that the MLE is an unbiased estimator:

{\bf E}\left[\check{\theta}-\theta_{0}\right]\approx\frac{{\bf E}\left[\partial_{\theta}l(\theta_{0}|{\bf x})\right]}{F_{X}(\theta_{0})+{\cal O}\left({N}^{-1/2}\right)}=0.

(S6)

The (C. $2$ ) and (C. $3$ ) imply that MLE is an efficient estimator; the estimation error $\Delta^{2}\theta$ attains Cramér-Rao bound in the asymptotic limit of $N\rightarrow\infty$ ,

\Delta^{2}\theta={\bf E}\left[\left(\check{\theta}-\theta_{0}\right)^{2}\right]\approx\frac{{\bf E}\left[\left(\partial_{\theta}l(\theta_{0}|{\bf x})\right)^{2}\right]}{\left[F_{X}(\theta_{0})+{\cal O}\left({N}^{-1/2}\right)\right]^{2}}=\frac{1}{NF_{X}(\theta_{0})}+{\cal O}(N^{-3/2}).

(S7)

The unbiasedness and the efficiency finally imply that MLE satisfies asymptotic normality in the limit $N\rightarrow\infty$ , i.e.,

\sqrt{N}\left(\check{\theta}-\theta_{0}\right)\xrightarrow{N}{\cal N}(0,F^{-1}_{X}(\theta_{0})),

(S8)

where $\cal N$ is normal distribution of zero mean and variance $F^{-1}_{X}(\theta_{0})$ . Purpose of following sections is to prove that the MLEOQ has the asymptotic noramlity.

S2-2 Log-likelihood function of operational quasiprobability

Whereas the typical MLE is based on a single ensemble $X$ represented by a conditional probability $p(x|X,\theta)$ , MLE in the contextual quantum metrology (coQM) utilizes the two ensembles $B$ and $AB$ represented by the probabilities $p(b|B)$ and $p(a,b|AB)$ , respectively. To represent the probabilities from these two ensembles at once, we employ the operational quasiprobability (OQ) given by

w(a,b|W,\theta)=p(a,b|AB,\theta)+\frac{1}{2}\left(p(b|B,\theta)-p(b|AB,\theta)\right).

(S9)

One can directly collect samples from the two ensembles $B$ and $AB$ through the experiment shown in Fig. $1$ , but the samples of $W$ can only be deduced from those of $B$ and $AB$ as a mixture of two ensembles. In this sense, we call the ensembles $B$ and $AB$ observable, and $W$ virtual.

Our approach uses the virtual ensemble $W$ to estimate the parameter $\theta$ . MLE using OQ (MLEOQ) calculates an estimate as a value which gives an extremum of log-likelihood function of $w$ function, which is defined by

l_{W}(\theta|{\bf x}_{W}):=\frac{1}{N_{W}}\sum_{x_{i}\in{\bf x}_{W}}\log w(x_{i}|W,\theta),

(S10)

where ${\bf x}_{W}$ is a composite data set of ${\bf x}_{B}$ and ${\bf x}_{AB}$ drawn from the observable ensembles $A$ and $AB$ , respectively. Note that we only consider positive OQ to apply the function $w$ to the logarithmic function without divergence. We test whether the OQ is positive or not with negativity defined by $\sum_{a,b}\left(|{w(a,b|W,\theta)}|-w(a,b|W,\theta)\right)/2$ . $w$ is positive semidefinite if the negativity is zero.

A lot of work in quantum metrology have been done based on the theory of typical maximum likelihood estimation assuming a single observable ensemble. On the other hand, estimation based on an ensemble mixture has been drawn little attention in the field of quantum metrology. In this work, we propose a maximum likelihood estimation utilizing the two different ensembles. To do so, we suggest a theory of ensemble mixing and prove that MLEOQ satisfies the asymptotic normality.

S2-3 Forward mixing of ensembles

Suppose that a virtual ensemble $C$ is a mixture of two observable ensembles $A$ and $B$ by following relation

p(x|C)=p(x|A)p(A)+p(x|B)p(B),

(S11)

where $p(x|X)$ is a probability which represents a ensemble $X$ , and $p(X)$ is a probability to choose ensemble $X$ . If one draws $N_{A}$ samples from the ensemble $A$ and $N_{B}$ from $B$ , the number of samples of $C$ is $N_{C}=N_{A}+N_{B}$ . Thus, in the asymptotic limit of $N_{C}\rightarrow\infty$ , $p(A)=N_{A}/N_{C}$ and $P(B)=N_{B}/N_{C}$ . We describe the virtual ensemble $C$ as a statistical mixture of two observable ensembles $A$ and $B$ . We call this description as forward mixing of ensembles. The forward mixing is depicted in Fig. S1. a.

We call one of significant properties of the forward mixing as functional equivalence: expectation of an arbitrary function $f$ by the virtual ensemble $C$ can be decomposed into those by the observable ensembles $A$ and $B$ . In other words,

\mathbf{E}[f|C]=p(A)\mathbf{E}[f|A]+p(B)\mathbf{E}[f|B],

(S12)

where $\mathbf{E}[f|X]=\sum_{x}f(x)p(x|X)$ . This is a key property to show the asymptotic normality of MLEOQ.

To prove the functional equivalence, we analyze the fluctuation of number count $N(x|X)$ for an outcome $x$ of a data set ${\bf x}=\{x_{1},x_{2},\cdots,x_{N_{X}}\}$ drawn from an ensemble $X$ . For each outcome $x$ , the number of count $N(x|X)$ can approximate to mean value $\bar{N}(x|X):=N_{X}p(x|X)$ with standard deviations $\sigma=\sqrt{N_{X}}$ in the asymptotic limit $N_{X}\rightarrow\infty$ as

N(x|X)\approx\bar{N}(x|X)+{\cal O}\left(\sqrt{N_{X}}\right).

(S13)

Let a grand probability distribution of count numbers be $p(\{N(x|X)\})=\Pi_{x}p(N(x|X))$ . Each probability $p(N(x|X))$ respective to a number count follows a binomial distribution $\binom{N}{N(x|X)}p(x|X)^{N(x|X)}(1-p(x|X))^{N-N(x|X)}$ . In the asymptotic limit of $N_{X}\rightarrow\infty$ , the Binomial distribution approximates to a Poisson distribution of mean value $\bar{N}(x|X)$ . Furthermore, as the average number count increases along with the increase of $N_{X}$ , the Poisson distribution approximates to a normal distribution ${\cal N}(\bar{N}(x|X),\bar{N}(x|X))$ . Thus, the grand probability finally approximates to the normal distribution as

\displaystyle p(\{N(x|X)\})\propto\exp\left[-\frac{1}{2}\sum_{x}\left(\frac{N(x|X)-\bar{N}(x|X)}{\bar{N}(x|X)}\right)^{2}\right].

This implies that $N(x|X)\approx\bar{N}(x|X)\pm\sqrt{N_{X}p(x|X)}$ in the asymptotic limit of $N_{X}\rightarrow\infty$ .

By using the approximation of number count (S13), we show that the number count of virtual ensemble $C$ is decomposed of those of $A$ and $B$ as

N(x|C)\approx N(x|A)+N(x|B)+{\cal O}\left(\sqrt{N_{A,B,C}}\right).

(S14)

From the relation (S11), we can derive that

$\displaystyle N(x\|C)$	$\displaystyle=$	$\displaystyle N_{C}p(x\|C)$
	$\displaystyle=$	$\displaystyle N_{C}p(x\|A)p(A)+N_{C}p(x\|B)p(B)$
	$\displaystyle\approx$	$\displaystyle{N_{A}}p(x\|A)+N_{B}p(x\|B)+{\cal O}\left(\sqrt{N_{C}}\right)$
	$\displaystyle\approx$	$\displaystyle N(x\|A)+N(x\|B)+{\cal O}\left(\sqrt{N_{A,B,C}}\right),$

where we use $N_{X}\approx N_{C}p(X)+{\cal O}\left(\sqrt{N_{C}}\right)$ for the third step and $N(x|X)\approx N_{X}p(x|X)+{\cal O}\left(\sqrt{N_{X}}\right)$ for the fourth step.

Finally, using the relation (S14), we prove the functional equivalence. For the data set of ensemble $C$ , ${\bf x_{C}}$ , the expectation of function $f$ can be read

$\displaystyle{\bf E}\left[{\sum_{x_{i}\in{\bf x}_{C}}f(x_{i})}\right]$	$\displaystyle=$	$\displaystyle{\bf E}\left[\sum_{x}N(x\|C)f(x)\right]\approx N_{C}{\bf E}\left[f\|C\right]+{\cal O}\left(\sqrt{N_{C}}\right)$	(S15)
	$\displaystyle\approx$	$\displaystyle{\bf E}\left[{\sum_{x}N(x\|A)f(x)}\right]+{\bf E}\left[{\sum_{x}N(x\|B)f(x)}\right]+{\cal O}\left(\sqrt{N_{A,B,C}}\right)$
	$\displaystyle=$	$\displaystyle{\bf E}\left[{\sum_{x_{j}\in{\bf x}_{A}}f(x_{j})}\right]+{\bf E}\left[{\sum_{x_{k}\in{\bf x}_{B}}f(x_{k})}\right]+{\cal O}\left(\sqrt{N_{A,B,C}}\right)$
	$\displaystyle=$	$\displaystyle N_{A}{\bf E}\left[f\|A\right]+N_{B}{\bf E}\left[f\|B\right]+{\cal O}\left(\sqrt{N_{A,B,C}}\right).$

This implies that $N_{C}{\bf E}\left[f|C\right]=N_{A}{\bf E}\left[f|A\right]+N_{B}{\bf E}\left[f|B\right]$ or, equivalently, ${\bf E}\left[f|C\right]=p(A){\bf E}\left[f|A\right]+p(B){\bf E}\left[f|B\right]$ in the asymptotic limit of $N_{A,B,C}\rightarrow\infty$ . $\blacksquare$

S2-4 Backward mixing of ensembles

We can describe the forward mixing in a different view that considers the observable ensemble $A$ as a mixture of observable ensemble $B$ and virtual ensemble $C$ . We call this description as backward mixing of ensembles (see Fig. S1. b). In the backward mixing, we have a reciprocal relation of Eq. (S11),

p(x|A)=p(x|C)\frac{1}{p(A)}-p(x|B)\frac{p(B)}{p(A)}.

(S16)

The backward mixing preserves the functional equivalence, i.e.,

\mathbf{E}[f|A]=\mathbf{E}[f|C]\frac{1}{p(A)}-\mathbf{E}[f|B]\frac{p(B)}{p(A)}.

(S17)

In the backward problem, the number count satisfies a reciprocal relation of number counts of Eq. (S14).

$\displaystyle N_{A}p(x\|A)$	$\displaystyle=$	$\displaystyle N_{A}p(x\|C)\frac{1}{p(A)}-N_{A}p(x\|B)\frac{p(B)}{p(A)}$	(S18)
	$\displaystyle\approx$	$\displaystyle\frac{N_{A}}{N_{C}}{N_{C}}p(x\|C)\frac{1}{p(A)}-\frac{N_{A}}{N_{B}}N_{B}p(x\|B)\frac{p(B)}{p(A)}+{\cal O}\left(\sqrt{N_{A}}\right)$
	$\displaystyle\approx$	$\displaystyle N(x\|C)-N(x\|B)+{\cal O}\left(\sqrt{N_{A,B,C}}\right).$

Thus, the functional equivalence becomes

$\displaystyle{\bf E}\left[{\sum_{x_{i}\in{\bf x}_{A}}f(x_{i})}\right]$	$\displaystyle=$	$\displaystyle{\bf E}\left[\sum_{x}N(x\|A)f(x)\right]\approx N_{A}{\bf E}\left[f\|A\right]+{\cal O}\left(\sqrt{N_{A}}\right)$	(S19)
	$\displaystyle\approx$	$\displaystyle{\bf E}\left[{\sum_{x}N(x\|C)f(x)}\right]-{\bf E}\left[{\sum_{x}N(x\|B)f(x)}\right]+{\cal O}\left(\sqrt{N_{A,B,C}}\right)$
	$\displaystyle=$	$\displaystyle{\bf E}\left[{\sum_{j\in C}f(x_{j})}\right]+{\bf E}\left[{\sum_{k\in B}f(x_{k})}\right]+{\cal O}\left(\sqrt{N_{A,B,C}}\right)$
	$\displaystyle=$	$\displaystyle N_{C}{\bf E}\left[f\|C\right]-N_{B}{\bf E}\left[f\|B\right]+{\cal O}\left(\sqrt{N_{A,B,C}}\right).$

This implies that $N_{A}{\bf E}\left[f|A\right]=N_{C}{\bf E}\left[f|C\right]-N_{B}{\bf E}\left[f|B\right]$ and, equivalently, ${\bf E}\left[f|A\right]={\bf E}\left[f|C\right]\frac{1}{p(A)}-{\bf E}\left[f|B\right]\frac{p(B)}{p(A)}$ in the asymptotic limit of $N_{A,B,C}\rightarrow\infty$ . $\blacksquare$

S2-5 Operational quasiprobability as a distribution of ensemble mixtures

Now we are in a position to interpret the operational quasiprobability (OQ) as a model to represent the forward and backward mixture of ensembles. For the OQ, observable ensembles are $RB$ , $C_{AB}$ , and $C_{RB}$ , where $R$ is a random ensemble, $B$ is the ensemble for the single $B$ measurement, $C_{AB}$ is the ensemble for the consecutive measurement $AB$ , and $C_{RB}$ is the marginal ensemble $B$ of $C_{AB}$ .

The OQ is obtained by the backward mixing of a virtual ensemble, called $E_{+}$ , and the marginal ensemble $C_{RB}$ as

\displaystyle w(a,b|W)

\displaystyle=

\displaystyle p(a,b|E_{+})\frac{1}{p(W)}-p(a|R)p(b|AB)\frac{p(B)}{p(W)},

(S20)

where the virtual ensemble $E_{+}$ is defined by the forward mixing of observable ensembles $RB$ and $C_{AB}$ as

\displaystyle p(a,b|E_{+})=p(a|R)p(b|B)p(B)+p(a,b|AB)p(AB).

(S21)

We here consider $p(W)=1/2$ , $p(B)=1/2$ , $p(AB)=1/2$ , and $p(a|R)=1/2$ $\forall a$ , and the OQ in Eq. (S9) is obtained by

	$\displaystyle w(a,b\|W)$	$\displaystyle=$	$\displaystyle\frac{p(AB)}{p(W)}p(a,b\|AB)+\frac{p(B)}{p(W)}p(a\|R)p(b\|B)-\frac{p(B)}{p(W)}p(a\|R)p(b\|AB)$		(S22)
		$\displaystyle=$	$\displaystyle p(a,b\|AB)+\frac{1}{2}p(b\|B)-\frac{1}{2}p(b\|AB).$		(S22)

Diagram for the ensemble mixings to construct the OQ is presented in Fig. S2

S2-6 Asymptotic normality of MLEOQ

Based on the aforementioned asymptotic properties, we show that the log-likelihood function of OQ satisfies following three conditions:

	$\displaystyle(\text{C}^{\prime}.1)$		$\displaystyle~{}~{}-\partial^{2}_{\theta}l_{W}(\theta\|{\bf x})=F_{W}(\theta)+{\cal O}\left({N}^{-1/2}_{W}\right),~{}\text{as}~{}N_{W}\rightarrow\infty.$
	$\displaystyle(\text{C}^{\prime}.2)$		$\displaystyle~{}~{}~{}{\bf{E}}\left[\partial_{\theta}l_{W}(\theta\|{\bf x}_{W})\right]=0.$
	$\displaystyle(\text{C}^{\prime}.3)$		$\displaystyle~{}~{}~{}{\bf{E}}\left[\left(\partial_{\theta}l_{W}(\theta\|{\bf x}_{W})\right)^{2}\right]=F_{W}(\theta)/{N_{W}},$

Note that we consider a positive $w$ function.

Proof of $(\text{C}^{\prime}.1)$ .—In the asymptotic limit of $N_{W}\rightarrow\infty$ , the second order derivative of log-likelihood function of OQ can be read

$\displaystyle-\partial^{2}_{\theta}l_{W}(\theta\|{\bf x}_{W})$	$\displaystyle=$	$\displaystyle-\frac{1}{N_{W}}\sum_{x_{i}\in{\bf x}_{W}}\partial^{2}_{\theta}\log w(x_{i}\|W,\theta)$
	$\displaystyle=$	$\displaystyle-\frac{1}{N_{W}}\sum_{x}N(x\|W)\partial^{2}_{\theta}\log w(x\|W,\theta)$
	$\displaystyle\approx$	$\displaystyle-\frac{1}{N_{W}}\sum_{x}\left[N_{W}w(x\|W,\theta)+{\cal O}\left(\sqrt{N_{W}}\right)\right]\left[\frac{\partial^{2}_{\theta}w(x\|W,\theta)}{w(x\|W,\theta)}-\left(\partial_{\theta}\log w(x\|W,\theta)\right)^{2}\right]$
	$\displaystyle=$	$\displaystyle\sum_{x}w(x\|W,\theta)\left({\partial_{\theta}\log w(x\|W,\theta)}\right)^{2}+{\cal O}\left({N_{W}}^{-1/2}\right)$
	$\displaystyle=$	$\displaystyle F_{W}(\theta)+{\cal O}\left({N_{W}}^{-1/2}\right),$

where Fisher information of $w$ is defined by $F_{W}(\theta):=\sum_{x}w(x|W,\theta)\left({\partial_{\theta}\log w(x|W,\theta)}\right)^{2}$ . $\blacksquare$

Proof of $(\text{C}^{\prime}.2)$ .—By the functional equivalence in Eq. (S15) and Eq. (S19), ${\bf E}\left[\partial_{\theta}l_{W}(\theta|{\bf x}_{W})\right]$ can be decomposed into the respective expectations of observable ensembles $B$ and $AB$ in the asymptotic limit of $N_{W}\rightarrow\infty$ as

\displaystyle{\bf E}\left[\partial_{\theta}l_{W}(\theta|{\bf x}_{W})\right]

\displaystyle=

\displaystyle{\bf E}\left[\partial_{\theta}l_{AB}(\theta|{\bf x}_{AB})\right]+\frac{1}{2}{\bf E}\left[\partial_{\theta}l_{B}(\theta|{\bf x}_{B})\right]-\frac{1}{2}{\bf E}\left[\partial_{\theta}l_{B}(\theta|{\bf x}_{AB})\right].

By the condition $(\text{C}.2)$ , each term in the right-hand side is zero. Thus, ${\bf E}\left[\partial_{\theta}l_{W}(\theta|{\bf x}_{W})\right]$ is zero. $\blacksquare$

Proof of $(\text{C}^{\prime}.3)$ .— We use the condition $(\text{C}^{\prime}.2)$ to prove the condition $(\text{C}^{\prime}.3)$ :

$\displaystyle{\bf{E}}\left[\left(\partial_{\theta}l_{W}(\theta\|{\bf x}_{W})\right)^{2}\right]$	$\displaystyle=$	$\displaystyle\frac{1}{N_{W}^{2}}\mathbf{E}\left[\sum_{x_{i},x_{j}\in{\bf x}_{W}}\partial_{\theta}\log w(x_{i}\|W,\theta)\partial_{\theta}\log w(x_{j}\|W,\theta)\right]$
	$\displaystyle\approx$	$\displaystyle\frac{1}{N_{W}^{2}}\sum_{i=j}\mathbf{E}\left[\left(\partial_{\theta}\log w(x_{i}\|W,\theta)\right)^{2}\right]+\frac{1}{N_{W}^{2}}\sum_{i\neq j}\mathbf{E}\left[\partial_{\theta}\log w(x_{i}\|W,\theta)\right]\mathbf{E}\left[\partial_{\theta}\log w(x_{j}\|W,\theta)\right]$
	$\displaystyle=$	$\displaystyle\frac{1}{N_{W}}\mathbf{E}\left[\left(\partial_{\theta}\log w(x\|W,\theta)\right)^{2}\right]$
	$\displaystyle=$	$\displaystyle\frac{1}{N_{W}}\sum_{x}w(x\|W,\theta)\left(\partial_{\theta}\log w(x\|W,\theta)\right)^{2}$
	$\displaystyle=$	$\displaystyle{F_{W}(\theta)}/{N_{W}}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}~{}\blacksquare$

The conditions $(\text{C}^{\prime}.1)$ and $(\text{C}^{\prime}.2)$ imiply that the MLEOQ is unbiased estimator:

{\bf E}\left[\check{\theta}-\theta_{0}\right]\approx\frac{{\bf E}\left[\partial_{\theta}l_{W}(\theta_{0}|{\bf x}_{W})\right]}{F_{W}(\theta_{0})+{\cal O}\left({N_{W}}^{-1/2}\right)}=0.

(S23)

The $(\text{C}^{\prime}.2)$ and $(\text{C}^{\prime}.3)$ imply that MLEOQ is an efficient estimator as

\Delta^{2}\theta={\bf E}\left[\left(\check{\theta}-\theta_{0}\right)^{2}\right]\approx\frac{{\bf E}\left[\left(\partial_{\theta}l_{W}(\theta_{0}|{\bf x}_{W})\right)^{2}\right]}{\left[F_{W}(\theta_{0})+{\cal O}\left({N_{W}}^{-1/2}\right)\right]^{2}}=\frac{1}{N_{W}F_{W}(\theta_{0})}+{\cal O}(N_{W}^{-3/2}).

(S24)

Thus, the MLEOQ satisfies the asymptotic normality, i.e.,

\sqrt{N_{W}}\left(\check{\theta}-\theta_{0}\right)\xrightarrow{N_{W}}{\cal N}(0,F_{W}^{-1}(\theta_{0})).

(S25)

Based on these facts, to estimate the estimation error ${\bf E}\left[\left(\check{\theta}-\theta_{0}\right)^{2}\right]$ in the asymptotic limit of $N_{W}\rightarrow\infty$ , we employ an estimator for Fisher information [54, 55] defined by

\displaystyle{F}_{\text{est}}({\theta}_{\text{est}}|\mathbf{x})

\displaystyle:=\left.-\partial^{2}_{\theta}l_{W}(\theta|\mathbf{x}_{W})\right|_{\theta={\theta}_{\text{est}}}.

(S26)

Fig. S3 shows that, in a large sample size, the estimation of error closes to the theoretical predictions. In this Figure, we demonstrate the contextuality-enabled enhancement for $\theta$ and $\phi$ estimation. We apply a systematic error model to make the theoretical predictions. Explanation for the error model is presented in the next section.

S3 Error and noise analysis

S3-1 Systematic error model

We characterize systematic errors of our optical setting. The errors do not make our quantum system to lose its coherence, but they cause unwanted drift in experimental results. However, it is impossible to avoid misalignment of control parameters in experiment, so that one performs experiment by allowing a small amount of systematic errors. For example, the axes of wave-plates can be deviated $0.1$ degrees in their alignment.

To anlyze the effect of systematic error, we model a quantum probe state $\hat{\varrho}$ with Bloch representation as

\hat{\varrho}(\vec{r}_{s})=\frac{1}{2}\left(I+\vec{r}_{s}\cdot\vec{\sigma}\right),

(S27)

where $\vec{r}_{s}=r_{s}(\sin\theta_{s}\cos\phi_{s},\sin\theta_{s}\sin\phi_{s},\cos\theta_{s})$ satisfying $|{r_{s}}|\leq 1$ and $\vec{\sigma}=(\sigma_{x},\sigma_{y},\sigma_{z})$ . For an element of POVM $\hat{m}$ of a binary outcome $a$ , we use the representation including biasedness $x$ and unsharpness $\mu$ of outcome, which is given by

\hat{m}(a|x,\vec{\mu})=\frac{1}{2}\left\{\left[1+(-1)^{a}x\right]I+(-1)^{a}\vec{\mu}\cdot\vec{\sigma}\right\},

(S28)

where $\vec{\mu}=\mu(\sin\theta\cos\phi,\sin\theta\sin\phi,\cos\theta)$ for $0\leq\mu\leq 1$ , and the condition for the measurement to be a POVM is $|{x}|\leq 1-\mu$ [56]. We also represent overall drift of measurement parameters with a linear model

\displaystyle\theta=\theta_{0}+\theta_{1}\theta_{\text{exp}},\quad\text{and}\quad\phi=\phi_{0}+\phi_{1}\phi_{\text{exp}},

(S29)

where $\theta_{0}$ and $\phi_{0}$ represent shift of parameters, and $\theta_{1}$ and $\phi_{1}$ represent scaling of parameters. Finally, a probability obtained by the probe state and the measurement model is given by

	$\displaystyle p(a\|t_{s},t)$	$\displaystyle=$	$\displaystyle\text{tr}\hat{\varrho}(\vec{r}_{s})\hat{m}(a\|x,\vec{\mu})$		(S30)
		$\displaystyle=$	$\displaystyle\frac{1}{2}\left\{\left[1+(-1)^{a}x\right]+\vec{r}_{s}\cdot\vec{\mu}\right\},$		(S30)

where we abbreviate the model parameters as $t_{s}=\vec{r}_{s}$ and $t=(x,\vec{\mu})$ .

Our goal is to find the model parameters that best describe the experiments. We collect empirical frequencies for each outcome $a$ for a set of states $R=\{\vec{r}^{i}_{s}|i=1,2,\cdots,N_{s}\}$ with a priori probability $p(\vec{r}^{i}_{s})$ . By Bayes’ theorem, the joint probability $p(b,t_{s}|t)$ is given by

p(b,t_{s}|t)=p(b|t^{i}_{s},t)p(t^{i}_{s}).

(S31)

A uniform distribution for the parameters of quantum state is defined by $p(t^{i}_{s})=1/4\pi$ as $\int dt_{s}1/4\pi=1$ . In terms of $\theta_{s}$ and $\phi_{s}$ , the uniform distribution comes to another distribution $\sin\theta_{s}/4\pi$ . This distribution on continuous space $0\leq\theta_{s}\leq\pi$ and $0\leq\phi_{s}\leq 2\pi$ approximates to the distribution on discrete lattices of $\Delta\theta_{s}\Delta\phi_{s}$ as

p(\theta_{s},\phi_{s})\approx\Delta\theta_{s}\Delta\phi_{s}p(\theta^{i}_{s},\phi^{j}_{s}),

(S32)

where $(i,j)$ is the index pair to the lattices and $p(\theta^{i}_{s},\phi^{j}_{s})=\sin\theta^{i}_{s}/N$ . Here, $N_{s}$ is a normalization constant,

N_{s}=\sum_{i,j}\sin\theta^{i}_{s}=N_{\phi_{s}}N_{\theta_{s}}\langle\sin\theta_{s}\rangle=\frac{2N_{\phi_{s}}N_{\theta_{s}}}{\pi}\left(\frac{1}{2}\sum_{i}\frac{\pi}{N_{\theta_{s}}}\sin\theta^{i}_{s}\right)~{}\rightarrow~{}\frac{2N_{\phi_{s}}N_{\theta_{s}}}{\pi}~{}\text{as}~{}N_{s}\rightarrow\infty

(S33)

for the lattice of $N_{\phi_{s}}\times N_{\theta_{s}}$ . We use $\lim_{N_{\theta_{s}}\rightarrow\infty}\sum_{i}{\pi}\sin\theta^{i}_{s}/{N_{\theta_{s}}}=\int^{\pi}_{0}d\theta_{s}\sin\theta_{s}=2$ . With the a priori distribution on the discrete lattice, our model for the joint probability becomes

\displaystyle p(a,\theta^{i}_{s},\phi^{j}_{s}|t)=p(a|\theta^{i}_{s},\phi^{j}_{s},t)p(\theta^{i}_{s},\phi^{j}_{s})=\frac{\sin\theta^{i}_{s}}{2N_{s}}\left\{\left[1+(-1)^{a}x\right]+(-1)^{a}\vec{r}_{s}^{ij}\cdot\vec{\mu}\right\}.

(S34)

To estimate the parameter $t$ of measurement model that best describes the experimental results $N(a|t^{ij}_{s})$ for each lattice point $(\theta^{i}_{s},\phi^{j}_{s})$ , we use Bayesian inference. We define a log-likelihood function as

	$\displaystyle l(t)$	$\displaystyle:=$	$\displaystyle\sum_{a,i,j}f(a\|t^{ij}_{s})p(t^{ij}_{s})\log p(a\|t^{ij}_{s},t)p(t^{ij}_{s})$		(S35)
		$\displaystyle=$	$\displaystyle\sum_{a,i,j}f(a,t^{ij}_{s})\log p(a,t^{ij}_{s}\|t)$		(S35)

where $f(a|t^{ij}_{s})=N(a|t^{ij}_{s})/N(t^{ij}_{s})$ and $N(t^{ij}_{s})=\sum_{a}N(a|t^{ij}_{s})$ . An estimate is a value to give an extremum of log-likelihood function $l(t)$ . We assume that the probabilities given by the model is a baseline. In this case, finding an extreme value of likelihood function becomes maximizing Kullback–Leibler divergence $D_{\text{KL}}$ , which is defined by

\displaystyle D_{\text{KL}}\left[f(A|t_{s})p(t_{s})\|p(A|t_{s},t)p(t_{s})\right]

\displaystyle:=

\displaystyle\sum_{a,i,j}f(a,t^{ij}_{s})\log\frac{f(a,t^{ij}_{s})}{p(a,t^{ij}_{s}|t)}.

(S36)

The $D_{\text{KL}}$ is zero if the experimental frequency $f(A,t_{s})$ is equivalent to the model probability $p(a,t_{s}|t)$ . The two approaches result in the same extreme value as

$\displaystyle 0$	$\displaystyle=$	$\displaystyle\partial_{t}D_{\text{KL}}\left[f(A\|t_{s})p(t_{s})\\|p(A\|t_{s},t)p(t_{s})\right]$	(S37)
	$\displaystyle=$	$\displaystyle\partial_{t}\sum_{a,i,j}f(a\|t^{ij}_{s})p(t^{ij}_{s})\log\frac{f(a\|t^{ij}_{s})p(t^{ij}_{s})}{p(a\|t^{ij}_{s},t)p(t^{ij}_{s})}$
	$\displaystyle=$	$\displaystyle-\partial_{t}l(t).$

As the coQM utilizes the two measurements $A$ and $B$ , we collect empirical frequencies for each measurement, and consider a minimization problem to analyze the systematic errors:

	$\displaystyle\underset{t_{A},t_{B}}{\text{minimize}}~{}~{}~{}D_{KL}\left[f(A,t_{s})\\|p(A,t_{s}\|t_{A})\right]+D_{KL}\left[f(B,t_{s})\\|p(B,t_{s}\|t_{B})\right]$
	$\displaystyle\text{subject to}~{}~{}\|{x_{A}}\|\leq 1-\mu_{A},~{}0\leq\mu_{A}\leq 1,~{}\|{x_{B}}\|\leq 1-\mu_{B},~{}\text{and}~{}0\leq\mu_{B}\leq 1.$		(S38)

In summary, we characterize the measurements $A$ and $B$ with the model parameters $t_{A}=(x_{A},\mu_{A},\theta_{0},\theta_{1},\phi_{0},\phi_{1})$ and $t_{B}=(x_{B},\mu_{B},\theta_{0},\theta_{1},\phi_{0},\phi_{1})$ , where meaning of each parameter is following;

$\theta_{0}$ :overall drift of $\theta$

$\phi_{0}$ :overall drift of $\phi$

$\theta_{1}$ :scaling factor of $\theta$

$\phi_{1}$ :scaling factor of $\phi$

$x_{A,B}$ :biasedness of measurement $A$ , $B$

$\mu_{A,B}$ :sharpness of measurement $A$ , $B$

$\theta_{A,B}$ :shift of $\theta$ for measurement $A$ , $B$

$\phi_{A,B}$ :shift of $\phi$ for measurement $A$ , $B$

The parameter values for the ideal (no error) case and for an exemplary case with sample size $N_{s}=10^{5}$ are shown in the table 1. We apply these model parameters to make the theoretical prediction in Fig. S3.

Table 1: The model parameters to describe the systematic errors.

	$\theta_{0}$	$\phi_{0}$	$\theta_{1}$	$\phi_{1}$	$x_{A}$	$\mu_{A}$	$\theta_{A}$	$\phi_{A}$	$x_{B}$	$\mu_{B}$	$\theta_{B}$	$\phi_{B}$
No error	$0.0$	$0.0$	$1.0$	$1.0$	$0.0$	$1.0$	$0.0$	$0.0$	$0.0$	$1.0$	$0.5$	$0.0$
Experiment	$0.00023$	$0.0078$	$1.0$	$1.0$	$-0.0015$	$1.0$	$0.0$	$0.0$	$0.0016$	$0.99$	$0.5$	$0.0$

S3-2 Effect of depolarization noise

We investigate how performance of contextual quantum metrology depends on purity of a probe state. Depolarization noise degrades coherence of a quantum system, so it is severe error to the quantum system. We represent a partially depolarized quantum state $\hat{\varrho}_{d}$ as

\hat{\varrho}_{d}=\lambda\hat{\varrho}_{0}+\frac{(1-\lambda)}{3}\left({\sigma}_{x}\hat{\varrho}_{0}{\sigma}_{x}+{\sigma}_{y}\hat{\varrho}_{0}{\sigma}_{y}+{\sigma}_{z}\hat{\varrho}_{0}{\sigma}_{z}\right),

(S39)

where $\hat{\varrho}_{0}$ is a pure probe state, ${\sigma}$ s are Pauli matrices, and $\lambda\in[0.25,1]$ indicates degree of purity. $\lambda=1$ for a pure state and $\lambda=0.25$ for a fully depolarized state $I/2$ , where $I$ is an identity matrix. To model the outcome probabilities of partially depolarized state $\hat{\varrho}_{d}$ , we mix probabilities obtained by measuring the states $\hat{\varrho}_{0}$ and $\hat{\varrho}_{i\in\{x,y,z\}}=\sigma_{i}\hat{\varrho}_{0}\sigma_{i}$ with a measurement as

P_{d}(a,b)=\lambda P_{0}(a,b)+\frac{(1-\lambda)}{3}\left[P_{x}(a,b)+P_{y}(a,b)+P_{z}(a,b)\right],

(S40)

where $P_{i}$ is the probability obtained by measuring the state $\hat{\varrho}_{i}$ .

For $\phi$ estimation, we investigate how the estimation error changes by varying $\lambda$ from $1$ to $0.8$ . Monte-Carlo simulation (Fig. S4 a) shows the degradation of the estimation precision by the depolarization. The gray dashed line is the minimum error in the conventional quantum metrology (cvQM) obtained by the pure probe state. In experiment (Fig. S4 b), we assume that our probe states are pure. We observe the contextuality-enabled enhancement if $\lambda=1$ , $0.95$ and $0.9$ . If $\lambda=0.8$ , the enhancement is disappeared in the range of the parameters we consider.

$\displaystyle N(x\|C)$	$\displaystyle=$	$\displaystyle N_{C}p(x\|C)$
	$\displaystyle=$	$\displaystyle N_{C}p(x\|A)p(A)+N_{C}p(x\|B)p(B)$
	$\displaystyle\approx$	$\displaystyle{N_{A}}p(x\|A)+N_{B}p(x\|B)+{\cal O}\left(\sqrt{N_{C}}\right)$
	$\displaystyle\approx$	$\displaystyle N(x\|A)+N(x\|B)+{\cal O}\left(\sqrt{N_{A,B,C}}\right),$