This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Optimal Pooling Matrix Design for Group Testing with Dilution (Row Degree) Constraints

Jirong Yi Myung Cho Xiaodong Wu  Raghu Mudumbai and Weiyu Xu Jirong Yi, Raghu Mudumbai, Xiaodong Wu and Weiyu Xu are with Department of Electrical and Computer Engineering, University of Iowa, Iowa City, IA, 52242.Myung Cho is with Department of Electrical and Computer Engineering, Penn State Behrend, Erie, PA, 16563.Email: weiyu-xu@uiowa.edu
Abstract

In this paper, we consider the problem of designing optimal pooling matrix for group testing (for example, for COVID-19 virus testing) with the constraint that no more than r>0r>0 samples can be pooled together, which we call “dilution constraint”. This problem translates to designing a matrix with elements being either 0 or 1 that has no more than rr ‘1’s in each row and has a certain performance guarantee of identifying anomalous elements. We explicitly give pooling matrix designs that satisfy the dilution constraint and have performance guarantees of identifying anomalous elements, and prove their optimality in saving the largest number of tests, namely showing that the designed matrices have the largest width-to-height ratio among all constraint-satisfying 0-1 matrices.

1 Introduction

In the current COVID-19 pandemic, before effective vaccines are introduced, large-scale shutdowns can be safely ended if a systematic “test and trace” program [1, 2] is adopted to contain the spread of the virus. Widespread availability of mass diagnostic testing is needed for re-opening economy, school and social activities. However, most countries including the US are currently experiencing a scarcity [3] of various medical resources including tests [4], and the test throughput or capacity can be limited. Pooled sample testing has been proposed as a method for increasing the effective capacity of existing testing infrastructure using the classical method of group testing or newly introduced compressed sensing techniques for virus testing [5, 6, 7, 8, 9, 10, 11] using the RT-qPCR (real-time Quantitative Polymerase Chain Reaction) tests.

However, there is potentially a dilution problem associated with group testing when samples are pooled together. When a positive sample is mixed with multiple negative samples, the concentration of the substances to be tested will be reduced, sometimes below the detection threshold of adopted testing technologies. This limits the number of samples that can be pooled together. Recently the FDA of the US has authorized pooled testings for COVID-19 infection by Quest Diagnostics, and another pooled testing from the LabCorp. Possibly partially due to dilution concerns, the FDA’s emergency use authorizations (EUA) for pooled testing allows pooling to no more than 4 specimens for the test developed by Quest Diagnostics, and allows pooling to no more than 5 specimens for the test from the LabCorp [12, 13, 14, 15, 16, 17, 18, 19].

Considering the dilution constraint, we ask the following question, “What is the best pooling design which maximizes test throughput while having a required capability of identifying infected samples, and satisfying the dilution constraint of pooling no more than r>0r>0 specimens in each pool?” In this paper, we give explicit designs for such matrices, and prove their optimality in maximizing test throughput under dilution constraint. We remark that, when there is no dilution constraint, maximizing test throughput under a given infection-identifying capability reduces to a traditional group testing problem.

2 Problem Formulation and Optimal Designs

In this paper, we focus on non-adaptive group testings which deal with nn test subjects, and perform mm non-adaptive (pooled) tests. Non-adaptive testing has the advantage of reducing the latency of an individual’s test because one can infer infection statuses of a group of subjects in a single RT-qPCR run. We require a pooling matrix design using which we are able to infer the correct infection status for each of these nn people, whenever there are no more than kk people infected among them. It is well-known that the pooling strategy can be represented by an m×nm\times n matrix AA which elements being ‘0’ or ‘1’. The jj-th (1jn1\leq j\leq n) person’s sample is part of the ii-th pool (test, 1in1\leq i\leq n) if and only if the element of AA in its ii-th row and jj-th column is ‘1’, namely Ai,j=1A_{i,j}=1. Moreover, as mentioned above, we require that each row of the pooling matrix have at most rr 1’s. Given an identification capability parameter kk, we would like to design a group testing matrix of dimension m×nm\times n such that nm\frac{n}{m} is maximized. In this short paper, we focus on k=1k=1, but the results in this paper can be extended to larger kk using similar techniques.

Theorem 2.1.

Let us define a ‘good’ group testing matrix of dimension m×nm\times n as a matrix satisfying the following two conditions: 1) using this matrix we can correctly infer the correct status of each of the nn elements whenever there is no more than k=1k=1 positive element; 2) and there are no more than rr 1’s in each row of such a matrix. Then there is a ‘good’ group testing matrix which satisfies m=rm=r, n=r(r+1)2n=\frac{r(r+1)}{2}, and

nm=r+12.\frac{n}{m}=\frac{r+1}{2}.

Moreover, there are no ‘good’ group testing matrices of dimension m×nm\times n for which

nm>r+12.\frac{n}{m}>\frac{r+1}{2}.
Proof.

We first prove that there are no ‘good’ group testing matrices for which nm>r+12\frac{n}{m}>\frac{r+1}{2}. To make sure that we can correctly infer the correct status of each of the nn elements whenever there is no more than k=1k=1 positive element, the columns of this matrix must be non-zero columns and must be distinct from each other. Suppose that we have a ‘good’ group matrices, which has n1n_{1} columns with a single ‘1’, n2n_{2} columns with two ‘1’s, n3n_{3} columns with three ‘1’s, …., and nmn_{m} columns with mm ‘1’s.

Then we have

i=1mni=n.\sum_{i=1}^{m}n_{i}=n.

Moreover, for every 1im1\leq i\leq m, we have

ni(mi).n_{i}\leq\binom{m}{i}.

In addition, the total number of ‘1’s in the matrix is given by T=i=1mi×niT=\sum_{i=1}^{m}i\times n_{i}. Since the number of ‘1’s in each row is no more than rr, we have

Trm.T\leq rm.

We now argue that nn cannot be bigger than m×r+12m\times\frac{r+1}{2}. Suppose instead that n>m×r+12n>m\times\frac{r+1}{2}. Since there are at most mm columns with a single ‘1’, then there must be at least

i=2mninm>m×r+12m=m(r12)\sum_{i=2}^{m}n_{i}\geq n-m>m\times\frac{r+1}{2}-m=m(\frac{r-1}{2})

columns which have at least two ‘1’s.

We let ll be the largest integer such that

i=1l(ml)n.\sum_{i=1}^{l}\binom{m}{l}\leq n.

Then we have

T=i=1mi×ni\displaystyle T=\sum_{i=1}^{m}i\times n_{i} (1)
i=1li(mi)+(ni=1l(mi))×(l+1)\displaystyle\geq\sum_{i=1}^{l}i\binom{m}{i}+\left(n-\sum_{i=1}^{l}\binom{m}{i}\right)\times(l+1) (2)
>1×m+m(r12)×2\displaystyle>1\times m+m(\frac{r-1}{2})\times 2 (3)
=mr.\displaystyle=mr. (4)

However, this contradicts with TrmT\leq rm. So we must have nmr+12\frac{n}{m}\leq\frac{r+1}{2}.

Now we show an explicit construction of a m×nm\times n ‘good’ group testing matrix with m=rm=r and n=r(r+1)2n=\frac{r(r+1)}{2}. This matrix has m=rm=r distinct columns with a single ‘1’ (part of this matrix making an r×rr\times r identity matrix), and (m2)=(r2)=r(r1)2\binom{m}{2}=\binom{r}{2}=\frac{r(r-1)}{2} distinct columns with two ‘1’s in each column. So in total there are r+2×r(r1)2=r+r(r1)=r2r+2\times\frac{r(r-1)}{2}=r+r(r-1)=r^{2} ‘1’s in this matrix. By the symmetry of this matrix’s construction, each row of this matrix has exactly r2×1r=rr^{2}\times\frac{1}{r}=r ‘1’s in each of its rows.

3 Examples for FDA approved Pooling Size r=4r=4 or r=5r=5

From the result in the last section, we have the following optimal design for r=4r=4:

A=[1000111000010010011000100101010001001011],A=\begin{bmatrix}1&0&0&0&1&1&1&0&0&0\\ 0&1&0&0&1&0&0&1&1&0\\ 0&0&1&0&0&1&0&1&0&1\\ 0&0&0&1&0&0&1&0&1&1\\ \end{bmatrix}, (5)

and the following optimal design for r=5r=5:

A=(100001111000000010001000111000001000100100110000100010010101000010001001011).A=\displaystyle\left(\begin{array}[]{rrrrrrrrrrrrrrr}1&0&0&0&0&1&1&1&1&0&0&0&0&0&0\\ 0&1&0&0&0&1&0&0&0&1&1&1&0&0&0\\ 0&0&1&0&0&0&1&0&0&1&0&0&1&1&0\\ 0&0&0&1&0&0&0&1&0&0&1&0&1&0&1\\ 0&0&0&0&1&0&0&0&1&0&0&1&0&1&1\end{array}\right).

4 Conclusions and Future Work

It would be interesting to extend this work to adaptive testing, and compressed sensing with dilution constraints [20].

References

  • [1] V. Lee, C. Chiew, and W. Khong. Interrupting transmission of COVID-19: lessons from containment efforts in Singapore. J Travel Med, 27(3), 2020.
  • [2] M. Salathé, C. Althaus, R. Neher, S. Stringhini, E. Hodcroft, J. Fellay, M. Zwahlen, G. Senti, M. Battegay, A. Wilder-Smith, I. Eckerle, M. Egger, and N. Low. COVID-19 epidemic in Switzerland: on the importance of testing, contact tracing and isolation. Swiss Medical Weekly, 150:w20225, 2020.
  • [3] M. Ranney, V. Griffeth, and A. Jha. Critical supply shortages — the need for ventilators and personal protective equipment during the Covid-19 pandemic. New England Journal of Medicine, 382(18):e41, April 2020.
  • [4] E. Emanuel, G. Persad, R. Upshur, B. Thome, M. Parker, A. Glickman, C. Zhang, C. Boyle, M. Smith, and J. Phillips. Fair allocation of scarce medical resources in the time of Covid-19. New England Journal of Medicine, 382(21):2049–2055, May 2020.
  • [5] Robert Dorfman. The detection of defective members of large populations. Ann. Math. Statist., 14(4):436–440, December 1943.
  • [6] N. Sinnott-Armstrong, D. Klein, and B. Hickey. Evaluation of group testing for SARS-CoV-2 RNA. medRxiv, page 2020.03.27.20043968, March 2020.
  • [7] H. Shani-Narkiss, O. Gilday, N. Yayon, and I. Landau. Efficient and practical sample pooling for high-throughput PCR diagnosis of COVID-19. medRxiv, page 2020.04.06.20052159, April 2020.
  • [8] C. Hogan, M. Sahoo, and B. Pinsky. Sample pooling as a strategy to detect community transmission of SARS-CoV-2. JAMA, 323(19):1967–1969, May 2020.
  • [9] I. Yelin, N. Aharony, E. Shaer-Tamar, A. Argoetti, E. Messer, D. Berenbaum, E. Shafran, A. Kuzli, N. Gandali, T. Hashimshony, Y. Mandel-Gutfreund, M. Halberthal, Y. Geffen, M. Szwarcwort-Cohen, and R. Kishony. Evaluation of COVID-19 RT-qPCR test in multi-sample pools. medRxiv, page 2020.03.26.20039438, March 2020.
  • [10] J. Yi, R. Mudumbai, and W. Xu. Low-cost and high-throughput testing of COVID-19 viruses and antibodies via compressed sensing: system concepts and computational experiments. arXiv:2004.05759 [cs, eess, math, q-bio], April 2020.
  • [11] S. Ghosh, R. Agarwal, M. Rehan, S. Pathak, P. Agrawal, Y. Gupta, S. Consul, N. Gupta, R. Goyal, A. Rajwade, and M. Gopalkrishnan. A compressed sensing approach to group-testing for COVID-19 detection. arXiv:2005.07895 [q-bio], May 2020. arXiv: 2005.07895.
  • [12] Office of The Commissioner. Coronavirus (COVID-19) update: facilitating diagnostic test availability for asymptomatic testing and sample pooling, June 2020. Library Catalog: www.fda.gov Publisher: FDA.
  • [13] Office of The Commissioner. Coronavirus (COVID-19) update: FDA authorizes first diagnostic Test for screening of people without known or suspected COVID-19 infection, July 2020. Library Catalog: www.fda.gov Publisher: FDA.
  • [14] Office of The Commissioner. Coronavirus (COVID-19) update: FDA issues first emergency authorization for sample pooling in diagnostic testing, July 2020. Library Catalog: www.fda.gov Publisher: FDA.
  • [15] Quest Digostics. FDA authorizes Quest Diagnostics COVID-19 diagnostic testing for specimen pooling for emergency use, Aug 2020. Library Catalog: newsroom.questdiagnostics.com.
  • [16] C. Hale. FDA opens door to ’batch testing’ for COVID-19 with new green lights for Quest Diagnostics, July 2020. Library Catalog: www.fiercebiotech.com.
  • [17] S. Taylor, J. Juliano, P. Trottman, J. Griffin, S. Landis, P. Kitsa, A. Tshefu, and S. Meshnick. High-throughput pooling and real-time PCR-based strategy for malaria detection. J. Clin. Microbiol., 48(2):512–519, February 2010.
  • [18] N. Taylor. LabCorp COVID-19 test gets 1st FDA OK for asymptomatic screening, July 2020. Library Catalog: www.medtechdive.com.
  • [19] Quest Diagostics. SARS-CoV-2 RNA, qualitative Real-Time RT-PCR, 2020.
  • [20] W. Xu, E. Mallada, and A. Tang. Compressive sensing over graphs. In 2011 Proceedings IEEE INFOCOM, pages 2087–2095, 2011.