Bare Demo of IEEEtran.cls
for IEEE Conferences

Michael Shell School of Electrical and
Computer Engineering
Georgia Institute of Technology
Atlanta, Georgia 30332–0250
Email: http://www.michaelshell.org/contact.html Homer Simpson Twentieth Century Fox
Springfield, USA
Email: homer@thesimpsons.com James Kirk
and Montgomery Scott Starfleet Academy
San Francisco, California 96678–2391
Telephone: (800) 555–1212
Fax: (888) 555–1212

Abstract

Compared to a standard CMOS Image Sensor (CIS), Quanta Image Sensor (QIS) has significantly higher photon sensitivity and speed. This offers new imaging capabilities beyond a conventional CIS, especially in low light conditions. If similar light conditions are considered, we show that QIS has significantly higher dynamic range than CIS. In addition, we present an algorithm to construct high dynamic range images from multiple low dynamic range (LDR) images.

I Introduction

Quanta Image Sensor (QIS) is envisioned as a candidate solution to to resolve the limited full well capacity issue resulting from shrinking pixel size. Since its introduction in 2005 [fossum200611] a significant number of research have demonstrated the feasibility of QIS and have proposed a few protoypes. In [Ma:17] Ma et al. demonstrated a QIS prototype with read noise below $0.25e^{-}$ rms at room temperature and frame rate beyond 1000 frames per second. This high level of photon sensitivity, low read noise, and high speed have made QIS an ideal sensor for low-light imaging applications.

The subject of this paper is to consider the high dynamic range imaging problem of QIS. One of the major shortcomings of a digital camera is the lack of dynamic range. Like traditional CMOS image sensors (CIS), the limited full well capacity of QIS upper bounds the dynamic range it can support. However, owing to its high speed, QIS is able to leverage photons acquired by the multiple frames to extend the dynamic range.

In this paper, we present a high-dynamic range image reconstruction theory and algorithm for QIS. We offer two contributions. First, we theoretically derive the dynamic range which can be offered by QIS compared to CIS. This provides a foundation of how much dynamic range we can expect from QIS. Second, we propose a novel high-dynamic range reconstruction algorithm for QIS. Our reconstruction algorithm is customized for QIS as it handles the truncated Poisson statistics uniquely present in QIS. We compare the performance of our algorithm with other state-of-the-art high-dynamic range reconstruction methods.

\includegraphics

[scale = 0.45]img2.png

Figure 1: HDR Reconstruction pipeline. The raw frames from QIS are first summed and denoised. Then the denoised images are linearly combined by giving weights to each image proportional to their

\text{SNR}_{H}

iteratively. (See Section LABEL:sec:HDR_rec)

\includegraphics

[height = 6cm]hdr_plot.png

Figure 2: Comparison of Exposure referred signal-to-noise (

\text{SNR}_{H}

) for CIS and QIS. (QIS1b: Single-bit QIS, QIS2b: Two-bit QIS). Ideal QIS and CIS are considered, i.e. the read noise is assumed to be zero in both cases. CIS is assumed to have a full well capacity of 4000 electrons. For single-bit QIS, binning of 1000 frames with

2\times 2

spatial oversampling is assumed. For 2-bit QIS,

T=333

frames with

2\times 2

spatial oversampling is assumed. QIS has a larger dynamic range than CIS for each exposure, and has a more consistent SNR over the entire range when the low dynamic range images are combined to get a single high-dynamic range image.

II Dynamic Range - QIS vs. CIS

The dynamic range is defined as the ratio of the illumination that just saturates the sensor to the read noise. As we do not consider any read noise in the sensors, the quantity dynamic range does not exist. So, instead we study dynamic range by analyzing the SNR for the sensors. In [fossum2013modeling], Fossum states that the exposure referred signal-to-noise ratio $\text{SNR}_{H}$ is a better metric because of non-linearity in the QIS response, than the regular voltage based SNR. So, in this section, we first derive the expression for $\text{SNR}_{H}$ , assuming an imaging model similar to [Chan16] and use it to characterize the dynamic range of the CIS and QIS.

Let $c$ denote the discretized illumination (photons per second) of a particular pixel we are interested in, in the scene we are capturing. Let $\tau$ denote the integration time. Then the number of photons $X$ reaching the sensor in a particular frame is modelled as a Poisson random variable $X\sim\text{Poisson}(\theta),$ where $\theta=\tau c$ . Let $B$ denote the output of the sensor of a sensor which can count up to $L$ photons. Then $B=X$ , if $X<L$ , and $B=L$ , if $X\geq L$ .

$\text{SNR}_{H}$ is the ratio of the exposure signal to the exposure referred noise. The exposure referred noise is defined as $\sigma_{H}={\sigma_{B}\frac{\partial\theta}{\partial\mu_{B}}}$ , where $\sigma_{B}$ is the standard deviation of $B$ and $\mu_{B}$ is the mean and $\theta=\tau c$ . So, for sum of T frames taken at a particular exposure, the exposure referred signal-to-noise ratio is

\text{SNR}_{H}=\frac{\theta\frac{\partial\mu_{B}}{\partial\theta}}{\sigma_{B}},

(1)

For sum of T frames taken at a particular exposure, the exposure signal is $T\theta$ and the exposure referred noise is $\sqrt{T}\sigma_{H}$ . So,

\text{SNR}_{H}=\frac{\sqrt{T}\theta\frac{\partial\mu_{B}}{\partial\theta}}{\sigma_{B}},

(2)

The expressions for $\mu_{B}$ and $\sigma_{B}$ , the mean and standard deviation of the output of the sensor $B$ is derived in theorem II and then we find the expression for $\frac{\partial\mu_{B}}{\partial\theta}$ .

{theorem}

Let $B$ (the output of the sensor) be a thresholded Poisson random variable defined as $B=X$ , if $X<L$ , and $B=L$ , if $X\geq L$ where $X\sim\text{Poisson}(\theta)$ , for some fixed integer $L>0$ . Then, the mean $\mu_{B}$ and the variance $\sigma_{B}^{2}$ of the random variable $B$ are

{align}

μ_B &= θ(ψ_L-1(θ)) + L(1 - ψ_L(θ))
σ_B^2 = L^2 - ∑_n=0^L-1 ((2n+1) ψ_n+1(θ)) - μ_B^2 where $\psi_{q}(s)=\sum\limits_{k=0}^{q-1}\frac{s^{k}e^{-s}}{k!}$ the incomplete gamma function.

{remark}

From Theorem II, we can obtain the expression for $\frac{\partial\mu_{B}}{\partial\theta}$ as

\frac{\partial\mu_{B}}{\partial\theta}=\cases{\psi}_{L-1}(\theta)-\theta\frac{e^{-\theta}\theta^{L-2}}{(L-2)!}+L\frac{e^{-\theta}\theta^{L-1}}{(L-1)!},L\geq 2\\ e^{-\theta},L=1

(3)

Now that we have the expression for $SNR_{H}$ , we conduct two experiments to study and compare the dynamic range of the CIS and QIS.

1. The exposure referred signal-to-noise ratio ( $\text{SNR}_{H}$ ) is compared for QIS and CIS in \freffig:SNR_CISvsQIS. The details on how the experiment was run can be found in the figure. Two main observations are made. First, notice that the Dynamic range of one single exposure for QIS in both modes are much larger than the dynamic range of the CIS. Also, notice that single bit QIS has a better range than the 2 bit QIS. This implies that QIS with lesser number of bits is more advantageous in terms of dynamic range. Second, thanks to the extended overexposure latitude, the combined $\text{SNR}_{H}$ of the QIS with multiple exposures is stable over a wide range of illumination, without the dipping effects that is observed in CIS. This means, we can achieve a consistent SNR over all parts of the HDR image in a QIS.

The two observations made here make it obvious that QIS has a clear advantage over CIS in HDR imaging. Not only does QIS have better dynamic range, it also has more consistent SNR over the range of illumination in which it is operating, compared to CIS which has sudden dips in the SNR.

\includegraphics[width=0.5]Cameraman256.png	\includegraphics[width=0.5]cbd.png
Ground Truth	CIS
	PSNR = 30.10dB
\includegraphics[width=0.5]1bd.png	\includegraphics[width=0.5]3bd.png
QIS (1-bit)	QIS (3-bit)
PSNR=39.01 dB	PSNR=39.82dB

Figure 3: Dynamic range of QIS and CIS. The image is setup in such a way that the maximum illumination of the image is

6\times 10^{6}

photons per pixel per second. CIS can count upto 4000 electrons, single bit QIS - 1 electon and 3 bit QIS - 7 electrons. The exposure times are: CIS - 1ms, single bit QIS -

0.25\mu s

, and 3 bit QIS -

1.75\mu s

. We use 1 CIS frame, 4000 frames for single bit QIS and 571 frames for 3 bit QIS. The red circles represent the regions where the loss of dynamic range in the CIS image is apparent.

2. In this experiment (\freffig:DR_cpmpare), we show using a synthetic imaging experiment that the QIS has a better dynamic range than the CIS. We again compare 3 different modes - CIS, one bit QIS and three bit QIS. There is no post processing of the data other than just tone mapping for the QIS. Notice that the PSNR for the QIS modes are much higher than the CIS. The figure also points towards regions where the smaller dynamic range of the CIS causes the loss of information in certain bright regions, which the QIS preserves.

While the first experiment showed that QIS is better at HDR imaging theoretically, the second experiment where we simulated both the QIS and CIS imaging, showed visually that QIS has a better dynamic range than the CIS.

References

[1] H. Kopka and P. W. Daly, A Guide to LaTeX, 3rd ed. Harlow, England: Addison-Wesley, 1999.

Bare Demo of IEEEtran.cls for IEEE Conferences

Abstract

I Introduction

II Dynamic Range - QIS vs. CIS

References

Bare Demo of IEEEtran.cls
for IEEE Conferences