This paper was converted on www.awesomepapers.org from LaTeX by an anonymous user.
Want to know more? Visit the Converter page.

Near-field spectroscopy of silicon dioxide thin films

L. M. Zhang Department of Physics, Boston University, 590 Commonwealth Avenue, Boston, Massachusetts, 02215    G. O. Andreev    Z. Fei    A. S. McLeod Department of Physics, University of California San Diego, 9500 Gilman Drive, La Jolla, California 92093    G. Dominguez    M. Thiemens Department of Chemistry, University of California San Diego, 9500 Gilman Drive, La Jolla, California 92093    A. H. Castro Neto Graphene Research Centre and Department of Physics, National University of Singapore, 2 Science Drive 3, 117542, Singapore    D. N. Basov    M. M. Fogler Department of Physics, University of California San Diego, 9500 Gilman Drive, La Jolla, California 92093
(August 15, 2025)
Abstract

We analyze the results of scanning near-field infrared spectroscopy performed on thin films of a-SiO2 on Si substrate. The measured near-field signal exhibits surface-phonon resonances whose strength has a strong thickness dependence in the range from 22 to 300nm300\,\text{nm}. These observations are compared with calculations in which the tip of the near-field infrared spectrometer is modeled either as a point dipole or an elongated spheroid. The latter model accounts for the antenna effect of the tip and gives a better agreement with the experiment. Possible applications of the near-field technique for depth profiling of layered nanostructures are discussed.

pacs:
68.37.Uv, 63.22.-mP

I Introduction

Scattering scanning near-field optical microscopy (s-SNOM) Keilmann and Hillenbrand (2004); Novotny and Hecht (2006); Keilmann and Hillenbrand (2009) is a powerful tool for probing local electromagnetic response of diverse materials. The s-SNOM achieves spatial resolution of 101020nm20\,\text{nm}, which is especially valuable in the physically interesting infrared region Basov et al. (2011); Basov and Chubukov (2011) where the resolution of conventional spectroscopy is fundamentally limited by a rather large wavelength λ5\lambda\sim 5500μm500\,\mu\text{m}. The s-SNOM techniques have been rapidly advancing, Amarie and Keilmann (2011); Huth et al. (2011) which enabled their applications to imaging spectroscopy of complex oxides Qazilbash et al. (2007); Zhan et al. (2007); Qazilbash et al. (2009); Frenzel et al. (2009); Lai et al. (2010); Qazilbash et al. (2011) and graphene. Fei et al. (ress)

Refer to caption
Figure 1: (Color online) Schematics of an s-SNOM experiment. A scanned probe, modeled as a metallic spheroid with length 2L2L and the apex curvature radius aa, is positioned distance ztipz_{\text{tip}} above the sample. The sample contains a film of thickness d1d_{1} and dielectric function ϵ1\epsilon_{1}, which is deposited on a bulk substrate with dielectric function ϵ2\epsilon_{2}. The system is illuminated by infrared field 𝑬ext\boldsymbol{E}_{\text{ext}} at an angle of incidence θ\theta. Scattering of this radiation by the tip creates evanescent waves with large in-plane momenta q1/aq\sim 1/a. The experiment measures the total radiating dipole pp of tip, which is determined by multiple reflections of the evanescent waves between the tip and sample. The reflections off the sample are characterized by the coefficient rP(q,ω)r_{{}_{\text{P}}}(q,\omega).

The s-SNOM utilizes scattering of incident light by the tip of an atomic force microscope (AFM) positioned next to the probed sample (Fig. 1). The tip couples to the sample via evanescent waves of large in-plane momenta q1/aq\sim 1/a, where aa is the tip radius of curvature (a few tens of nm). This is why the lateral resolution of the s-SNOM is determined primarily by aa rather than λ\lambdaLai et al. (2007); Huber et al. (2008); Olmon et al. (2008)

One of the interesting open questions is the depth (zz-coordinate) resolution of the s-SNOM probes. Previous experiments suggested that it is comparable to the lateral resolution a\sim a, based on imaging of small sub-surface particles. Taubner et al. (2005) Surprisingly, our recently near-field measurements of SiO2 thin films have demonstrated that films as thick as several hundred nm have a response clearly different from that of the bulk material. Andreev et al. Thus, if instead of particles one has layers, then the s-SNOM is able to detect them at much larger depths.

In this paper these experimental results are re-analyzed and compared with two theoretical models, the conventional point-dipole approximation Keilmann and Hillenbrand (2004); Hillenbrand et al. (2002) and the spheroidal model. The former is very simple to implement but is also very crude. Predictably, it yields a bulk-like response of the s-SNOM signal as soon as the SiO2 film thickness exceeds the tip radius, in disagreement with the experiment. A plausible reason for shortcomings of the point-dipole model is its failure to account for the strongly elongated shape of the tip. Such a tip acts as an optical antenna Keilmann and Hillenbrand (2004); Novotny and Hecht (2006); Keilmann and Hillenbrand (2009) that greatly enhances the electric field inside the tip-sample nanogap. Unfortunately, analytical models, Cvitkovic et al. (2007); Moon et al. (2011) that attempt to treat elongated tips do not apply to layered substrates. This compels us to study the problem numerically.

Refer to caption
Refer to caption
Refer to caption
Figure 2: (Color online) (a) Main panel: Measured infrared near-field spectra for several SiO2 film thicknesses. The quantity plotted is the absolute value s3s_{3} of the third harmonic of the scattering amplitude normalized by that for the Si wafer. (b) Theoretical results for the spheroid model with a=30nma=30\,\text{nm} and L=15aL=15a. (c) Theoretical results for the point-dipole model with a=30nma=30\,\text{nm} and b=0.75ab=0.75a.

To make the calculations tractable, we follow examples in the literature Porto et al. (2003); Renger et al. (2005); Esteban et al. (2009) and model the tip as a metallic spheroid of total length 2La2L\gg a, see Fig. 1. As shown below, this gives results in a much better agreement with the experiment in terms of both the frequency and the thickness dependence of the near-field signal. We attribute the origin of the more gradual film-thickness dependence in the spheroidal model to the aforementioned “antenna effect.” The magnitude of this effect is determined by the material response over length scales ranging from aa to 2L2L, and so it truly saturates only when the film thickness becomes much larger than 2L2L.

The remainder of the paper is organized as follows. In Sec. II we summarize the experimental procedures and results. In Secs. III and IV we discuss the two theoretical models and compare their predictions with the measurements. Concluding remarks are given in Sec. V.

II Experiment

To make the paper self-contained we summarize the results of our recent experiments Andreev et al. in this Section. We investigated commercially available calibration gratings, which contain strips or islands of SiO2 thermally grown on Si. The manufacturer specified thicknesses of the SiO2 layer spanned the range d1=2d_{1}=2, 1818, 2222, 108108, and 300nm300\,\text{nm}. A combination of CO2 and tunable quantum cascade lasers (Daylight Solutions) allowed us to cover the frequency range between 890 cm1\text{cm}^{-1} and 1250 cm1\text{cm}^{-1}. The near-field data were collected using a Neaspec system.

The measured s-SNOM signal represents the electromagnetic field backscattered by the probe and the scanned sample. The complex amplitude s(ω,t)s(\omega,t) of the backscattered field varies periodically with the tapping frequency Ω40kHz\Omega\sim 40\,\text{kHz} as the distance ztipz_{\text{tip}} between the sample and the nearest point of the tip undergoes harmonic oscillations

ztip(t)=z0+Δz(1cosΩt),z_{\text{tip}}(t)=z_{0}+\Delta z\,(1-\cos\Omega t)\,, (1)

where Δz=50nm\Delta z=50\,\text{nm} typically. In order to suppress unwanted background and isolate the part of the signal scattered by the probe tip, the signal is demodulated. Namely, we extracted the absolute values sn(ω)s_{n}(\omega) and phases ϕn(ω)\phi_{n}(\omega) at tapping harmonics

sneiϕn=0TdtTeinΩts(ω,t),T=2πΩ.s_{n}e^{i\phi_{n}}=\int\limits_{0}^{T}\frac{dt}{T}\,e^{in\Omega t}\,s(\omega,t)\,,\quad T=\frac{2\pi}{\Omega}\,. (2)

The experimental results for the spectra are shown in Fig. 2(a). These spectra were intended to be taken in the tapping mode, i.e., for zero z0z_{0}. However, experimentally z0z_{0} can be determined only up to an additive constant 1nm\sim 1\,\text{nm}. Therefore, we measured z0z_{0}-dependence of s3s_{3} (the approach curves) shown in Fig. 3(a) and selected the largest observed s3s_{3}. Our results are in a qualitative agreement with previous experimental study, Taubner et al. (2005) which reported approach curves for SiO2 at a few discrete frequencies and film thicknesses.

Refer to caption
Refer to caption
Refer to caption
Figure 3: (Color online) Approach curves. (a) Experimental data for the 105-nm thick SiO2. Theoretical results for the spheroid (b) and the point-dipole models (c) using the same parameters as in Fig. 2.

The data points in Fig. 2(a) represent the normalized amplitude s3(SiO2)/s3(Si)s_{3}(\text{SiO}_{2})/s_{3}(\text{Si}), where s3(SiO2)s_{3}(\text{SiO}_{2}) and s3(Si)s_{3}(\text{Si}) are the raw third-order demodulation signals averaged over the entire SiO2 and Si areas, respectively. The statistical uncertainty of these averaged data traces is about 2%2\%.

For each thickness studied, the normalized amplitude s3(SiO2)/s3(Si)s_{3}(\text{SiO}_{2})/s_{3}(\text{Si}) exhibits several maxima. The main peak is situated at ω1130cm1\omega\approx 1130\,\text{cm}^{-1}. The key aspect of the data is a rapid decrease in the normalized amplitude of this peak as the thickness is reduced. A trace of this resonance can be reliably identified even for the 22-nm thick SiO2 film. Another notable feature is the growing strength and frequency shift of the secondary peaks on the high-ω\omega side of the main peak as d1d_{1} is decreased.

Since the response of Si is frequency independent in our experimental range, the frequency dependence of the spectra in Fig. 2(a) originates from that of SiO2. We attribute the maxima of s3(SiO2)/s3(Si)s_{3}(\text{SiO}_{2})/s_{3}(\text{Si}) to the phonon modes localized at the air-SiO2 interface. Amarie and Keilmann (2011) These resonances occur in the frequency region between the bulk transverse and longitudinal modes of SiO2 (the outer dashed lines in Fig. 4 below).

The results of our theoretical calculations for the normalized scattering amplitude are presented in the remaining panels of Figs. 2 and 3. They are discussed and compared with the experimental findings in the following Sections.

III Response functions and collective modes

The sample is modeled as a two-layer system. The first layer with dielectric function ϵ1(ω)\epsilon_{1}(\omega) occupies the slab d1<z<0-d_{1}<z<0. The second layer with dielectric function ϵ2(ω)\epsilon_{2}(\omega) occupies the half-space z<d1z<-d_{1}. The half-space z>0z>0 (“layer 0”) is filled with air (dielectric constant ϵ0=1\epsilon_{0}=1). The fundamental response functions of the system are the reflection coefficients rX(q,ω)r_{X}(q,\omega), which are functions of in-plane momentum qq, frequency ω\omega, and polarization X=SX=S or PP. The domain of definition of rX(q,ω)r_{X}(q,\omega) is understood to include nonradiative modes q>ϵ0ω/cq>\sqrt{\epsilon_{0}}\,\omega/c. It is known from previous studies that the s-SNOM signal is dominated by the PP-polarized waves. In our two-layer model their reflection coefficient is given by a Fresnel-like formula

rP(q,ω)\displaystyle r_{{}_{\text{P}}}(q,\omega) =ϵk0zϵ0k1zϵk0z+ϵ0k1z,\displaystyle=\frac{\epsilon_{*}k^{z}_{0}-\epsilon_{0}k^{z}_{1}}{\epsilon_{*}k^{z}_{0}+\epsilon_{0}k^{z}_{1}}\,, (3)
ϵ(q,ω)\displaystyle\epsilon_{*}(q,\omega) =ϵ1ϵ2k1zϵ1k1ztanhik1zd1ϵ1k2zϵ2k1ztanhik1zd1,\displaystyle=\epsilon_{1}\,\frac{\epsilon_{2}k^{z}_{1}-\epsilon_{1}k^{z}_{1}\tanh ik^{z}_{1}d_{1}}{\epsilon_{1}k^{z}_{2}-\epsilon_{2}k^{z}_{1}\tanh ik^{z}_{1}d_{1}}\,, (4)

where zz-axis momenta kjzk^{z}_{j} are defined by

kjz=ϵjω2c2q2,Imkjz0.k^{z}_{j}=\sqrt{\epsilon_{j}\,\frac{\omega^{2}}{c^{2}}-q^{2}}\,,\quad\text{Im}\,k^{z}_{j}\geq 0\,. (5)

Equation (3) is valid for arbitrary qq. In the near-field case where qq is large and kjziqk^{z}_{j}\simeq iq, it simplifies to

rP(q,ω)ϵϵ0ϵ+ϵ0,ϵϵ1ϵ2+ϵ1tanhqd1ϵ1+ϵ2tanhqd1.r_{{}_{\text{P}}}(q,\omega)\simeq\frac{\epsilon_{*}-\epsilon_{0}}{\epsilon_{*}+\epsilon_{0}}\,,\quad\epsilon_{*}\simeq\epsilon_{1}\frac{\epsilon_{2}+\epsilon_{1}\tanh qd_{1}}{\epsilon_{1}+\epsilon_{2}\tanh qd_{1}}\,. (6)

Assuming all ϵj\epsilon_{j} are qq-independent, the effective dielectric function ϵ(q,ω)\epsilon_{*}(q,\omega) depends on qq only via the product qd1qd_{1} in this limit. Therefore, rP(q,ω)r_{{}_{\text{P}}}(q,\omega) for one thickness d1d_{1} can be obtained from another by rescaling qq. As discussed in Sec. I and shown in more detail below, the most important momenta are q1/aq\sim 1/a where a30nma\sim 30\,\text{nm} is the tip radius. Therefore, we can get an approximate understanding of the system response by examining the behavior of rP(q,ω)r_{{}_{\text{P}}}(q,\omega) as a function of ω\omega at fixed qd1d1/aqd_{1}\sim d_{1}/a. This behavior is dictated by the spectrum of surface collective modes, as follows.

In general, surface modes correspond to poles of the response functions rXr_{X}. Function rPr_{{}_{\text{P}}} given by Eq. (6) can have up to two poles at each qd1qd_{1}, see, e.g., Ref. Prade et al., 1991. They are defined by the following condition on ϵ1(ω)\epsilon_{1}(\omega):

ϵ1(ω)=ϵ0+ϵ22tanhqd1±(ϵ0+ϵ2)24tanh2qd1ϵ0ϵ2.\epsilon_{1}(\omega)=-\frac{\epsilon_{0}+\epsilon_{2}}{2\tanh qd_{1}}\pm\sqrt{\frac{(\epsilon_{0}+\epsilon_{2})^{2}}{4\tanh^{2}qd_{1}}-\epsilon_{0}\epsilon_{2}}\,. (7)

At large qd1qd_{1}, where tanhqd1=1\tanh qd_{1}=1, this condition yields ϵ1(ω)=ϵ0\epsilon_{1}(\omega)=-\epsilon_{0} or ϵ1(ω)=ϵ2\epsilon_{1}(\omega)=-\epsilon_{2}, which correspond to modes localized at the upper 011 and the lower 1122 interfaces, respectively. Actually, the latter “pole” has vanishingly small residue because evanescent waves do not reach the lower interface at qd1=qd_{1}=\infty. There is no qq-dispersion and no coupling of the two modes in this limit. The dispersion appears at finite qd1qd_{1}, where the two modes become mixed. In particular, we find

ϵ1(ω)\displaystyle\epsilon_{1}(\omega) qd1ϵ01+ϵ21,\displaystyle\simeq-\frac{qd_{1}}{\epsilon_{0}^{-1}+\epsilon_{2}^{-1}}\,, “0–1” (8a)
ϵ0+ϵ2qd1\displaystyle\simeq-\frac{\epsilon_{0}+\epsilon_{2}}{qd_{1}} “1–2” (8b)

at qd11qd_{1}\ll 1. At finite qq, both interfaces participate in generating these excitations. The labels “0–1” and “1–2” are for convenience: they indicate at which interface a given dispersion branch is ultimately localized as qq increases. At qd1=0qd_{1}=0, the “0–1” and “1–2” branches are characterized by ϵ1(ω)=0\epsilon_{1}(\omega)=0 and ϵ1(ω)=\epsilon_{1}(\omega)=-\infty, which correspond, respectively, to the bulk longitudinal and transverse phonon frequencies ωLO\omega_{{}_{\text{LO}}} and ωTO\omega_{{}_{\text{TO}}}.

If we try to apply this formalism to real materials, we face the problem that Eq. (7) has no solutions for real ω\omega because the dielectric functions have finite imaginary parts. This is why in practice the collective mode spectra are usually defined differently. They are identified with the maxima of dissipation, i.e., ImrP\text{Im}\,r_{{}_{\text{P}}}. The number of these maxima can be fewer than the total allowed number of the modes because some of them can be overdamped. Similarly, we define ωLO\omega_{{}_{\text{LO}}} and ωTO\omega_{{}_{\text{TO}}} as the frequencies that correspond to the maxima of Imϵ11(ω)-\text{Im}\,\epsilon_{1}^{-1}(\omega) and Imϵ1(ω)\text{Im}\,\epsilon_{1}(\omega).

To see what kind of spectra are realized in our system, we use our ellipsometry data for ϵ1(ω)\epsilon_{1}(\omega) [Fig. 4(a)] and Eq. (6) to compute rPr_{{}_{\text{P}}} for several values of qd1qd_{1}. The plot of these quantities as a function of ω\omega is presented in Fig. 4(c). Three maxima on each curve in the region of primary interest ω>1000cm1\omega>1000\,\text{cm}^{-1} are apparent. They exist already at qd1=qd_{1}=\infty, and so all of them belong to the upper (air-SiO2) interface. In fact, we do not expect sharp modes at the lower (SiO2-Si) interface because the dielectric function of Si is quite large ϵ211.7\epsilon_{2}\approx 11.7 in the studied range of ω\omega. The lowest value of Reϵ15.0\text{Re}\,\epsilon_{1}\approx-5.0 is not sufficient to compensate ϵ2\epsilon_{2} and generate “1-2” modes, cf. Eq. (7).

Refer to caption
Figure 4: (Color online) (a) Dielectric function of bulk SiO2 as a function of frequency from ellipsometry. (b) The real and (c) the imaginary parts of the near-field reflection coefficient, rP(q,ω)r_{{}_{\text{P}}}(q,\omega) for several qd1qd_{1}. In all the panels three dashed lines indicate the transverse optical phonon frequency ωTO1074cm1\omega_{{}_{\text{TO}}}\approx 1074\,\text{cm}^{-1}, the surface optical phonon frequency ωSP1164cm1\omega_{{}_{\text{SP}}}\approx 1164\,\text{cm}^{-1}, and the longitudinal optical phonon frequency ωLO1263cm1\omega_{{}_{\text{LO}}}\approx 1263\,\text{cm}^{-1}.

The main peak of ImrP\text{Im}\,r_{{}_{\text{P}}} at qd1=qd_{1}=\infty defines the surface phonon frequency of SiO2 ωSP1164cm1\omega_{{}_{\text{SP}}}\approx 1164\,\text{cm}^{-1}. There also exist secondary peaks at ω1100cm1\omega\approx 1100\,\text{cm}^{-1} and ω1220cm1\omega\approx 1220\,\text{cm}^{-1}. Their evolution as a function of qd1qd_{1} comply with the general scheme outlined above. As qd1qd_{1} decreases, all the three peaks loose strength, as expected, because the amount of SiO2 diminishes. The lower-ω\omega secondary peak redshifts, moving towards ωTO\omega_{{}_{\text{TO}}}, and then quickly disappears. This agrees with the SiO2-Si resonance being highly damped. The higher-ω\omega secondary peak becomes dominant at qd1<0.5qd_{1}<0.5 and demonstrates a systematic shift towards ωLO\omega_{{}_{\text{LO}}}, see Fig. 4(c).

A notable feature of Fig. 4(b) is the clustering of the crossing points of the different curves near ω=1036cm1\omega=1036\,\text{cm}^{-1}. This is the frequency where the dielectric function of SiO2 is the closest to that of Si, ϵ211.7\epsilon_{2}\approx 11.7. As a result, the two layers act almost as one bulk material, so that rP(ω)r_{{}_{\text{P}}}(\omega) is approximately thickness-independent.

There is a qualitative correspondence between the features displayed by the reflection coefficient rPr_{{}_{\text{P}}} and the observed near-field signal s3(SiO2)/s3(Si)s_{3}(\text{SiO}_{2})/s_{3}(\text{Si}), cf. Figs. 2(a) and 4(b),(c). However, the relation between rP(q,ω)r_{{}_{\text{P}}}(q,\omega) and the measured s-SNOM signal is nontrivial. For example, the frequency positions of the maxima in ImrP(q,ω)\text{Im}\,r_{{}_{\text{P}}}(q,\omega) and those in s3(SiO2)/s3(Si)s_{3}(\text{SiO}_{2})/s_{3}(\text{Si}) differ by as much as 40cm140\,\text{cm}^{-1}. We also suspect that there may be some slight differences between the optical constants of thick films we assume in our calculations and those of the small SiO2 structures we probe by the s-SNOM. This is the likely reason why the crossing point of the experimental curves occurs near 1060cm11060\,\text{cm}^{-1} rather than 1036cm11036\,\text{cm}^{-1} predicted by both our models, cf. Fig. 2.

Developing a reliable procedure for inferring rP(q,ω)r_{{}_{\text{P}}}(q,\omega) from s3s_{3} remains a challenge for the theory. The next section presents our current approach towards this ultimate goal.

IV Tip-sample interaction

Both radiative and nonradiative waves may play significant roles in the s-SNOM experiment. Porto et al. (2003) The radiative modes magnify the signal by a certain far-field factor (FFF) F(qs,ω)F(q_{s},\omega), where qs=(ω/c)sinθq_{s}=(\omega/c)\sin\theta is the momentum of these modes for the angle of incidence θ\theta. The nonradiative modes influence the effective polarizability χ(ω,ztip)\chi(\omega,z_{\text{tip}}) of the tip, i.e., the ratio of its dipole moment pzp^{z} and the external electric field EextzE_{\text{ext}}^{z}. Altogether the demodulated s-SNOM signal sneiϕns_{n}e^{i\phi_{n}} can be written as

sneiϕn\displaystyle s_{n}e^{i\phi_{n}} χnEextsin2θF(qs,ω),\displaystyle\propto\chi_{n}E_{{}_{{}_{\text{ext}}}}\sin 2\theta\,F(q_{s},\omega)\,, (9)
χn(ω)\displaystyle{\chi}_{n}(\omega) =0TdtTeinΩtχ(ω,ztip(t)).\displaystyle=\int\limits_{0}^{T}\frac{dt}{T}\,e^{in\Omega t}\,{\chi}\bigr{(}\omega,z_{\text{tip}}(t)\bigl{)}\,. (10)

Below we discuss the FFF and the tip polarizability separately.

IV.1 Far-field factor

The FFF for an infinite layered system is given by Sukhov (2004); Aizpurua et al. (2008)

F(qs,ω)=[1+rP(qs,ω)]2.F(q_{s},\omega)=[1+r_{{}_{\text{P}}}(q_{s},\omega)]^{2}\,. (11)

As shown in Figs. 5(a), for d1=300nmd_{1}=300\,\text{nm} SiO2 film, the absolute value of the FFF has a maximum near ω TO1074cm1\omega_{\text{\,TO}}\approx 1074\,\text{cm}^{-1} and a suppression near ω LO1272cm1\omega_{\text{\,LO}}\approx 1272\,\text{cm}^{-1}. For thinner films, these features are less pronounced. The main maximum of s3s_{3}, which is the main focus of our analysis, is away from both ω TO\omega_{\text{\,TO}} and ω LO\omega_{\text{\,LO}}. It is essentially unaffected by the FFF. Still, if FFF were to be included in the calculation in the form prescribed by Eq. (11), it would produce a visible hump of s3(ω)s_{3}(\omega) near ω TO\omega_{\text{\,TO}} and a dip near ω LO\omega_{\text{\,LO}}. These features are not present in the experimental data, Fig. 2(a). A better agreement with the experiment is obtained if F(qs,ω)F(q_{s},\omega) is set to a constant, which is what we do here. We rationalize this decision by noting that the SiO2 layer in the actual samples does not extend over the entire xxyy plane but occupies only small sub-wavelength regions. Therefore, the FFF is dominated by the ω\omega-independent response of Si.

Refer to caption
Figure 5: (Color online) (a) The absolute value and (b) the phase of the far-field factor computed as a function of frequency for the incidence angle θ=45\theta=45^{\circ}. The black trace is for bulk Si substrate, the blue one is for bulk SiO2 substrate, the green one is for 300nm300\,\text{nm} thick SiO2 followed by bulk Si. The meaning of the dashed lines is the same as in Fig. 4.

IV.2 Point-dipole model of the tip

The effective tip polarizability χ(ω,ztip)\chi(\omega,z_{\text{tip}}) is the most important factor on the right-hand side of Eq. (9) and it is also the most difficult one to compute. This quantity is dictated by the near-field coupling between the tip and the sample. For irregular tip shapes it can be calculated only numerically. However, previous s-SNOM studies demonstrated that acceptable results can often be obtained if the tip is approximated by a spheroid, Porto et al. (2003); Renger et al. (2005); Esteban et al. (2009) a small sphere, Rendell and Scalapino (1981); Aravind and Metiu (1982, 1983); Ruppin (1992); Sukhov (2004); Renger et al. (2005) a “finite” dipole, Cvitkovic et al. (2007); Amarie and Keilmann (2011) or a point dipole. Hillenbrand and Keilmann (2000); Taubner et al. (2004); Aizpurua et al. (2008) The actual tip shape in our experiment is close to a rounded pyramid.

The point-dipole approximation is the simplest one and it has been used extensively for modeling s-SNOM experiments, including those performed on multilayer systems. Aizpurua et al. (2008); Fei et al. (ress) The point-dipole model has two adjustable parameters: the polarizability a3a^{3} of the effective dipole and its position bb with respect to the bottom of the tip. The results obtained following the standard analysis Aizpurua et al. (2008); Fei et al. (ress) are shown in Fig. 2(c) using a=30nma=30\,\text{nm} and b=0.75ab=0.75a. We see that even for this rather large aa the point-dipole model does not reproduce the observed strong dependence of s3s_{3} on thickness at d1>22nmd_{1}>22\,\text{nm}.

The discrepancy can be seen more clearly in Fig. 6, where the height of the peak in s3(SiO2)/s3(Si)s_{3}(\text{SiO}_{2})/s_{3}(\text{Si}) corresponding to the surface phonon is plotted as a function of d1d_{1}. For the point dipole model the curve flattens at d1bd_{1}\sim b. In contrast, the experimentally observed s3(SiO2)/s3(Si)s_{3}(\text{SiO}_{2})/s_{3}(\text{Si}) maximum continues to rise with d1d_{1}. The point-dipole model also predicts a very steep approach curve, Fig. 3(c), in poor agreement with the measurements.

The physical origin of the saturation of the thickness dependence in Fig. 2(c) is easy to understand. One can think about the near-field coupling between the point dipole and the sample in terms of the method of images. For a dipole positioned at zpd=ztip+bz_{\text{pd}}=z_{\text{tip}}+b, the image is concentrated at the depth zpdz_{\text{pd}} below the surface. Therefore, films of thickness larger than zpdz_{\text{pd}} would act as a bulk material. Another way to arrive at the same conclusion is to notice that the characteristic range of momenta of the relevant nonradiative waves is q1/zpdq\lesssim 1/z_{\text{pd}}. Since rPr_{{}_{\text{P}}} depends on qq through the term tanhqd1\tanh qd_{1} [Eq. (6)], the dependence of the near-field coupling on d1d_{1} should saturate at d1zpdbd_{1}\gtrsim z_{\text{pd}}\sim b.

IV.3 Spheroid model of the tip

The lack of saturation in the observed s-SNOM signal as a function of d1d_{1} at d1ad_{1}\gg a indicates that evanescent waves with momenta q1/aq\ll 1/a also play an important role in the near-field coupling between the tip and the sample. This is a signature of models in which the tip has a finite extent in space 2La2L\gg a, see Fig. 1. Although such models are certainly more realistic than a point-dipole approximation, there has not been a systematic study of how the results would depend on the exact shape of the tip. Given some initial success of the point-dipole approximation, we speculate that a suitable simple shape can provide a good compromise between increase in computational effort and ability to capture relevant physics.

To test this idea, we model the tip as an elongated metallic spheroid positioned above a two-layer medium. This follows a tradition in the literature wherein similar models were considered Porto et al. (2003); Renger et al. (2005); Esteban et al. (2009) for the case of bulk substrates. In Ref. Cvitkovic et al., 2007 an analytical formula for the spheroidal tip was also proposed, based on heuristic arguments. However, it cannot be easily extended to the qq-dependent rPr_{{}_{\text{P}}} we study here. Instead, our calculations are done numerically. They involve only two essential approximations. One is neglecting retardation, which is justified is the length 2L2L of the spheroid is smaller than λ\lambda. The other one is neglecting the finite skin depth of the metal (Pt-Ir alloy) covering the tip. Due to computational difficulties involved, this issue is left for future investigation.

The calculations were performed in two ways. First is the standard boundary-element method. In this method we divide the entire tip — assuming azimuthal symmetry — into a large number (typically, 200) of small cylindrical segments. We assume that different segments interact by Coulomb interaction as coaxial rings. The interaction of each segment with itself is defined in such a way that the polarizability of the tip in the absence of the sample coincides with the known analytical result for the prolate spheroid. The effect of the sample is included by adding ring-ring interactions mediated by reflected electrostatic fields. This is accomplished by numerical quadrature over the product of rP(q,ω)r_{{}_{\text{P}}}(q,\omega) and suitable form-factors. This is the most time-consuming step of the simulation. After the interaction kernel is generated in this way, it is straightforward to solve numerically for the dipole moment of the tip induced by a unit external field, which is the desired polarizability χ(ω,ztip)\chi(\omega,z_{\text{tip}}).

We also developed a second numerical method of computing χ\chi (to be described elsewhere), based on an expansion of the electric field in ellipsoidal harmonics. This alternative method is similar to that used for a metallic sphere above a dielectric half-space. Ford and Weber (1984); Sukhov (2004) We verified that the two methods give identical results.

Substituting the computed polarizability χ\chi into Eqs. (9) and demodulating per Eq. (10), we obtain approach curves. Figure 3(b) illustrates that some approach curves are nonmonotonic near the resonances. In calculating the s-SNOM amplitude s3s_{3} we choose ztipz_{\text{tip}} that corresponds to the largest s3s_{3} because this is how it was done in the experiments. The results for the normalized amplitude are plotted in Fig. 2(b).

The spheroid model has two adjustable parameters: the apex radius of curvature aa and the half-length LL. When L=aL=a the spheroid becomes a sphere. In this case the spheroid model gives results similar to the point-dipole model, i.e., Fig. 2(c). As the ratio L/aL/a increases, the differences appear. However, once L/aL/a exceeds ten, the normalized signal s3(SiO2)/s3(Si)s_{3}(\text{SiO}_{2})/s_{3}(\text{Si}) does not change much at d1300nmd_{1}\leq 300\,\text{nm}. Therefore, for long spheroids we effectively have only a single adjustable parameter, aa. Remarkably, the thickness dependence of the s3s_{3} peak for the spheroid model matches the experiment extremely well (Fig. 6).

Refer to caption
Figure 6: (Color online) The thickness dependence of the s3s_{3} peak for different tip models. The circles represent the point-dipole calculations, one for a=30nma=30\,\text{nm} (blue) and the other for a=50nma=50\,\text{nm} (green). The diamonds are for the spheroid model with a=30nma=30\,\text{nm} and L=15aL=15a, the same as in Figs. 2(b). The black squares are derived from the experimental data shown in Fig. 2(a) after some smoothing over fluctuations.

V Conclusions

In this paper we analyzed the results of experimental study of amorphous SiO2 films on Si obtained by scanning near-field optical spectroscopy. Andreev et al. We discussed the collective mode spectra of such structures and compared measurements with two theoretical calculations. The first is based on a conventional approximation in which the tip of the scanned probe is modeled as a point dipole. In the second the tip is treated as an elongated spheroid, significantly improving agreement with the experiment.

We explain the qualitative difference between the two models as follows. An important physical ingredient missing in the point-dipole model is the enhancement of the electric field near the apex of the tip — the antenna effect. This phenomenon is well-known from classical electrostatics. The enhancement of the field is controlled primarily by the ratio of the total length of the tip 2L2L (actually, the smaller of 2L2L and λ\lambda) and the apex radius of curvature a\sim a. The point-dipole model has been successful in the past without this enhancement factor only on account of the normalization procedure. Instead of absolute sns_{n}, one usually reports sns_{n} normalized to some reference material such as Au or in our case, Si. This way, one eliminates any possible frequency dependence of the source radiation, but at the same time cancels the part of the signal scaling with tip size. For a stratified sample this cancellation is imperfect because the the field enhancement depends also on the dielectric response of the sample, which is a function of momentum qq. For a tip of length 2L2L, harmonics relevant for the field enhancement have momenta ranging from q1/aq\sim 1/a down to q1/Lq\sim 1/L. Therefore, one may expect that the dependence of the s-SNOM signal on the thickness d1d_{1} of the top layer would saturate only when d1Ld_{1}\sim L. Our simulations provide direct evidence for this claim. Therefore, we think that the spheroid model holds a great promise as an analysis tool for near-field experiments. It captures a lot of physics relevant to the near-field interaction while remaining computationally fast.

The strong experimentally observed thickness dependence of the near-field signal Andreev et al. indicates that s-SNOM is capable of not only high lateral resolution but can also probe the system in the third dimension. However, the response of a layered system is different from those containing small subsurface particles Taubner et al. (2005). We hope that experimental and theoretical approaches presented in this paper may be of use for accurate depth profiling of various dielectric and metallic nanostructures.

The work at UCSD is supported by ONR, AFOSR, NASA, and UCOP. AHCN and LMZ acknowledge DOE grant DE-FG02-08ER46512 and ONR grant MURI N00014-09-1-1063. We thank F. Keilmann and R. Hillenbrand for illuminating discussions.

References