Skip to article frontmatterSkip to article content
Site not loading correctly?

This may be due to an incorrect BASE_URL configuration. See the MyST Documentation for reference.

1Introduction

Although the model of geometrical optics helps us to design optical systems and explains many phenomena, there are properties of light that require a more elaborate model. For example, interference fringes observed in Young’s double-slit experiment or the Arago spot (Figure 3) indicate that light is more accurately modeled as a wave.

Arago spot observed with a 4 mm diameter circular disc. The bright central spot appears at the center of the disc’s shadow, demonstrating the wave nature of light through constructive interference of diffracted waves. Image captured at 1 m distance using 633 nm laser light.

Arago spot observed with a 4 mm diameter circular disc. The bright central spot appears at the center of the disc’s shadow, demonstrating the wave nature of light through constructive interference of diffracted waves. Image captured at 1 m distance using 633 nm laser light.

Arago spot observed with a 2 mm diameter circular disc. The relative size of the central bright spot increases compared to the disc diameter, as diffraction effects become more pronounced with smaller obstacles. Image captured at 1 m distance using 633 nm laser light.

Arago spot observed with a 2 mm diameter circular disc. The relative size of the central bright spot increases compared to the disc diameter, as diffraction effects become more pronounced with smaller obstacles. Image captured at 1 m distance using 633 nm laser light.

The Arago spot is the bright spot which occurs at the center of the shadow of a circular disc and which is caused by diffraction. The disc has diameter 4 mm, 2 mm and 1 mm, from left to right, the wavelength is 633 nm and the intensity is recorded at 1 m behind the disc and has width of 16 mm

Figure 3:The Arago spot is the bright spot which occurs at the center of the shadow of a circular disc and which is caused by diffraction. The disc has diameter 4 mm, 2 mm and 1 mm, from left to right, the wavelength is 633 nm and the intensity is recorded at 1 m behind the disc and has width of 16 mm

In this chapter we will study the wave model of light. It will be shown that the extent to which light can show interference depends on a property called coherence. In the largest part of the discussion we will assume that all light has the same polarization, so that we can treat the fields as scalar. In the last section we will look at how polarization affects interference, as described by the Fresnel-Arago laws.

It is important to note that the concepts of interference and coherence are not just restricted to optics. Since quantum mechanics dictates that particles have a wave-like nature, interference and coherence also play a role in e.g. solid state physics and quantum information.

2Interference of Monochromatic Fields of the Same Frequency

Let us first recall the basic concepts of interference. What causes interference is the fact that light is a wave, which means that it not only has an amplitude but also a phase. Suppose for example we evaluate a time-harmonic field in two points

U1(t)=cos(ωt),U2(t)=cos(ωt+φ).\begin{align*} \mathcal{U}_1(t)=\cos(\omega t), \quad \mathcal{U}_2(t)=\cos(\omega t +\varphi). \end{align*}

Here φ\varphi denotes the phase difference between the fields at the two points. If φ=0\varphi=0, or φ\varphi is a multiple of 2π2\pi, the fields are in phase, and when they are added they interfere constructively

U1(t)+U2(t)=cos(ωt)+cos(ωt+2mπ)=2cos(ωt).\begin{align*} \mathcal{U}_1(t)+\mathcal{U}_2(t)=\cos(\omega t) + \cos(\omega t + 2 m \pi) =2\cos(\omega t). \end{align*}

However, when φ=π\varphi=\pi, or more generally φ=π+2mπ\varphi=\pi+ 2 m \pi, for some integer mm, then the waves are out of phase, and when they are superimposed, they interfere destructively.

U1(t)+U2(t)=cos(ωt)+cos(ωt+π+2mπ)=cos(ωt)cos(ωt)=0\begin{align*} \begin{split} \mathcal{U}_1(t)+\mathcal{U}_2(t)&=\cos(\omega t)+\cos(\omega t+\pi+ 2 m\pi) \\ &=\cos(\omega t)-\cos(\omega t) \\ &=0 \end{split} \end{align*}

We can sum the two fields for arbitrary φ\varphi more conveniently using complex notation:

U1(t)=Re[eiωt],    U2(t)=Re[eiωteiφ].\begin{align*} \mathcal{U}_1(t)=\text{Re}[ e^{-i\omega t}], \; \; \mathcal{U}_2(t) =\text{Re}[ e^{-i\omega t}e^{-i\varphi}]. \end{align*}

Adding gives

U1(t)+U2(t)=Re[eiωt(1+eiφ)]=Re[eiωteiφ/2(eiφ/2+eiφ/2)]=Re[eiωteiφ/22cos(φ/2)]=2cos(φ/2)cos(ωt+φ/2).\begin{align*} \mathcal{U}_1(t)+\mathcal{U}_2(t)&= \text{Re}[e^{-i\omega t}(1+e^{-i\varphi})] &=\text{Re}[e^{-i\omega t}e^{-i\varphi/2}(e^{i\varphi/2}+e^{-i\varphi/2})] &=\text{Re}[e^{-i\omega t}e^{-i\varphi/2}2\cos(\varphi/2)] &= 2\cos(\varphi/2)\cos(\omega t+\varphi/2). \end{align*}

For φ=2mπ\varphi= 2 m \pi and φ=π+2mπ\varphi=\pi+2 m \pi we retrieve the results obtained before. It is important to realize that what we see or detect physically (the ‘brightness’ of light) does not correspond to the quantities U1\mathcal{U}_1, U2\mathcal{U}_2. After all, U1\mathcal{U}_1 and U2\mathcal{U}_2 can attain negative values, while there is no such thing as ‘negative brightness’. What U1\mathcal{U}_1 and U2\mathcal{U}_2 describe are the fields, which may be positive or negative. The ‘brightness’ or the irradiance or intensity is given by taking an average over a long time of U(t)2\mathcal{U}(t)^2 (as discussed in Chapter 1), we shall omit the factor ϵ/μ0\sqrt{\epsilon/\mu_0} . As explained in Chapter 1, we see and measure only the long time-average of U(t)2\mathcal{U}(t)^2, because at optical frequencies U(t)2\mathcal{U}(t)^2 fluctuates very rapidly. We recall the definition of the time average over an interval of length TT at a specific time tt from Chapter 1:

f(t)=1Ttt+Tf(t)dt,\begin{align*} \langle f(t) \rangle= \frac{1}{T}\int_{t}^{t+T}f(t')\,\text{d}t', \end{align*}

where TT is a time interval that is the response time of a typical detector, i.e. T106sT\approx 10^{-6}\,\text{s} which is extremely long compared to the period of visible light which is of the order of 1014s10^{-14}\, \text{s}. For a time-harmonic function, the long-time average is equal to the average over one period of the field and hence it is independent of the time tt at which it is taken. Indeed for Eq. (5) we get

I=(U1(t)+U2(t))2=4cos2(φ/2)cos2(ωt+φ/2)=2(1+cosϕ)cos2(ωt+φ/2)=1+cos(φ)\begin{align*} I &= \langle (\mathcal{U}_1(t)+\mathcal{U}_2(t))^2 \rangle &=4\cos^2(\varphi/2) \langle \cos^2(\omega t+\varphi/2) \rangle &= 2(1+\cos\phi) \langle \cos^2(\omega t+\varphi/2) \rangle \\ &= 1 +\cos(\varphi) \end{align*}

Using complex notation one can obtain this result more easily. Let

U1(t)=Re[U1eiωt],U2(t)=Re[U2eiωt],\begin{align*} \mathcal{U}_1(t)=\text{Re}[U_1 e^{-i\omega t}], \quad \mathcal{U}_2(t)=\text{Re}[U_2 e^{-i\omega t}], \end{align*}

where

U1=1,U2=eiφ.\begin{align*} U_1=1, \quad U_2=e^{-i\varphi}. \end{align*}

Then we find

U1+U22=1+eiφ2=(1+eiφ)(1+eiφ)=1+1+eiφ+eiφ=2+2cos(φ),\begin{align*} \begin{split} |U_1+U_2|^2 &= |1+e^{-i\varphi}|^2 \\ &= (1+e^{i\varphi})(1+e^{-i\varphi}) \\ &= 1+1+e^{-i\varphi}+e^{i\varphi} \\ &= 2+2\cos(\varphi), \end{split} \end{align*}

hence

I=12U1+U22.\begin{align*} I = \frac{1}{2}|U_1 + U_2|^2. \end{align*}

To see why this works, recall the time averaging formula and choose A=B=U1+U2A=B=U_1+U_2.

Remark. To shorten the formulae, we will omit in this chapter the factor 1/21/2 in front of the time-averaged intensity.

Hence we define I1=U12I_1=|U_1|^2 and I2=U22I_2=|U_2|^2, and we then find for the time-averaged intensity of the sum of U1U_1 and U2U_2:

I=U1+U22=(U1+U2)(U1+U2)=U12+U22+U1U2+U1U2=I1+I2+2Re[U1U2]=I1+I2+2I1I2cos(ϕ1ϕ2),\begin{align*} I &= |U_1+U_2|^2=(U_1+U_2)(U_1+U_2)^* \\ &= |U_1|^2+|U_2|^2+U_1U_2^*+U_1^* U_2 \\ &= I_1+I_2+2\text{Re}[ U_1 U_2^* ] \\ &= I_1 + I_2 + 2 \sqrt{I_1}\sqrt{I_2}\cos(\phi_1-\phi_2), \end{align*}

where ϕ1\phi_1 and ϕ2\phi_2 are the arguments of U1U_1 and U2U_2 and ϕ1ϕ2\phi_1-\phi_2 is the phase difference. The term 2Re[U1U2]2\text{Re}[U_1^* U_2] is known as the interference term. In the famous double-slit experiment (which we will discuss in a later section), we can interpret the terms as follows: let us say U1U_1 is the field that comes from slit 1, and U2U_2 comes from slit 2. If only slit 1 is open, we measure on the screen intensity I1I_1, and if only slit 2 is open, we measure I2I_2. If both slits are open, we would not measure I1+I2I_1+I_2, but we would observe fringes due to the interference term 2Re[U1U2]2\text{Re}[U_1^* U_2].

The intensity Eq. (12) varies when the phase difference varies. These variations are called fringes. The fringe contrast is defined by

 Fringe contrast=ImaxIminImax+Imin.\begin{align*} \textrm{ Fringe contrast} = \frac{ I_{max}-I_{min}}{I_{max}+I_{min}}. \end{align*}

It is maximum and equal to 1 when the intensities of the interfering fields are the same. If these intensities are different the fringe contrast is less than 1.

More generally, the intensity of a sum of multiple time-harmonic fields UjU_j all having the same frequency is given by the coherent sum

I=jUj2.\begin{align*} I=\left|\sum_j U_j\right|^2. \end{align*}

However, we will see in the next section that sometimes the fields are unable to interfere. In that case all the interference terms of the coherent sum vanish, and the intensity is given by the incoherent sum

I=jUj2.\begin{align*} I=\sum_j |U_j|^2. \end{align*}

3Coherence

In the discussion so far we have only considered monochromatic light, which means that the spectrum of the light consists of only one frequency. Although light from a laser often has a very narrow band of frequencies and therefore can be considered to be monochromatic, purely monochromatic light does not exist. One reason that light can not be perfectly monochromatic is that any source must have been switched on a finite time ago. Hence, all light consists of multiple frequencies and therefore is polychromatic. Classical light sources such as incandescent lamps and also LEDs have relatively broad frequency bands. The question then arises how differently polychromatic light behaves compared to the idealized case of monochromatic light. To answer this question, we must study the topic of coherence. One distinguishes between two extremes: fully coherent and fully incoherent light, while the degree of coherence of practical light is somewhere in between. Generally speaking, the broader the frequency band of the source, the more incoherent the light is. It is a very important observation that no light is actually completely coherent or completely incoherent. All light is partially coherent, but some light is more coherent than others.

An intuitive way to think about these concepts is in terms of the ability to form interference fringes. For example, with laser light, which usually is almost monochromatic and hence coherent, one can form an interference pattern with clear maxima and minima in intensities using a double slit, while with sunlight (which is incoherent) this is much more difficult. Every frequency in the spectrum of sunlight gives its own interference pattern with its own frequency dependent fringe pattern. These fringe patterns wash out due to superposition and the total intensity therefore shows little fringe contrast, i.e. the coherence is less. However, it is not impossible to create interference fringes with natural lightYoung, 1804. The trick is to let the two slits be so close together (of the order of 0.02 mm0.02~\text{mm}) that the difference in distances from the slits to the sun is so small for the fields in the slits to be sufficiently coherent to interfere. To understand the effect of polychromatic light, it is essential to understand that the degree to which the fields in two points are coherent, i.e. the ability to form fringes, is determined by the difference in distances between these points and the source. The distance itself to the source is not relevant. This will be made clear in this chapter.

3.1Coherence of Light Sources

In a conventional light source such as a gas discharge lamp, photons are generated by spontaneous emission with energy equal to the energy difference between certain electronic states of the atoms of the gas. These transitions have a duration of the order of 10-8 to 109s10^{-9} \, \text{s}. Because the emitted wave trains are finite, the emitted light does not have a single frequency; instead, there is a band of frequencies around a center frequency with width roughly equal to the reciprocal of the duration of the wave train. This spread of frequencies is called the natural linewidth. Random thermal motions of the molecules cause further broadening due to the Doppler effect. In addition, the atoms undergo collisions that interrupt the wave trains and therefore further broaden the frequency spectrum.

We first consider a single emitting atom. When collisions are the dominant broadening effect and these collisions are sufficiently brief, so that any radiation emitted during the collision can be ignored, an accurate model for the emitted wave is a steady monochromatic wave train at frequency ωˉ\bar{\omega} at the center of the frequency band, interrupted by random phase jumps each time that a collision occurs. The discontinuities in the phase due to the collisions cause a spread of frequencies around the center frequency. An example is shown in Figure 4. The average time τ0\tau_0 between the collisions is typically less than 10-10 s which implies that on average between two collisions roughly 106 harmonic oscillations occur and that during an atom transition of the order of hundred collisions may occur. The coherence time Δτc\Delta \tau_c is defined as the maximum time interval over which the phase of the electric field can be predicted. In the case of collisions-dominated emission by a single atom, the coherence time is equal to the average time between subsequent collisions: Δτc=τ0 1010\Delta \tau_c = \tau_0\text{ }10^{-10} s.

To understand coherence and incoherence, it is helpful to use this model for the emission by a single atom as harmonic wave trains of many thousands of periods interrupted by roughly hundred random phase jumps. Due to the random phase jumps, the interference term of the sum of harmonic wave trains emitted by two atoms when integrated of the relatively long integration time of a detector becomes a sum over integrals over time intervals of average length τ0\tau_0:

j0τ0cos(ωt)cos(ωt+ϕj)dt,\sum_{j} \int_0^{\tau_0} \cos(\omega t) \cos(\omega t + \phi_j) \text{d} t,

where the sum is over roughly one hundred random phase jumps during the total duration of the wave trains. The random phase jumps lead to cancellation of the integrals and hence the interference term vanishes. We conclude that over the integration time of typical detectors

The electric field amplitude of the harmonic wave train radiated by a single atom at the center frequency \bar{\omega}. The vertical lines are collisions separated by periods of free flight with mean duration \tau_0. The quantity \bar{\omega}\tau_0, which is the number of periods in a typical wave train, is chosen unrealistically small (namely 60, whereas a realistic value would be 10^5) to show the random phase changes.

Figure 4:The electric field amplitude of the harmonic wave train radiated by a single atom at the center frequency ωˉ\bar{\omega}. The vertical lines are collisions separated by periods of free flight with mean duration τ0\tau_0. The quantity ωˉτ0\bar{\omega}\tau_0, which is the number of periods in a typical wave train, is chosen unrealistically small (namely 60, whereas a realistic value would be 105) to show the random phase changes.

The coherence time and the width Δω\Delta \omega of the frequency line are related as

Δτc=2πΔω.\begin{align*} \Delta \tau_c = \frac{2\pi}{\Delta \omega}. \end{align*}

The coherence length is defined by

Δc=cΔτc.\begin{align*} \Delta \ell_{c}= c \Delta \tau_c. \end{align*}

Since λω=2πc\lambda \omega = 2\pi c, we have

Δλλˉ=Δωωˉ,\begin{align*} \frac{\Delta \lambda}{\bar{\lambda}} = \frac{\Delta \omega}{\bar{\omega}}, \end{align*}

where λˉ\bar{\lambda} and ωˉ\bar{\omega} are the wavelength and the frequency at the center of the line. Hence,

Δc=c2πΔω=2πcωˉωˉΔω=λˉ2Δλ.\begin{align*} \Delta \ell_c = c \frac{2\pi}{\Delta \omega} = 2\pi \frac{c}{\bar{\omega}} \frac{\bar{\omega}}{\Delta \omega} = \frac{\bar{\lambda}^2}{\Delta \lambda}. \end{align*}

The coherence length and coherence time of a number of sources are listed in Table 1. For a laser, the linewidth is extremely small and the coherence time very long. This is because the photons in a laser are not generated predominantly by spontaneous emission as classical sources, but instead by stimulated emission. Lasers are discussed in .

Table 1:Coherence time and coherence length of several sources

SourceMean wavelengthLinewidthCoherence LengthCoherence Time
λˉ\bar{\lambda}Δλ\Delta \lambdaλˉ2/Δλ\bar{\lambda}^2/\Delta \lambdaΔτc \Delta \tau_c
Mid-IR (3-5 μm\mu\text{m})4.0 μm\mu\text{m}2.0 μm2.0~\mu\text{m}8.0 μm\mu\text{m}2.66×10142.66 \times10^{-14} s.
White light550 nm300\approx 300 nm900 \approx 900 nm3.0×1014 \approx 3.0 \times 10^{-14}s.
Mercury arc546.1 nm1.0\approx 1.0 nm0.3\approx 0.3 mm1.0×1012 \approx 1.0 \times 10^{-12}s.
Kr86\text{Kr}^{86} discharge lamp605.6 nm1.2×1031.2 \times 10^{-3} nm0.3 m1.0×109 1.0 \times 10^{-9}s.
Stabilized He-Ne laser632.8 nm106\approx 10^{-6} nm400 m1.33×1061.33\times 10^{-6}s.

3.2Polychromatic Light

When dealing with coherence one has to consider fields that consist of a range of different frequencies. Let U(r,t){\cal U}(\mathbf{r},t) be the real-valued field component. It is always possible to write U(r,t){\cal U}(\mathbf{r},t) as an integral over time-harmonic components:

U(r,t)=Re0Aω(r)eiωtdω,\begin{align*} {\cal U}(\mathbf{r}, t) = \text{Re} \int_0^\infty A_\omega(\mathbf{r}) e^{-i \omega t} \, \, \text{d} \omega, \end{align*}

where Aω(r)A_\omega(r) is the complex amplitude of the time-harmonic field with frequency ω\omega. When there is only a certain frequency band that contributes, then Aω=0A_\omega=0 for ω\omega outside this band. We define the complex time-dependent field U(r,t)U(\mathbf{r},t) by

U(r,t)=0Aω(r)eiωt dω.\begin{align*} U(\mathbf{r},t) = \int_0^\infty A_\omega(\mathbf{r}) e^{-i\omega t}\ \, \text{d} \omega. \end{align*}

Then

U(r,t)=ReU(r,t).\begin{align*} \mathcal{U}(\mathbf{r},t)= \text{Re}\, U(\mathbf{r},t). \end{align*}

Remark: The complex field U(r,t)U(\mathbf{r},t) contains now the time dependence in contrast to the notation used for a time-harmonic (i.e. single frequency) field introduced in Chapter 2, where the time-dependent eiωte^{-i\omega t} was a separate factor.

We now compute the intensity of polychromatic light. The instantaneous energy flux is (as for monochromatic light) proportional to the square of the instantaneous real field: U(r,t)2\mathcal{U}(\mathbf{r},t)^2. We average the instantaneous intensity over the integration time TT of common detectors which, as stated before, is very long compared to the period at the center frequency 2π/ωˉ2\pi/\bar{\omega} of the field. Using Eq. (6) and

U(r,t)=ReU(r,t)=(U(r,t)+U(r,t))/2,\begin{align*} \mathcal{U}(\mathbf{r},t)= \text{Re}\, U(\mathbf{r},t) =(U(\mathbf{r},t)+U(\mathbf{r},t)^*)/2, \end{align*}

we get

U(r,t)2=14(U(r,t)+U(r,t))(U(r,t)+U(r,t))=14{U(r,t)2+(U(r,t))2+2U(r,t)U(r,t)}12U(r,t)U(r,t)=12U(r,t)2,\begin{align*} \langle \mathcal{U}(\mathbf{r},t)^2 \rangle &= \frac{1}{4} \langle (U(\mathbf{r},t)+U(\mathbf{r},t)^*)(U(\mathbf{r},t)+U(\mathbf{r},t)^*) \rangle \nonumber \\ &= \frac{1}{4} \left\{ \langle U(\mathbf{r},t)^2 \rangle + \langle (U(\mathbf{r},t)^*)^2 \rangle + 2 \langle U(\mathbf{r},t)^* U(\mathbf{r},t) \rangle\right\} \nonumber \\ & \approx & \frac{1}{2} \langle U(\mathbf{r},t)U(\mathbf{r},t)^* \rangle \nonumber \\ &= \frac{1}{2} \langle |U(\mathbf{r},t)|^2 \rangle, \end{align*}
\begin{align*} \\\end{align*}

where the averages of U(r,t)2U(\mathbf{r},t)^2 and (U(r,t))2(U(\mathbf{r},t)^*)^2 are zero because they are fast-oscillating and go through many cycles during the integration time of the detector. In contrast, U(r,t)2=U(r,t)U(r,t)|U(\mathbf{r},t)|^2=U(\mathbf{r},t)^*U(\mathbf{r},t) has a DC-component which does not average to zero.

Remark: In contrast to the time-harmonic case, the long time average of polychromatic light depends on the time tt at which the average is taken. However, we assume in this chapter that the fields are omitted by sources that are stationary. The property of stationarity implies that the average over the time interval of long length TT does not depend on the time that the average is taken. Many light sources, in particular conventional lasers, are stationary. (However, a laser source which emits short high-power pulses cannot be considered as a stationary source). We furthermore assume that the fields are ergodic, which means that taking the time-average over a long time interval amounts to the same as taking the average over the ensemble of possible fields. It can be shown that this property implies that the limit TT\rightarrow \infty in Eq. (6) indeed existsMandel & Wolf, 1995.

We use for the intensity again the expression without the factor 1/21/2 in front, i.e.

I(r)=U(r,t)2.\begin{align*} I(\mathbf{r}) &= \langle |U(\mathbf{r},t)|^2 \rangle. \end{align*}

The time-averaged intensity has hereby been expressed in terms of the time-average of the squared modulus of the complex field.

Quasi-monochromatic field. If the width Δω\Delta \omega of the frequency band is very narrow compared to the center frequency ωˉ\bar{\omega}, we speak of a quasi-monochromatic field. In the propagation of quasi-monochromatic fields, we use the formula for time-harmonic fields at ωˉ\bar{\omega}. The quasi-monochromatic assumption simplifies the computations considerably and will therefore be used frequently.

4Temporal Coherence and the Michelson Interferometer

To investigate the time coherence of a field in a certain point r\mathbf{r}, we let the field in that point interfere with itself but delayed in time, i.e. we let U(r,t)U(\mathbf{r},t) interfere with U(r,tτ)U(\mathbf{r}, t-\tau). Because, when studying temporal coherence, the point r\mathbf{r} is always the same, we omit it from the formula. Furthermore, for easier understanding of the phenomena, we assume for the time being that the field considered is emitted by a single atom (i.e. a point source).

Temporal coherence is closely related to the spectral content of the light: if the light consists of fewer frequencies (think of monochromatic light), then it is more temporally coherent. To study the interference of U(t)U(t) with U(tτ)U(t-\tau), a Michelson interferometer, shown in Figure 5, is a suitable setup. The light that goes through one arm takes time tt to reach the detector, while the light that goes through the other (longer) arm takes time t+τt+\tau which means that it was radiated earlier. Therefore, the detector observes the time-averaged intensity U(t)+U(tτ)2\langle |U(t)+U(t-\tau)|^2 \rangle. As remarked before, this averaged intensity does not depend on the time the average is taken, it only depends on the time difference τ\tau between the two beams.

A Michelson interferometer to study the temporal coherence of a field. A beam is split in two by a beam splitter, and the two beams propagate over different distances which corresponds to a time difference \tau and then interfere at the detector.

Figure 5:A Michelson interferometer to study the temporal coherence of a field. A beam is split in two by a beam splitter, and the two beams propagate over different distances which corresponds to a time difference τ\tau and then interfere at the detector.

We have

I(τ)=U(t)+U(tτ)2=U(t)2+U(tτ)2+2ReU(t)U(tτ)=2U(t)2+2ReU(t)U(tτ).\begin{align*} I(\tau)&= \langle |U(t)+U(t-\tau)|^2 \rangle &= \langle |U(t)|^2 \rangle+\langle |U(t-\tau)|^2 \rangle+2\text{Re} \langle U(t)U(t-\tau)^* \rangle \\ &= 2 \langle |U(t)|^2 \rangle + 2\text{Re}\langle U(t)U(t-\tau)^* \rangle. \end{align*}

The detected intensity varies with the difference in arm length.

So far we have considered a field that originates from a single atom. The total field emitted by an extended source is the sum of fields Ui(t)U_i(t) corresponding to all atoms ii. As has been explained already, the fields emitted by different atoms can not interfere. But the field emitted by an atom can interfere with the delayed field of that same atom and for every atom the interference is given by the same expression Eq. (28). The total intensity is simply that given by that of a single atom multiplied by the number of atoms. In particular, the ratio of the interference term and the other terms is the same for the entire source as for a single atom.

The self coherence function Γ(τ)\Gamma(\tau) is defined by

Γ(τ)=U(t)U(tτ)selfcoherence.\begin{align*} \Gamma(\tau)=\langle U(t)U(t-\tau)^* \rangle \hspace{1.5cm}\mathbf{self-coherence}. \end{align*}

The intensity of U(t)U(t) is

I0=U(t)2=Γ(0).\begin{align*} I_0=\langle |U(t)|^2 \rangle = \Gamma(0). \end{align*}

The complex degree of self-coherence is defined by:

γ(τ)=Γ(τ)Γ(0).complexdegreeofselfcoherence\begin{align*} \gamma(\tau)=\frac{\Gamma(\tau)}{\Gamma(0)}. \hspace{1.2cm} \mathbf{complex degree of self-coherence} \end{align*}

Using Bessel’s inequality it can be shown that this is a complex number with modulus between 0 and 1:

0γ(τ)1.\begin{align*} 0 \leq |\gamma(\tau)| \leq 1. \end{align*}

The observed intensity can then be written:

I(τ)=2I0{1+Re[γ(τ)]},\begin{align*} I(\tau)=2 I_0 \left\{1 +\text{Re}\left[\gamma(\tau) \right]\right\}, \end{align*}

We consider two special cases.

  1. Suppose U(t)U(t) is a monochromatic wave

U(t)=eiωt.\begin{align*} U(t)=e^{-i\omega t}. \end{align*}

In that case we get for the self-coherence

Γ(τ)=eiωteiω(tτ)=eiωτ,\begin{align*} \begin{split} \Gamma(\tau)&=\langle e^{-i\omega t}e^{i\omega (t-\tau)} \rangle =e^{-i\omega \tau}, \end{split} \end{align*}

and

γ(τ)=eiωτ.\begin{align*} \gamma(\tau) = e^{-i\omega \tau}. \end{align*}

Hence the interference pattern is given by

I(τ)=2[1+cos(ωτ)].\begin{align*} \begin{split} I(\tau)&=2\left[1+ \cos\left( \omega\tau \right) \right]. \end{split} \end{align*}

So for monochromatic light we expect to detect a cosine interference pattern, which shifts as we change the arm length of the interferometer (i.e. change τ\tau). No matter how large the time delay τ\tau, a clear interference pattern should be observed.

  1. Next we consider what happens when the light is a superposition of two frequencies:

U(t)=ei(ωˉ+Δω/2)t+ei(ωˉΔω/2)t2,\begin{align*} U(t)=\frac{e^{-i(\bar{\omega}+\Delta\omega/2) t}+e^{-i(\bar{\omega}-\Delta\omega/2) t}}{2}, \end{align*}

where (2π/T)Δωωˉ\left(2\pi/T\right) \ll \Delta \omega \ll \bar{\omega}, where TT is integration time of the detector. Then:

Γ(τ)=14(ei(ωˉ+Δω/2)t+ei(ωˉΔω/2)t)(ei(ωˉ+Δω/2)(tτ)+ei(ωˉΔω/2)(tτ))ei(ωˉ+Δω/2)τ+ei(ωˉΔω/2)τ4=cos(Δωτ/2)eiωˉτ2,\begin{align*} \Gamma(\tau)&=\frac{1}{4}\langle \left(e^{-i(\bar{\omega}+\Delta\omega/2) t}+e^{-i(\bar{\omega}-\Delta\omega/2) t}\right)\left(e^{i(\bar{\omega}+\Delta\omega/2) (t-\tau)}+e^{i(\bar{\omega}-\Delta\omega/2) (t-\tau)}\right) \rangle &\approx & \frac{e^{-i(\bar{\omega}+\Delta\omega/2) \tau}+e^{-i(\bar{\omega}-\Delta\omega/2) \tau}}{4} \\ &= \cos\left(\Delta\omega\,\tau/2\right)\frac{e^{-i \bar{\omega} \tau}}{2}, \end{align*}

where in the second line the time average of terms that oscillate with time is set to zero because the averaging is done over time interval TT satisfying TΔωˉ1T\Delta \bar{\omega} \gg 1. Hence, the complex degree of self-coherence is:

γ(τ)=cos(Δωτ/2)eiωˉτ\begin{align*} \gamma(\tau)= \cos\left(\Delta\omega\,\tau/2 \right) e^{-i \bar{\omega} \tau} \end{align*}

and Eq. (33) becomes

I(τ)={1+Re[γ(τ)]}=1+cos(Δωτ/2)cos(ωˉτ).\begin{align*} I(\tau)= \left\{1 +\text{Re}\left[\gamma(\tau) \right]\right\}= 1 + \cos\left(\Delta\omega\,\tau/2\right) \cos(\bar{\omega} \tau ). \end{align*}

The interference term is the product of the function cos(ωˉτ)\cos(\bar{\omega}\tau), which is a rapidly oscillating function of τ\tau, and a slowly varying envelope cos(Δωτ/2)\cos \left(\Delta\omega\,\tau/2\right). It is interesting to note that the envelope, and hence γ(τ)\gamma(\tau), vanishes for some periodically spaced τ\tau, which means that for certain τ\tau the degree of self-coherence vanishes and no interference fringes form[3],[4]. Note that when Δω\Delta\omega is larger the intervals between the zeros of γ(τ)\gamma(\tau) decrease. If more frequencies are added, the envelope function is not a cosine function but on average decreases with τ\tau. The typical value of τ\tau below which interferences are observed is roughly equal to half the first zero of the envelope function. This value is called the coherence time Δτc \Delta \tau_c. We conclude with some further interpretations of the degree of self-coherence γ(τ)\gamma(\tau).

Remarks.

  1. In stochastic signal analysis Γ(τ)=U(t)U(tτ)\Gamma(\tau)=\langle U(t)U(t-\tau)^* \rangle is called the autocorrelation of U(t)U(t). Informally, one can interpret the autocorrelation function as the ability to predict the field UU at time tt given the field at time tτt-\tau.

  2. The Wiener-Khinchin theorem says that (under the assumption of ergodicity and for stationary fields) the Fourier transform of the self coherence function is the spectral power density of U(t)U(t):

Γ^(ω)=U^(ω)2,\hat{\Gamma}(\omega)=|\hat{U}(\omega)|^2,

Using the uncertainty principle, we can see that the larger the spread of the frequencies of U(t)U(t) (i.e. the larger the bandwidth), the more sharply peaked Γ(τ)\Gamma(\tau) is. Thus, the light gets temporally less coherent when it consists of a broader range of frequencies. Measuring the spectral power density with a spectroscope and applying a back Fourier transform is an alternative method to obtain the complex self-coherence function.

5Spatial Coherence and Young’s Experiment

Temporal coherence concerns the coherence of the field in one point. The absolute value of the degree of self coherence Eq. (31) quantifies how strong the interference is of the field in the point of interest with the field in that same point at a later time. In contrast, spatial coherence is concerned with determining how coherent the fields in two different points are. This is done by letting the fields interfere using a mask with two small holes at the positions of the points of interest and observing the fringe contrast at a distant screen (Young’s experiment).

While for temporal coherence we used a Michelson interferometer, the natural choice to characterize spatial coherence is Young’s experiment, because it allows the fields in two points P1P_1, P2P_2 which are separated in space to interfere with each other.

The spatial coherence of light from an extended source.

Figure 6:The spatial coherence of light from an extended source.

Let r1\mathbf{r}_1and r2\mathbf{r}_2 be the position vectors of the points P1P_1 and P2P_2, respectively. We write the complex field in P1P_1 as a superposition of monochromatic fields as in Eq. (22):

U(r1,t)=Aω(r1)eiωt dω.\begin{align*} U(\mathbf{r}_1,t) = \int A_\omega(\mathbf{r}_1) e^{-i\omega t}\ \, \text{d} \omega. \end{align*}

The reason for doing this is that for a monochromatic field in the pinhole, i.e. a field with a well defined frequency, we can derive the disturbance in any point r\mathbf{r} behind the mask. In fact, according to the Huygens-Fresnel Principle, a monochromatic disturbance with frequency ω\omega in the pinhole at r1\mathbf{r}_1 generates a radiating spherical wave with the same frequency ω\omega, such that in a point r\mathbf{r} behind the mask the field is:

SAω(r1)ωceiω(trr1/c)rr1,\begin{align*} {\cal S} A_\omega(\mathbf{r}_1)\, \frac{\omega}{c} \,\frac{e^{-i \omega(t- |\mathbf{r}-\mathbf{r}_1|/c)}}{ |\mathbf{r}-\mathbf{r}_1|}, \end{align*}

where S{\cal S} is the surface area of the pinhole. We assume that the frequency band is sufficiently small such that the frequency factor that multiplies AωA_\omega can be replaced by the center frequency ωˉ\bar{\omega}. Note that this should not be done in the exponent in Eq. (44) because an error in the phase can easily lead to large errors in the total field. The total field U1(r,t)U_1(\mathbf{r},t) in r\mathbf{r} due to the pinhole at P1P_1 is obtained by integrating the monochromatic components over frequency:

U1(r,t)=SωˉcAω(r1)eiω(trr1/c)rr1dω=SωˉcU(r1,trr1/c)rr1.\begin{align*} U_1(\mathbf{r},t) = {\cal S} \,\frac{\bar{\omega}}{c} \int A_\omega(\mathbf{r}_1)\frac{e^{-i \omega( t-|\mathbf{r}-\mathbf{r}_1|/c)}}{ |\mathbf{r}-\mathbf{r}_1|} \text{d} \omega ={\cal S}\, \frac{\bar{\omega}}{c} \frac{U(\mathbf{r}_1, t - |\mathbf{r}-\mathbf{r}_1|/c)}{ |\mathbf{r}-\mathbf{r}_1|}. \end{align*}

In words:

For the field in r\mathbf{r} due to pinhole 2 we have similarly

U2(r,t)=SωˉcU(r2,trr1/c)rr2.\begin{align*} U_2(\mathbf{r},t) = {\cal S}\, \frac{\bar{\omega}}{c} \frac{U(\mathbf{r}_2, t - |\mathbf{r}-\mathbf{r}_1|/c)}{ |\mathbf{r}-\mathbf{r}_2|}. \end{align*}

The total field in r\mathbf{r} is the sum U1(r,t)+U2(r,t)U_1(\mathbf{r},t)+U_2(\mathbf{r},t). Because of the difference in propagation distance ΔR=rr2rr1\Delta R=|\mathbf{r}-\mathbf{r}_2|-|\mathbf{r}-\mathbf{r}_1|, there is a time difference τ\tau between when the two fields have been emitted by the two pinholes when they arrive at a given time TT IN point r\mathbf{r} on the screen in Figure 6. This time difference is given by

τ=ΔRc.\begin{align*} \tau = \frac{\Delta R}{c}. \end{align*}

Furthermore, the amplitudes are reduced by a factor proportional to the reciprocal distance which is different for the two fields. But if the distance of the screen to the mask is large enough, we may assume these factors to be the same and then omit them. Using Eq. (47), the interference pattern on the screen is then, apart from a constant factor, given by

I(τ)=U1(r,t)+U2(r,t)2=U(r1,trr1/c)+U(r2,trr2/c)2=U(r1,t)+U(r2,tτ)2=U(r1,t)2+U(r2,tτ)2+2ReU(r1,t)U(r2,tτ)=U(r1,t)2+U(r2,t)2+2ReU(r1,t)U(r2,tτ),\begin{align*} I(\tau)&= \langle \, |U_1(\mathbf{r},t) + U_2(\mathbf{r},t) |^2 \, \rangle &= \langle \, | U(\mathbf{r}_1, t-|\mathbf{r}-\mathbf{r}_1||/c) + U(\mathbf{r}_2, t-|\mathbf{r}-\mathbf{r}_2||/c)|^2 \, \rangle \\ &= \langle \, |U(\mathbf{r}_1,t)+U(\mathbf{r}_2,t- \tau)|^2\, \rangle \\ &=\langle \, |U(\mathbf{r}_1,t)|^2 \rangle+\langle |U(\mathbf{r}_2,t-\tau)|^2\, \rangle+2\text{Re}\langle \,U(\mathbf{r}_1,t)U(\mathbf{r}_2,t-\tau)^*\, \rangle \\ &= \langle \, |U(\mathbf{r}_1,t)|^2\, \rangle+\langle \, |U(\mathbf{r}_2,t)|^2\, \rangle+2\text{Re}\langle \, U(\mathbf{r}_1,t)U(\mathbf{r}_2,t-\tau)^*\, \rangle, \end{align*}

where in the third and last line we used that the time average does not depend on the time it is taken because the light source is assumed to be stationary. We define the mutual coherence function by:

Γ12(τ)=U(r1,t)U(r2,tτ),mutualcoherence.\begin{align*} \Gamma_{12}(\tau)=\langle \,U(\mathbf{r}_1,t)U(\mathbf{r}_2,t-\tau)^*\, \rangle, \hspace{1.5cm} \mathbf{mutual coherence}. \end{align*}

With the intensities

I1=U(r1,t)2=Γ11(0),I2=U(r2,t)2=Γ22(0).\begin{align*} \begin{split} I_1&=\langle \, |U(\mathbf{r}_1,t)|^2\, \rangle = \Gamma_{11}(0),\\ I_2&=\langle \, |U(\mathbf{r}_2,t)|^2\, \rangle = \Gamma_{22}(0). \end{split} \end{align*}

the complex degree of mutual coherence is defined by

γ12(τ)=Γ12(τ)Γ11(0)Γ22(0),complexdegreeofmutualcoherence.\begin{align*} \gamma_{12}(\tau)=\frac{\Gamma_{12}(\tau)}{\sqrt{ \Gamma_{11}(0)}\sqrt{\Gamma_{22}(0)}}, \quad \mathbf{complex degree of mutual coherence}. \end{align*}

It can be proved using Bessel’s inequality that

γ12(τ)1.|\gamma_{12}(\tau)| \leq 1.

We can now write Eq. (48) as

I(τ)=I1+I2+2I1I2Reγ12(τ).\begin{align*} I(\tau)=I_1+I_2+2\sqrt{I_1}\sqrt{I_2}\,\text{Re} \, \gamma_{12}(\tau). \end{align*}

By varying the point r\mathbf{r} over the screen we can vary τ\tau and by measuring the intensities we can determine the real part of γ12(τ)\gamma_{12}(\tau) and hence the fringe contrast observed on the screen.

As an example, consider what happens when U(r,t)U(\mathbf{r},t) is a monochromatic field

U(r,t)=A(r)eiωt.\begin{align*} U(\mathbf{r},t)=A(\mathbf{r})e^{-i\omega t}. \end{align*}

In that case

Γ12(τ)=A(r1)A(r2)eiωteiω(tτ)=A(r1)A(r2)eiωτ\begin{align*} \begin{split} \Gamma_{12}(\tau) &= \langle A(\mathbf{r}_1)A(\mathbf{r}_2)^*e^{-i\omega t}e^{i\omega (t-\tau)} \rangle \\ &= A(\mathbf{r}_1) A(\mathbf{r}_2)^*e^{-i\omega \tau} \end{split} \end{align*}

and

Γ11(0)=A(r1)2,Γ22(0)=A(r2)2.\Gamma_{11}(0)= |A(\mathbf{r}_1)|^2, \quad \Gamma_{22}(0)=|A(\mathbf{r}_2)|^2.

So we get

γ12(τ)=Γ12(τ)A(r1)A(r2)=eiωτ+iφ,\begin{align*} \gamma_{12} (\tau) = \frac{\Gamma_{12}(\tau)}{|A(\mathbf{r}_1)| |A(\mathbf{r}_2)|} = e^{-i \omega \tau + i \varphi}, \end{align*}

where φ\varphi is the phase difference of A(r2)A(\mathbf{r}_2) and A(r1)A(\mathbf{r}_1). In this case γ12\gamma_{12} has modulus 1, as expected for a monochromatic field. The intensity on the screen becomes

I(τ)=A(r1)2+A(r2)2+2A(r1)A(r2)cos(ωτφ).\begin{align*} I(\tau)=|A(\mathbf{r}_1)|^2+|A(\mathbf{r}_2)|^2+2|A(\mathbf{r}_1)||A(\mathbf{r}_2)|\cos\left(\omega \tau -\varphi\right). \end{align*}

So indeed we see interference fringes with maximum contrast 1 and hence the fields in P1P_1 and P2P_2 are fully coherent as one would expect for a monochromatic wave. If φ=0\varphi=0, then interference maxima occur for

ωτ=0,±2π,±4π,±6π,\begin{align*} \omega\tau=0,\pm 2\pi, \pm 4\pi, \pm 6\pi,\dots \end{align*}

Because ω=c2πλ\omega=c\frac{2\pi}{\lambda}, and ΔR=cτ\Delta R=c\tau, we find that maxima occur when

ΔR=0,±λ,±2λ,±3λ,\begin{align*} \Delta R =0,\pm\lambda,\pm 2\lambda, \pm 3\lambda,\dots \end{align*}

For large distance between the screen and the mask (in the Fraunhofer limit), these path length differences correspond to directions of the maxima given by the angles θm\theta_m (see Figure 6):

θm=ΔRd=mλd,\begin{align*} \theta_m = \frac{\Delta R}{d} = m \frac{\lambda}{d}, \end{align*}

where dd is the distance between the slits and mm is an integer[5].

Remarks.

  1. The mutual coherence Γ12(τ)=U(r1,t)U(r2,tτ)\Gamma_{12}(\tau)= \langle U(\mathbf{r}_1,t)U(\mathbf{r}_2,t-\tau)^* \rangle is the cross-correlation of the two signals U(r1,t)U(\mathbf{r}_1,t) and U(r2,t)U(\mathbf{r}_2,t).

  2. As remarked above, by moving the point of observation r\mathbf{r} over the screen, one can obtain the real part of the complex degree of mutual coherence. To derive also the imaginary part, one can put a piece of glass behind one of the pinholes with thickness such that for the center frequency ωˉ\bar{\omega} an additional phase difference of π/2\pi/2 is obtained of the fields in r1\mathbf{r}_1 and r2\mathbf{r}_2. If the frequency band Δω\Delta \omega is sufficiently narrow this phase difference applies in good approximations to all frequencies in the band.

6More on Spatial Coherence

We first consider the case that the source is so small (e.g. a single emitting atom) that it can be considered to be a point source SS. In that case it is the fields in two points P1P_1, P2P_2 somewhere in space are coherent if and only if the difference in time that it takes for light to propagate from SS to P1P_1 and from SS to P2P_2 is less than the coherence time Δτc\Delta\tau_c. Equivalently, for coherence the difference between the distances SP1SP_1 and SP2SP_2 must be less than the coherence length Δlc\Delta l_c.

An extended classical light source consists of a large set of emitting point sources that emit by spontaneous emission. As we have explained in Section 3.1, the wave trains emitted by different atoms (point sources) in the source suffer random phase jumps due to e.g. collisions and therefore the fields emitted by different point sources in an extended classical light source can not interfere. Such a light source is called spatially incoherent. For a spatially incoherent light source, the spatial coherence in any two points P1P_1 and P2P_2 is determined by measuring the fringe contrast on a distance screen when a mask is used that is perpendicular to the mean direction of propagation of the light and which contains pinholes at P1P_1 and P2P_2. The fringe contrast and hence the mutual coherence at P1P_1 and P2P_2 is determined by two effects:

  1. First of all it is determined by how coherent the contributions to the total field in P1P_1 and P2P_2 is of the individual point sources SS in the extended source. This coherence is determined by the extent to which the difference between the distance of SS to P1P_1 and of SS to P2P_2 is smaller than the coherence length. If these differences in distances are for all point sources larger than the coherence length, the fringe contrast on the screen n Young’s experiment will be very low and hence the mutual coherence is very low.

  2. The second effect is the size of the extended source. Even if for all point sources in the source the fields in P1P_1 and P2P_2 are coherent, the coherence of the total fields at P1P_1 and P2P_2 due to the entire source can be small. As we know, the contributions of different point sources can not interfere. Hence the intensity observed in Young’s experiment is the sum of the intensities due to the individual point sources in the extended source. The reason that the coherence of the total fields in P1P_1 and P2P_2 due to the entire extended source can be low even though for all point sources individually the mutual coherence in P1P_1 and P2P_2 is high, is that the fringe patterns due to the point sources are shifted with respect to each other which reduces the fringe contrast and hence the mutual coherence. The shift of the fringe patterns is due to the different positions in the extended source of the point sources which cause that the phase difference between the fields in P1P_1 and P2P_2 varies with the point sources.

We will show that when P1P_1 and P2P_2 have a large distance to the extended source, the two conditions mentioned above for the fields in P1P_1 and P2P_2 to be spatially mutually coherent are equivalently to the requirement that:

To show this we consider two mutually incoherent point sources S1S_1 and S2S_2 in the z=0z=0 plane. Their mutual coherence function satisfies:

ΓS1S2(τ)=0, for all τ,\begin{align*} \Gamma_{S_1S_2}(\tau) &= 0, \text{ for all $\tau$}, \end{align*}
ΓS1S1(τ)=ΓS2S2(τ)=Γ0(τ),\begin{align*} \\ \Gamma_{S_1S_1}(\tau)&=\Gamma_{S_2S_2}(\tau)= \Gamma_0(\tau),\end{align*}

where Γ0\Gamma_0 is the self-coherence which we assume to be the same for both point sources. Γ0(τ)\Gamma_0(\tau) has width given by the coherence time Δτc\Delta \tau_c of the source and on average decreases with τ\tau (although not always monotonically). Eq. (62) expresses the fact that two point sources are mutually incoherent. Using that the long-time average does not depend on the origin of time which was based on the assumption that the source is stationary, we find:

Γ0(τ)=<U(S1,t)U(S1,t+τ)>=<U(S1,tτ)U(S1,t)>=Γ0(τ).\begin{align*} \Gamma_0(-\tau)=<U(S_1,t) U(S_1,t+\tau)^*> = < U(S_1,t-\tau)U(S_1,t)^*> = \Gamma_0(\tau)^*. \end{align*}

Furthermore, for τ=0\tau=0: Γ0(0)=I0\Gamma_0(0)=I_0, which is the intensity of either source.

We assume for convenience that the two points P1P_1, P2P_2 to be at a large distance zz from the two point sources and that the line P1P2P_1P_2 is parallel to the extended source as shown in Figure 7. We will compute the mutual coherence ΓP1P2(0)\Gamma_{P_1P_2}(0) for zero time delay τ=0\tau=0 (we can also compute the mutual coherence for more general time delays τ>0\tau>0, i.e. ΓP1P2(τ)\Gamma_{P_1P_2}(\tau), but it will suffice for our purpose to take τ=0\tau=0). The fields in P1P_1 and P2P_2 are the sum of the fields emitted by S1S_1 and S2S_2. Since S1S_1 and S2S_2 are point sources they emit spherical waves. Therefore, similarly to Eq. (45) we find that the field in P1P_1 is proportional to

U(P1,t)U(S1,tS1P1/c)S1P1+U(S2,tS2P1/c)S2P1,\begin{align*} U(P_1, t) \propto \frac{U(S_1,t-|S_1P_1|/c)}{|S_1P_1|} + \frac{U(S_2,t-|S_2P_1|/c)}{|S_2P_1|}, \end{align*}

and

U(P2,t)U(S1,tS1P2/c)S1P2+U(S2,tS2P2/c)S2P2,\begin{align*} U(P_2, t) \propto \frac{U(S_1,t-|S_1P_2|/c)}{|S_1P_2|} + \frac{U(S_2,t-|S_2P_2|/c)}{|S_2P_2|}, \end{align*}

where we omitted the constant factors in front of Eq. (45).

Two incoherent point sources S_1, S_2 at a distance a from each other and two points P_1, P_2 in a plane at large distance z from the point sources.

Figure 7:Two incoherent point sources S1S_1, S2S_2 at a distance aa from each other and two points P1P_1, P2P_2 in a plane at large distance zz from the point sources.

For zz sufficiently large all distances SiPj|S_iP_j| in the denominators may be replaced by zz and then these equal distances can be omitted. By substituting Eq. (65) and Eq. (66) into Eq. (49) with τ=0\tau=0, we find for the mutual coherence of P1P_1 and P2P_2:

ΓP1P2(0)=U(P1,t)U(P2,t)=ΓS1S1(S1P2S1P1c)+ΓS1S2(S2P2S1P1c)+ΓS2S1(S1P2S2P1c)+ΓS2S2(S2P2S2P1c).\begin{align*} \Gamma_{P_1P_2}(0) &= \langle \, U(P_1,t)U(P_2,t)^*\, \rangle \\ &= \Gamma_{S_1S_1}\left( \frac{ |S_1P_2|-|S_1P_1|}{c}\right) + \Gamma_{S_1S_2}\left( \frac{|S_2P_2|- |S_1P_1|}{c}\right) \\ & & + \Gamma_{S_2S_1}\left( \frac{|S_1P_2|-|S_2P_1|}{c}\right) + \Gamma_{S_2S_2}\left( \frac{|S_2P_2|- |S_2P_1|}{c}\right). \end{align*}

Now we use Eq. (62) and Eq. (63) to get

ΓP1P2(0)=Γ0(S1P2S1P1c)+Γ0(S2P2S2P1c).\begin{align*} \Gamma_{P_1P_2}(0) &= \Gamma_0\left( \frac{ |S_1P_2|-|S_1P_1|}{c}\right) + \Gamma_0\left( \frac{|S_2P_2|- |S_2P_1|}{c}\right). \end{align*}

Similarly,

ΓP1P1(0)=ΓP2P2(0)=2Γ0(0)=2I0.\begin{align*} \Gamma_{P_1P_1}(0) = \Gamma_{P_2P_2}(0)=2\Gamma_0(0)= 2I_0. \end{align*}

Since the width of the self coherence function Γ0\Gamma_0 is the coherence time Δτc\Delta \tau_c, result Eq. (68) confirms that for the fields in P1P_1 and P2P_2 to be coherent, the difference in distance of each of the source points to points P1P_1 and P2P_2 should be smaller than the coherence length Δlc=cΔτc\Delta l_c = c \Delta \tau_c . To express the result in the angle α\alpha subtended by the source at the midpoint of P1P2P_1P_2 we choose coordinates such that Pj=(xj,0,z)P_j=(x_j,0,z) for j=1,2j=1,2. If the distance to the source is so large that S1P1S_1P_1 and S1P2S_1P_2 are almost parallel, we see from Figure 7 that

S1P2S1P1QP2=α2x1x2.\begin{align*} |S_1P_2|-|S_1P_1|\approx |QP_2| = \frac{\alpha}{2}|x_1-x_2|. \end{align*}

Similarly,

S2P1S2P2α2x1x2.\begin{align*} |S_2P_1|-|S_2P_2|\approx \frac{\alpha}{2}|x_1-x_2|. \end{align*}
For z very large, S_1P_1 and S_1P_2 are almost parallel and |S_1P_2|-|S_1P_1|\approx |QP_2|= |x_1-x_2| \alpha/2.

Figure 8:For zz very large, S1P1S_1P_1 and S1P2S_1P_2 are almost parallel and S1P2S1P1QP2=x1x2α/2|S_1P_2|-|S_1P_1|\approx |QP_2|= |x_1-x_2| \alpha/2.

Hence, with Γ0(τ)=Γ0(τ)\Gamma_0(-\tau)=\Gamma_0(\tau)^*, Eq. (68) becomes

ΓP1P2(0)=2ReΓ0(α2(x1x2)c).\begin{align*} \Gamma_{P_1P_2}(0) = 2\text{Re}\, \Gamma_0\left( \frac{\alpha}{2} \frac{(x_1-x_2)}{c}\right). \end{align*}

We conclude that for the fields in P1P_1 and P2P_2 to be coherent, the product of the angle α\alpha which the source subtends at the midpoint of P1P2P_1P2 and the distance of P1P2P_1P_2 should be smaller than the coherence length Δlc=cΔτc\Delta l_c = c \Delta \tau_c. The smaller this product is the higher the degree of spatial coherence of P1P_1 and P2P_2.

The angle α\alpha decreases when the distance to the sources is increased and/or when the size of the source is decreased. Loosely speaking one can say that as the light propagates, it becomes more coherent. In both cases when the distance to the source increases and when the size of the source is decreased, the difference in distance of all point source to P1P_1 and P2P_2 decreases and will ultimately become smaller than the coherence length. Furthermore, for smaller α\alpha the fringe patterns on the distant screen in Young’s experiment due to different point sources more strongly overlap which leads to a stronger overall fringe contrast.

As example consider quasi-monochromatic light for which (see Eq. (35)):

Γ0(τ)=I0eiωˉτ, for all τ.\begin{align*} \Gamma_0(\tau) = I_0 e^{-i\bar{\omega}\tau}, \text{ for all $\tau$}. \end{align*}

where ωˉ\bar{\omega} is the center frequency. In this case the coherence length Δlc\Delta l_c of the source is so large that the contributions to the total field of all individual point sources are coherent. Hence the only remaining criterion for coherence of the total fields in P1P_1 and P2P_2 is that the fringe patterns due to the different point sources in Young’s experiment sufficiently overlap. Indeed, in this case of very long coherence time Δτc\Delta \tau_c we have

ΓP1P2(0)=2I0cos[α2ωˉx1x2c],\begin{align*} \Gamma_{P_1P_2}(0) = 2 I_0 \cos\left[\frac{\alpha}{2}\frac{\bar{\omega}|x_1-x_2|}{c}\right], \end{align*}

and hence the degree of mutual coherence is:

γP1P2(0)=ΓP1P2(0)ΓP1P1(0)ΓP2P2(0)=cos[α2ωˉx1x2c].\begin{align*} \gamma_{P_1P_2}(0) &= \frac{\Gamma_{P_1P_2}(0)}{\sqrt{\Gamma_{P_1P_1}(0)} \sqrt{\Gamma_{P_2P_2}(0)}} \\ &= \cos\left[\frac{\alpha}{2}\frac{\bar{\omega}|x_1-x_2|}{c}\right]. \end{align*}

We see that when

x1x2<λˉ/(2α),\begin{align*} |x_1-x_2| < \bar{\lambda}/(2 \alpha), \end{align*}

the fields in P1P_1 and P2P_2 are at least partially mutually coherent.

6.1Example: Solar Coherence Length

We determine the maximum distance dd between two points on earth for which sun light is coherent. The sun subtends on earth the angle:

α=AU2R0.015,\alpha = \frac{\text{AU}}{2R_\circ}\approx 0.015,

where AU\text{AU} and RR_\circ are the distance of the sun to the earth and the radius of the sun. Hence, for green light λ=550 nm\lambda=550~\text{nm} and by requiring

d<λˉ4αd < \frac{\bar{\lambda}}{4\alpha} \nonumber

for appreciable mutual coherence, we find dmax20d_{\max}\approx 20 micron.

7Stellar Interferometry

The property that the spatial coherence of two points decreases for increasing angle which the source subtends halfway between the two points, is used in stellar interferometry. It works as follows: we want to know the size of a certain star. The size of the star, being an extended spatially incoherent source, determines the spatial coherence of the light we receive on earth. Thus, by measuring the interference of the light collected by two transversely separated telescopes, one can effectively create a double-slit experiment, with which the degree of spatial coherence of the star light on earth can be measured, and thereby the angle which the star subtends on earth. The resolution in retrieving the angle from the spatial coherence is larger when the distance between the telescopes is larger (see Eq. (75)). Then, if we know the distance of the star by independent means, e.g. from its spectral brightness, we can deduce its size from its angular size.

The method can also be used to derive the intensity distribution at the surface of the star. It can be shown that the degree of spatial coherence as function of the relative position of the telescopes is the Fourier transform of this intensity distribution. Hence, by moving the telescopes around and measuring the spatial coherence for many positions, the intensity distribution at the surface of the star can be derived from a back Fourier transform.

Left: a stellar interferometer with two telescopes that can be moved around to measure the interference at many relative positions. Right: single telescope with two outer movable mirrors. The telescope can move around it axis. The larger the distance d the higher the resolution.

Figure 9:Left: a stellar interferometer with two telescopes that can be moved around to measure the interference at many relative positions. Right: single telescope with two outer movable mirrors. The telescope can move around it axis. The larger the distance dd the higher the resolution.

8Fringe contrast

We have seen that when the interference term ReU1U2\text{Re} \langle U_1 U_2^* \rangle vanishes, no fringes form, while when this term is nonzero, there are fringes. The fringe contrast is expressed directly in measurable intensities. Given some interference intensity pattern I(x)I(x) as in Figure 10, the fringe contrast is defined as

V=ImaxIminImax+Imin.fringecontrast.\begin{align*} \mathcal{V}=\frac{I_{\text{max}}-I_{\text{min}}}{I_{\text{max}}+I_{\text{min}}}. \hspace{1.5cm} \mathbf{fringe contrast}. \end{align*}

For example, if we have two perfectly coherent, monochromatic point sources emitting the fields U1U_1, U2U_2 with intensities I1=U12I_1=|U_1|^2, I2=U22I_2=|U_2|^2, then the interference pattern is with Eq. (58):

I(τ)=I1+I2+2I1I2cos(ωτ+φ).\begin{align*} I(\tau)=I_1+I_2+2\sqrt{I_1 I_2}\cos(\omega \tau +\varphi). \end{align*}

We then get

Imax=I1+I2+2I1I2,Imin=I1+I22I1I2,\begin{align*} I_{\text{max}}=I_1+I_2+2\sqrt{I_1 I_2}, \quad I_{\text{min}}=I_1+I_2-2\sqrt{I_1 I_2}, \end{align*}

so

V=2I1I2I1+I2.\begin{align*} \mathcal{V}=\frac{2\sqrt{I_1 I_2}}{I_1+I_2}. \end{align*}

In case I1=I2I_1=I_2, we find V=1\mathcal{V}=1.

In contrast, when U1U_1 and U2U_2 are completely incoherent, we find

I(τ)=I1+I2,\begin{align*} I(\tau)=I_1+I_2, \end{align*}

from which follows

Imax=Imin=I1+I2,\begin{align*} I_{\text{max}}=I_{\text{min}}=I_1+I_2, \end{align*}

which gives V=0\mathcal{V}=0.

Illustration of I_{\text{max}} and I_{\text{min}} of an interference pattern I(x) that determines the fringe contrast\mathcal{V}.

Figure 10:Illustration of ImaxI_{\text{max}} and IminI_{\text{min}} of an interference pattern I(x)I(x) that determines the fringe contrastV\mathcal{V}.

9Fabry-Perot resonator

In interferometry two mutually coherent waves are added and the intensity of the sum of the two fields is measured. This intensity contains information about the phase difference of the waves from which for example a path length difference can be deduced. One distinguishes between two types of interferometers: wavefront splitting interferometers and amplitude splitting interferometers. Examples of the first type are Young’s two slit experiment and Lloyd’s mirror (Figure 11). Examples of amplitude splitting interferometers are the Michelson interferometer and the Fabry-Perot interferometer. The latter is not only a spectrometer of extremely high resolution but is also the resonance cavity in a laser.

Lloyd’s mirror as example of wavefront splitting interferometry.

Figure 11:Lloyd’s mirror as example of wavefront splitting interferometry.

A Fabry-Perot interferometer consists of two parallel highly reflecting surfaces with vacuum or a dielectric in between. These surfaces can be optical flats which have been coated by a metal like silver on one side. Consider a coordinate system as in Figure 12 such that the reflecting surfaces are at z=0z=0 and z=dz=d. The refractive indices of the half spaces z<0z<0 and z>dz>d are n1n_1 and n3n_3, respectively, and the refractive index of the medium between the surfaces is n2n_2. We will first assume that all refractive indices are real.

Let there be a plane wave with unit amplitude incident from z<0z<0 under angle θ1\theta_1 with the normal as shown in Figure 12. The incident wave is assumed to be either s- or p-polarized. There are a reflected plane wave in z<0z<0, two plane waves in medium 2 one propagating in the positive zz-direction and the other in the negative zz-direction and there is a transmitted plane wave in z>dz>d. It follows from the boundary conditions that the tangential component of the electric and magnetic field are continuous across the interfaces, that the tangential components of the wave vectors of all these plane waves are identical.

Let rijr_{ij} and tijt_{ij} be the reflection and transmission coefficient for a wave that is incident from medium ii on the interface with medium jj. When the wave is s-polarized, r12r_{12} and t12t_{12} are given by the Fresnel coefficients (see the Rayleigh-Sommerfeld Diffraction Integral section in the Diffraction chapter), whereas if the wave is p-polarized, they are given by the p-polarized Fresnel coefficients.

Fabry-Perot with 3 layers.The light comes from the bottom and is reflected by each interface.

Figure 12:Fabry-Perot with 3 layers.The light comes from the bottom and is reflected by each interface.

The incident wave, which has amplitude 1 in point A, is partially reflected and partially transmitted by the interface z=0z=0. The reflected wave gets amplitude r12r_{12}. The transmitted field propagates in medium 0<z<d0<z<d to the interface at z=dz=d and is then partially reflected with reflection coefficient r23r_{23} back to the interface z=0z=0. Because the path length inside medium 2 is 2d/cosθ2 2d /\cos \theta_2, the complex amplitude B of this wave in point B after transmission by the interface z=0z=0 is

t21r23t21e2ik0n2dcosθ2,t_{21} r_{23} t_{21} e^{ 2 i k_0 n_2 \frac{d}{\cos \theta_2}},

where k0k_0 is the wave number in vacuum. To compute the interference of the directly reflected wave and the wave that has made one round trip in medium 2, the two fields should be evaluated at the same wavefront such as wavefront CB in Figure 12. The directly reflected field in C is obtained by propagating from B over the distance

AC=ABsinθ1=2dtanθ2sinθ1=2dn2n1sin2θ2cosθ1.\begin{align*} \text{AC} &= \text{AB} \sin \theta_1 \\ &= 2 d \tan\theta_2 \sin \theta_1 \\ &= 2 d \frac{n_2}{n_1 } \frac{\sin^2 \theta_2}{\cos\theta_1}. \end{align*}

where Snell’s law: n1sinθ1=n2sinθ2n_1 \sin\theta_1 = n_2 sin \theta_2 has been used. Hence the total field due to the direct reflection at z=0z=0 and one round trip Eq. (85)

r12ei2k0n2sin2θ2cosθ2+t21r23t21e2ik0n2dcosθ2=ei2k0n2dsin2θ2cosθ2(r12+t21r23t12e2ik0n2dcosθ2).\begin{align*} r_{12} e^{i 2k_0 n_2 \frac{\sin^2 \theta_2}{cos \theta_2}} + t_{21} r_{23} t_{21} e^{ 2 i k_0 n_2 \frac{d}{\cos \theta_2}} \\ = e^{ i 2 k_0 n_2 d \frac{\sin^2\theta_2}{\cos\theta_2}} \left( r_{12} + t_{21}r_{23}t_{12} e^{2 i k_0 n_2d \cos \theta_2}\right). \end{align*}

The common phase factor in front of the brackets may be omitted since it does not influence the reflected intensity. We then obtain

r12+t21r23t12e2ikz(2)d,r_{12} + t_{21}r_{23}t_{12} e^{2 i k^{(2)}_z d},

where,

kz(2)=k0n2cosθ2,k_z^{(2)}= k_0 n_2 \cos\theta_2,

is the zz-component of the wave vector in medium 2 of the wave that propagates in the postive zz-direction.

Incorporating the contributions of waves having made two or more round trips in the slab leads to the reflection coefficient of the Fabry-Perot when the field is incident from medium 1:

r=r12+t21r13t12e2ikz(2)d[1+r23r21e2ikz(2)d+(r23r21e2ikz(2)d)2+]=r12+t21r13t12e2ikz(2)d11r23r21e2ikz(2)d=r12r23e2ikz(2)d1r23r21e2ikz(2)d,\begin{align*} r &= r_{12} + t_{21}r_{13}t_{12} e^{2 i k^{(2)}_z d} \left[ 1 + r_{23} r_{21} e^{2 i k^{(2)}_z d} + ( r_{23} r_{21} e^{2 i k_z^{(2)} d})^2 + \ldots \right] \\ &= r_{12} + t_{21}r_{13}t_{12} e^{2 i k^{(2)}_z d}\frac{1}{ 1 - r_{23} r_{21} e^{2 i k_z^{(2)} d}} \\ &= \frac{r_{12}-r_{23} e^{2 i k^{(2)}_z d}}{1- r_{23}r_{21} e^{2 i k^{(2)}_z d}}, \end{align*}

where in the last step we used

t21=1+r21,t12=1+r12,r12=r21\begin{align*} t_{21}&= 1 + r_{21}, \\ t_{12}&= 1+ r_{12}, \\ r_{12}&= -r_{21} \end{align*}

Similarly, the amplitude of the transmitted field in z=dz=d gives the transmission coefficient of the Fabry-Perot when the field is incident from medium 1:

t=t12t23eikz(2)d[1+r21r23e2kz(2)d+(r21r23eikz(2)d)2+]=t12t23eikz(2)d1r21r23e2ikz(2)d.\begin{align*} t &= t_{12} t_{23} e^{i k^{(2)}_z d} \left[ 1 + r_{21}r_{23} e^{2 k_z^{(2)} d} + ( r_{21}r_{23} e^{i k^{(2)}_z d})^2 + \ldots \right] \\ &= \frac{ t_{12} t_{23} e^{i k^{(2)}_z d}}{1- r_{21} r_{23} e^{ 2 i k^{(2)}_z d}}. \end{align*}
Transmission coefficient versus the phase change \delta due to the Fabry-Perot. One can see the resonances occurring at every multiple of \pi.

Figure 13:Transmission coefficient versus the phase change δ\delta due to the Fabry-Perot. One can see the resonances occurring at every multiple of π\pi.

Finally, the electric field between the reflectors is given by

U(z)=t12eikz(2)z[1+r21r23e2ikz(2)d+(r21r23e2ikz(2)d)2++]+t12eikz(2)(dz)[1+r21r23e2ikz(2)d+(r21r23e2ikz(2)d)2++]=t12eikz(2)z+r23eikz(2)(dz)1r21r23e2ikz(2)d,\begin{align*} U(z) &= t_{12} e^{i k^{(2)}_z z} \left[ 1 + r_{21} r_{23} e^{2 i k^{(2)}_z d} + ( r_{21} r_{23} e^{2 i k^{(2)}_z d})^2+\ldots +\right] \\ & & + t_{12} e^{i k^{(2)}_z (d-z)}\left[ 1 + r_{21} r_{23} e^{2 i k^{(2)}_z d} + ( r_{21} r_{23} e^{2 i k^{(2)}_z d})^2+\ldots +\right] \\ &= t_{12} \frac{ e^{i k^{(2)}_z z} + r_{23} e^{i k^{(2)}_z(d-z)}} {1- r_{21} r_{23} e^{ 2 i k^{(2)}_z d}}, \end{align*}

where the factor exp[i(kxx+kyy)]\exp[i(k_x x+ k_y y)] which gives the dependence on (x,y)(x,y) has been omitted.

Define

G=(r12r23)2(1r23r21)2,\begin{align*} G &= \frac{(|r_{12}|-|r_{23}|)^2} {(1-|r_{23}||r_{21}|)^2}, \end{align*}
F=4r23r21(1r23r21)2.\begin{align*} \\ F &= \frac{ 4|r_{23}||r_{21}|}{(1-|r_{23}||r_{21}|)^2}.\end{align*}

FF is called the coefficient of Finesse of the Fabry-Perot. It is large when the mirrors are very good reflectors. The reflected and transmitted powers, relative to the incident power are then

R=r2=G+Fsin2(kz(2)d)1+Fsin2(kz(2)d),R=|r|^2 = \frac{G + F \sin^2(k^{(2)}_z d)}{1+ F \sin^2(k^{(2)}_z d)},

and

T=t2=1R2=1G1+Fsin2(kz(2)d).\begin{align*} T &= |t|^2 = 1- |R|^2 \\ &= \frac{1-G}{1+ F \sin^2(k^{(2)}_z d)}. \end{align*}

We define

δ=kz(2)d,\delta = k^{(2)}_z d,

which is the phase change due to one pass through the middle layer of the Fabry-Perot. Then Eq. (96) and Eq. (97) become

R=G+Fsin2(kz(2)d)1+Fsin2δ.R = \frac{G + F \sin^2(k^{(2)}_z d)}{1+ F \sin^2 \delta}.
T=1G1+Fsin2δ.T = \frac{1-G}{1+ F \sin^2 \delta}.

If the reflection by the mirrors is high: r211|r_{21}|\approx 1, r231|r_{23}|\approx 1, then FF is large. This implies

R1,T0,R \approx 1, \quad T\approx 0,

for all δ\delta except when sin(δ)=0\sin(\delta)=0, i.e. when

δ=mπ,\delta = m\pi,

for some positive integer mm. With k0=2π/λ0k_0=2\pi/\lambda_0 this becomes in terms of wavelength:

2dλ0n2cosθ2=m.\frac{2 d}{\lambda_0}n_2 \cos \theta_2 = m.

The wavelengths correspond to the maximum values of the transmission:

Tmax=1G.T_{max} = 1-G.

and they are therefore called resonances. The width Δδ\Delta \delta at a resonance is defined as the full width at half maximum (FWHM) of the transmission, i.e.

1G1+sin2(mπ+Δδ/2)=12(1G),\frac{1-G}{1+ \sin^2(m\pi + \Delta \delta/2)} = \frac{1}{2}(1-G),

which implies with sin2(mπ+Δδ/2)(Δδ/2)2\sin^2(m\pi + \Delta \delta/2) \approx (\Delta \delta/2)^2:

Δδ=2F.\Delta \delta = \frac{2}{\sqrt{F}}.

Using again k0=2π/λ0k_0=2\pi/\lambda_0 and the fact that the width in terms of wavelength is small:

Δλ0λ0λ0Δ(1λ0)==λ0Δδ2πn2dcosθ2=Δδmπ=2mπF\begin{align*} \frac{|\Delta \lambda_0|}{\lambda_0} &\approx & \lambda_0 \Delta\left(\frac{1}{\lambda_0}\right) \\ &= = \lambda_0 \frac{\Delta \delta}{2\pi n_2 d \cos\theta_2} \\ &= \frac{\Delta \delta}{m \pi} \\ &= \frac{2}{m \pi\sqrt{F}} \end{align*}

where Eq. (103) has been used. The resolution is defined as

Resolution=λ0Δλ0=mπr23r211r23r21.\text{Resolution} = \frac{\lambda_0}{|\Delta \lambda_0|} = \frac{m\pi \sqrt{r_{23}||r_{21}|}}{1-|r_{23}||r_{21}|}.

The free spectral range is the distance between adjacent resonances:

Δδfree=π\Delta \delta_{free} = \pi

With a similar derivation as for Eq. (107)

(Δλ0)freeλ0λ0Δ(1λ0)free=Δδfreemπ=1m.\begin{align*} \frac{|(\Delta \lambda_0)_{free}|}{\lambda_0}&\approx& -\lambda_0 \Delta\left(\frac{1}{\lambda_0}\right)_{free} \\ &= \frac{\Delta \delta_{free}}{m\pi} \\ &= \frac{1}{m}. \end{align*}

A Fabry-Perot can be used as a high resolution spectrometer. Eq. (108) implies that the resolution increases for higher order mm. However, MM can not be made arbitrary large because increasing mm means according to Eq. (110) that the free spectral range decreases. The ratio

(Δλ0)freeΔλ0=π2F,\frac{(\Delta \lambda_0)_{free}}{\Delta \lambda_0} = \frac{\pi}{2} \sqrt{F},

should therefore be large.

9.1Example: Fabry-Perot Resolution

For a wavelength of λ0=600 nm\lambda_0=600~\text{nm} and nfd=12 mmn_f d= 12~\text{mm} we have for normal incidence m=40000m=40000. Then, if the reflection coefficients satisfy r122=r232=0.9|r_{12}|^2=|r_{23}|^2=0.9, we have F=360F=360 and G=0G=0. The resolution is more than one million which is better than the grating spectrometers, which will be discussed in the Fresnel and Fraunhofer examples section of the Diffraction chapter.

Remark. Although in the derivation we have assumed that all refractive indices are real, the final formulae also apply to the case that n2n_2 is complex. In that case kz(2)k^{(2)}_z and the reflection coefficients are complex.

10Interference and polarization

In the study of interference we have so far ignored the vectorial nature of light by assuming that all the fields have the same polarization. Suppose now that we have two real vector fields E1\mathbf{\mathcal{E}}_1, E2\mathbf{\mathcal{E}}_2. The (instantaneous) intensity of each field is (apart from a constant factor) given by

E1E1,E2E2.\begin{align*} \mathbf{\mathcal{E}}_1\cdot \mathbf{\mathcal{E}}_1, \quad \mathbf{\mathcal{E}}_2\cdot \mathbf{\mathcal{E}}_2. \end{align*}

If the two fields interfere, the instantaneous intensity is given by

(E1+E2)(E1+E2)=E1E1+E2E2+2E1E2,\begin{align*} (\mathbf{\mathcal{E}}_1+\mathbf{\mathcal{E}}_2)\cdot(\mathbf{\mathcal{E}}_1+\mathbf{\mathcal{E}}_2) = \mathbf{\mathcal{E}}_1\cdot \mathbf{\mathcal{E}}_1+\mathbf{\mathcal{E}}_2\cdot \mathbf{\mathcal{E}}_2+2\mathbf{\mathcal{E}}_1\cdot \mathbf{\mathcal{E}}_2, \end{align*}

where 2E1E22\mathbf{\mathcal{E}}_1\cdot \mathbf{\mathcal{E}}_2 is the interference term. Suppose the polarization of E1\mathbf{\mathcal{E}}_1 is orthogonal to the polarization of E2\mathbf{\mathcal{E}}_2, e.g.

E1=(E1x),E2=(0E2y).\begin{align*} \mathbf{\mathcal{E}}_1=\begin{pmatrix}\mathcal{E}_{1x}\\ \end{pmatrix}, \quad \mathbf{\mathcal{E}}_2=\begin{pmatrix}0\\ \mathcal{E}_{2y} \\ \end{pmatrix}. \end{align*}

Then E1E2=0\mathbf{\mathcal{E}}_1\cdot \mathbf{\mathcal{E}}_2=0, which means the two fields can not interfere. This observation is the

Next we write the fields in terms of orthogonal components

E1=(E1E1),E2=(E2E2).\begin{align*} \mathbf{\mathcal{E}}_1=\begin{pmatrix}\mathcal{E}_{1\bot} \\ \mathcal{E}_{1\parallel}\end{pmatrix}, \quad \mathbf{\mathcal{E}}_2=\begin{pmatrix}\mathcal{E}_{2\bot} \\ \mathcal{E}_{2\parallel} \end{pmatrix}. \end{align*}

This is always possible, whether the fields are polarized or randomly polarized. Then Eq. (113) becomes

E1E1+E2E2+2E1E2=E12+E22+2E1E2+E12+E22+2E1E2.\begin{align*} \mathbf{\mathcal{E}}_1\cdot \mathbf{\mathcal{E}}_1+\mathbf{\mathcal{E}}_2\cdot \mathbf{\mathcal{E}}_2+2\mathbf{\mathcal{E}}_1\cdot \mathbf{\mathcal{E}}_2 =\mathcal{E}_{1\bot}^2 + \mathcal{E}_{2\bot}^2 + 2\mathcal{E}_{1\bot} \mathcal{E}_{2\bot} + \mathcal{E}_{1\parallel}^2 + \mathcal{E}_{2\parallel}^2 + 2\mathcal{E}_{1\parallel} \mathcal{E}_{2\parallel}. \end{align*}

If the fields are randomly polarized, the time average of the \bot-part will equal the average of the \parallel-part, so the time-averaged intensity becomes

I=2E12+E22+2E1E2=2E12+E22+2E1E2\begin{align*} \begin{split} I &= 2\langle \mathcal{E}_{1\bot}^2 + \mathcal{E}_{2\bot}^2 + 2\mathcal{E}_{1\bot} \mathcal{E}_{2\bot} \rangle \\ &= 2\langle \mathcal{E}_{1\parallel}^2 + \mathcal{E}_{2\parallel}^2 + 2\mathcal{E}_{1\parallel} \mathcal{E}_{2\parallel} \rangle \end{split} \end{align*}

This is qualitatively the same as what we would get if the fields had parallel polarization, e.g.

E1=(E10),E2=(E20).\begin{align*} \mathbf{\mathcal{E}}_1=\begin{pmatrix}\mathcal{E}_{1\bot} \\ 0\end{pmatrix}, \quad \mathbf{\mathcal{E}}_2=\begin{pmatrix}\mathcal{E}_{2\bot} \\ 0\end{pmatrix}. \end{align*}

This leads to the

This indicates that our initial assumption in the previous sections that all our fields have parallel polarization is not as limiting as it may have appeared at first.

Suppose now that we have some field

E=(EE),\begin{align*} \mathbf{\mathcal{E}}=\begin{pmatrix}\mathcal{E}_{\bot} \\ \mathcal{E}_{\parallel}\end{pmatrix}, \end{align*}

which is randomly polarized. Suppose we separate the two polarizations, and rotate one so that the two resulting fields are aligned, e.g.

E1=(E0),E2=(E0).\begin{align*} \mathbf{\mathcal{E}}_1=\begin{pmatrix}\mathcal{E}_{\bot} \\ 0\end{pmatrix}, \quad \mathbf{\mathcal{E}}_2=\begin{pmatrix}\mathcal{E}_{\parallel} \\ 0\end{pmatrix}. \end{align*}

These fields can not interfere because E\mathcal{E}_{\bot} and E\mathcal{E}_{\parallel} are incoherent. This leads to the

11Chapter Summary

See Veritasium - The original double-slit experiment, starting at 2:15 - Demonstration of an interference pattern obtained with sunlight.

12References

Footnotes
  1. MIT OCW - Fringe Contrast - Path Difference: Demonstration of how fringe contrast varies with propagation distance

  2. MIT OCW - Coherence Length and Source Spectrum: Demonstration of how the coherence length depends on the spectrum of the laser light.

References
  1. Born, M., & Wolf, E. (1999). Principles of Optics (7th ed.). Cambridge University Press.
  2. Michelson, A. A. (1920). On the Application of Interference Methods to Astronomical Measurements. The Astrophysical Journal, 51, 257–262.
  3. Fresnel, A., & Arago, F. (1819). Mémoire sur l’action que les rayons de lumière polarisée exercent les uns sur les autres. Annales de Chimie et de Physique, 2, 288–314.
  4. Young, T. (1804). The Bakerian Lecture: Experiments and Calculations Relative to Physical Optics. Philosophical Transactions of the Royal Society of London, 94, 1–16.
  5. Mandel, L., & Wolf, E. (1995). Optical Coherence and Quantum Optics. Cambridge University Press.