1Introduction¶
Although the model of geometrical optics helps us to design optical systems and explains many phenomena, there are properties of light that require a more elaborate model. For example, interference fringes observed in Young’s double-slit experiment or the Arago spot (Figure 3) indicate that light is more accurately modeled as a wave.

Arago spot observed with a 4 mm diameter circular disc. The bright central spot appears at the center of the disc’s shadow, demonstrating the wave nature of light through constructive interference of diffracted waves. Image captured at 1 m distance using 633 nm laser light.

Arago spot observed with a 2 mm diameter circular disc. The relative size of the central bright spot increases compared to the disc diameter, as diffraction effects become more pronounced with smaller obstacles. Image captured at 1 m distance using 633 nm laser light.

Figure 3:The Arago spot is the bright spot which occurs at the center of the shadow of a circular disc and which is caused by diffraction. The disc has diameter 4 mm, 2 mm and 1 mm, from left to right, the wavelength is 633 nm and the intensity is recorded at 1 m behind the disc and has width of 16 mm
In this chapter we will study the wave model of light. It will be shown that the extent to which light can show interference depends on a property called coherence. In the largest part of the discussion we will assume that all light has the same polarization, so that we can treat the fields as scalar. In the last section we will look at how polarization affects interference, as described by the Fresnel-Arago laws.
It is important to note that the concepts of interference and coherence are not just restricted to optics. Since quantum mechanics dictates that particles have a wave-like nature, interference and coherence also play a role in e.g. solid state physics and quantum information.
2Interference of Monochromatic Fields of the Same Frequency¶
Let us first recall the basic concepts of interference. What causes interference is the fact that light is a wave, which means that it not only has an amplitude but also a phase. Suppose for example we evaluate a time-harmonic field in two points
Here denotes the phase difference between the fields at the two points. If , or is a multiple of , the fields are in phase, and when they are added they interfere constructively
However, when , or more generally , for some integer , then the waves are out of phase, and when they are superimposed, they interfere destructively.
We can sum the two fields for arbitrary more conveniently using complex notation:
Adding gives
For and we retrieve the results obtained before. It is important to realize that what we see or detect physically (the ‘brightness’ of light) does not correspond to the quantities , . After all, and can attain negative values, while there is no such thing as ‘negative brightness’. What and describe are the fields, which may be positive or negative. The ‘brightness’ or the irradiance or intensity is given by taking an average over a long time of (as discussed in Chapter 1), we shall omit the factor . As explained in Chapter 1, we see and measure only the long time-average of , because at optical frequencies fluctuates very rapidly. We recall the definition of the time average over an interval of length at a specific time from Chapter 1:
where is a time interval that is the response time of a typical detector, i.e. which is extremely long compared to the period of visible light which is of the order of . For a time-harmonic function, the long-time average is equal to the average over one period of the field and hence it is independent of the time at which it is taken. Indeed for Eq. (5) we get
Using complex notation one can obtain this result more easily. Let
where
Then we find
hence
To see why this works, recall the time averaging formula and choose .
Remark. To shorten the formulae, we will omit in this chapter the factor in front of the time-averaged intensity.
Hence we define and , and we then find for the time-averaged intensity of the sum of and :
where and are the arguments of and and is the phase difference. The term is known as the interference term. In the famous double-slit experiment (which we will discuss in a later section), we can interpret the terms as follows: let us say is the field that comes from slit 1, and comes from slit 2. If only slit 1 is open, we measure on the screen intensity , and if only slit 2 is open, we measure . If both slits are open, we would not measure , but we would observe fringes due to the interference term .
The intensity Eq. (12) varies when the phase difference varies. These variations are called fringes. The fringe contrast is defined by
It is maximum and equal to 1 when the intensities of the interfering fields are the same. If these intensities are different the fringe contrast is less than 1.
More generally, the intensity of a sum of multiple time-harmonic fields all having the same frequency is given by the coherent sum
However, we will see in the next section that sometimes the fields are unable to interfere. In that case all the interference terms of the coherent sum vanish, and the intensity is given by the incoherent sum
3Coherence¶
In the discussion so far we have only considered monochromatic light, which means that the spectrum of the light consists of only one frequency. Although light from a laser often has a very narrow band of frequencies and therefore can be considered to be monochromatic, purely monochromatic light does not exist. One reason that light can not be perfectly monochromatic is that any source must have been switched on a finite time ago. Hence, all light consists of multiple frequencies and therefore is polychromatic. Classical light sources such as incandescent lamps and also LEDs have relatively broad frequency bands. The question then arises how differently polychromatic light behaves compared to the idealized case of monochromatic light. To answer this question, we must study the topic of coherence. One distinguishes between two extremes: fully coherent and fully incoherent light, while the degree of coherence of practical light is somewhere in between. Generally speaking, the broader the frequency band of the source, the more incoherent the light is. It is a very important observation that no light is actually completely coherent or completely incoherent. All light is partially coherent, but some light is more coherent than others.
An intuitive way to think about these concepts is in terms of the ability to form interference fringes. For example, with laser light, which usually is almost monochromatic and hence coherent, one can form an interference pattern with clear maxima and minima in intensities using a double slit, while with sunlight (which is incoherent) this is much more difficult. Every frequency in the spectrum of sunlight gives its own interference pattern with its own frequency dependent fringe pattern. These fringe patterns wash out due to superposition and the total intensity therefore shows little fringe contrast, i.e. the coherence is less. However, it is not impossible to create interference fringes with natural lightYoung, 1804. The trick is to let the two slits be so close together (of the order of ) that the difference in distances from the slits to the sun is so small for the fields in the slits to be sufficiently coherent to interfere. To understand the effect of polychromatic light, it is essential to understand that the degree to which the fields in two points are coherent, i.e. the ability to form fringes, is determined by the difference in distances between these points and the source. The distance itself to the source is not relevant. This will be made clear in this chapter.
3.1Coherence of Light Sources¶
In a conventional light source such as a gas discharge lamp, photons are generated by spontaneous emission with energy equal to the energy difference between certain electronic states of the atoms of the gas. These transitions have a duration of the order of 10-8 to . Because the emitted wave trains are finite, the emitted light does not have a single frequency; instead, there is a band of frequencies around a center frequency with width roughly equal to the reciprocal of the duration of the wave train. This spread of frequencies is called the natural linewidth. Random thermal motions of the molecules cause further broadening due to the Doppler effect. In addition, the atoms undergo collisions that interrupt the wave trains and therefore further broaden the frequency spectrum.
We first consider a single emitting atom. When collisions are the dominant broadening effect and these collisions are sufficiently brief, so that any radiation emitted during the collision can be ignored, an accurate model for the emitted wave is a steady monochromatic wave train at frequency at the center of the frequency band, interrupted by random phase jumps each time that a collision occurs. The discontinuities in the phase due to the collisions cause a spread of frequencies around the center frequency. An example is shown in Figure 4. The average time between the collisions is typically less than 10-10 s which implies that on average between two collisions roughly 106 harmonic oscillations occur and that during an atom transition of the order of hundred collisions may occur. The coherence time is defined as the maximum time interval over which the phase of the electric field can be predicted. In the case of collisions-dominated emission by a single atom, the coherence time is equal to the average time between subsequent collisions: s.
To understand coherence and incoherence, it is helpful to use this model for the emission by a single atom as harmonic wave trains of many thousands of periods interrupted by roughly hundred random phase jumps. Due to the random phase jumps, the interference term of the sum of harmonic wave trains emitted by two atoms when integrated of the relatively long integration time of a detector becomes a sum over integrals over time intervals of average length :
where the sum is over roughly one hundred random phase jumps during the total duration of the wave trains. The random phase jumps lead to cancellation of the integrals and hence the interference term vanishes. We conclude that over the integration time of typical detectors

Figure 4:The electric field amplitude of the harmonic wave train radiated by a single atom at the center frequency . The vertical lines are collisions separated by periods of free flight with mean duration . The quantity , which is the number of periods in a typical wave train, is chosen unrealistically small (namely 60, whereas a realistic value would be 105) to show the random phase changes.
The coherence time and the width of the frequency line are related as
The coherence length is defined by
Since , we have
where and are the wavelength and the frequency at the center of the line. Hence,
The coherence length and coherence time of a number of sources are listed in Table 1. For a laser, the linewidth is extremely small and the coherence time very long. This is because the photons in a laser are not generated predominantly by spontaneous emission as classical sources, but instead by stimulated emission. Lasers are discussed in .
Table 1:Coherence time and coherence length of several sources
| Source | Mean wavelength | Linewidth | Coherence Length | Coherence Time |
|---|---|---|---|---|
| Mid-IR (3-5 ) | 4.0 | 8.0 | s. | |
| White light | 550 nm | nm | nm | s. |
| Mercury arc | 546.1 nm | nm | mm | s. |
| discharge lamp | 605.6 nm | nm | 0.3 m | s. |
| Stabilized He-Ne laser | 632.8 nm | nm | 400 m | s. |
3.2Polychromatic Light¶
When dealing with coherence one has to consider fields that consist of a range of different frequencies. Let be the real-valued field component. It is always possible to write as an integral over time-harmonic components:
where is the complex amplitude of the time-harmonic field with frequency . When there is only a certain frequency band that contributes, then for outside this band. We define the complex time-dependent field by
Then
Remark: The complex field contains now the time dependence in contrast to the notation used for a time-harmonic (i.e. single frequency) field introduced in Chapter 2, where the time-dependent was a separate factor.
We now compute the intensity of polychromatic light. The instantaneous energy flux is (as for monochromatic light) proportional to the square of the instantaneous real field: . We average the instantaneous intensity over the integration time of common detectors which, as stated before, is very long compared to the period at the center frequency of the field. Using Eq. (6) and
we get
where the averages of and are zero because they are fast-oscillating and go through many cycles during the integration time of the detector. In contrast, has a DC-component which does not average to zero.
Remark: In contrast to the time-harmonic case, the long time average of polychromatic light depends on the time at which the average is taken. However, we assume in this chapter that the fields are omitted by sources that are stationary. The property of stationarity implies that the average over the time interval of long length does not depend on the time that the average is taken. Many light sources, in particular conventional lasers, are stationary. (However, a laser source which emits short high-power pulses cannot be considered as a stationary source). We furthermore assume that the fields are ergodic, which means that taking the time-average over a long time interval amounts to the same as taking the average over the ensemble of possible fields. It can be shown that this property implies that the limit in Eq. (6) indeed existsMandel & Wolf, 1995.
We use for the intensity again the expression without the factor in front, i.e.
The time-averaged intensity has hereby been expressed in terms of the time-average of the squared modulus of the complex field.
Quasi-monochromatic field. If the width of the frequency band is very narrow compared to the center frequency , we speak of a quasi-monochromatic field. In the propagation of quasi-monochromatic fields, we use the formula for time-harmonic fields at . The quasi-monochromatic assumption simplifies the computations considerably and will therefore be used frequently.
4Temporal Coherence and the Michelson Interferometer¶
To investigate the time coherence of a field in a certain point , we let the field in that point interfere with itself but delayed in time, i.e. we let interfere with . Because, when studying temporal coherence, the point is always the same, we omit it from the formula. Furthermore, for easier understanding of the phenomena, we assume for the time being that the field considered is emitted by a single atom (i.e. a point source).
Temporal coherence is closely related to the spectral content of the light: if the light consists of fewer frequencies (think of monochromatic light), then it is more temporally coherent. To study the interference of with , a Michelson interferometer, shown in Figure 5, is a suitable setup. The light that goes through one arm takes time to reach the detector, while the light that goes through the other (longer) arm takes time which means that it was radiated earlier. Therefore, the detector observes the time-averaged intensity . As remarked before, this averaged intensity does not depend on the time the average is taken, it only depends on the time difference between the two beams.

Figure 5:A Michelson interferometer to study the temporal coherence of a field. A beam is split in two by a beam splitter, and the two beams propagate over different distances which corresponds to a time difference and then interfere at the detector.
We have
The detected intensity varies with the difference in arm length.
So far we have considered a field that originates from a single atom. The total field emitted by an extended source is the sum of fields corresponding to all atoms . As has been explained already, the fields emitted by different atoms can not interfere. But the field emitted by an atom can interfere with the delayed field of that same atom and for every atom the interference is given by the same expression Eq. (28). The total intensity is simply that given by that of a single atom multiplied by the number of atoms. In particular, the ratio of the interference term and the other terms is the same for the entire source as for a single atom.
The self coherence function is defined by
The intensity of is
The complex degree of self-coherence is defined by:
Using Bessel’s inequality it can be shown that this is a complex number with modulus between 0 and 1:
The observed intensity can then be written:
We consider two special cases.
Suppose is a monochromatic wave
In that case we get for the self-coherence
and
Hence the interference pattern is given by
So for monochromatic light we expect to detect a cosine interference pattern, which shifts as we change the arm length of the interferometer (i.e. change ). No matter how large the time delay , a clear interference pattern should be observed.
Next we consider what happens when the light is a superposition of two frequencies:
where , where is integration time of the detector. Then:
where in the second line the time average of terms that oscillate with time is set to zero because the averaging is done over time interval satisfying . Hence, the complex degree of self-coherence is:
and Eq. (33) becomes
The interference term is the product of the function , which is a rapidly oscillating function of , and a slowly varying envelope . It is interesting to note that the envelope, and hence , vanishes for some periodically spaced , which means that for certain the degree of self-coherence vanishes and no interference fringes form[3],[4]. Note that when is larger the intervals between the zeros of decrease. If more frequencies are added, the envelope function is not a cosine function but on average decreases with . The typical value of below which interferences are observed is roughly equal to half the first zero of the envelope function. This value is called the coherence time . We conclude with some further interpretations of the degree of self-coherence .
Remarks.
In stochastic signal analysis is called the autocorrelation of . Informally, one can interpret the autocorrelation function as the ability to predict the field at time given the field at time .
The Wiener-Khinchin theorem says that (under the assumption of ergodicity and for stationary fields) the Fourier transform of the self coherence function is the spectral power density of :
Using the uncertainty principle, we can see that the larger the spread of the frequencies of (i.e. the larger the bandwidth), the more sharply peaked is. Thus, the light gets temporally less coherent when it consists of a broader range of frequencies. Measuring the spectral power density with a spectroscope and applying a back Fourier transform is an alternative method to obtain the complex self-coherence function.
5Spatial Coherence and Young’s Experiment¶
Temporal coherence concerns the coherence of the field in one point. The absolute value of the degree of self coherence Eq. (31) quantifies how strong the interference is of the field in the point of interest with the field in that same point at a later time. In contrast, spatial coherence is concerned with determining how coherent the fields in two different points are. This is done by letting the fields interfere using a mask with two small holes at the positions of the points of interest and observing the fringe contrast at a distant screen (Young’s experiment).
While for temporal coherence we used a Michelson interferometer, the natural choice to characterize spatial coherence is Young’s experiment, because it allows the fields in two points , which are separated in space to interfere with each other.

Figure 6:The spatial coherence of light from an extended source.
Let and be the position vectors of the points and , respectively. We write the complex field in as a superposition of monochromatic fields as in Eq. (22):
The reason for doing this is that for a monochromatic field in the pinhole, i.e. a field with a well defined frequency, we can derive the disturbance in any point behind the mask. In fact, according to the Huygens-Fresnel Principle, a monochromatic disturbance with frequency in the pinhole at generates a radiating spherical wave with the same frequency , such that in a point behind the mask the field is:
where is the surface area of the pinhole. We assume that the frequency band is sufficiently small such that the frequency factor that multiplies can be replaced by the center frequency . Note that this should not be done in the exponent in Eq. (44) because an error in the phase can easily lead to large errors in the total field. The total field in due to the pinhole at is obtained by integrating the monochromatic components over frequency:
In words:
For the field in due to pinhole 2 we have similarly
The total field in is the sum . Because of the difference in propagation distance , there is a time difference between when the two fields have been emitted by the two pinholes when they arrive at a given time IN point on the screen in Figure 6. This time difference is given by
Furthermore, the amplitudes are reduced by a factor proportional to the reciprocal distance which is different for the two fields. But if the distance of the screen to the mask is large enough, we may assume these factors to be the same and then omit them. Using Eq. (47), the interference pattern on the screen is then, apart from a constant factor, given by
where in the third and last line we used that the time average does not depend on the time it is taken because the light source is assumed to be stationary. We define the mutual coherence function by:
With the intensities
the complex degree of mutual coherence is defined by
It can be proved using Bessel’s inequality that
We can now write Eq. (48) as
By varying the point over the screen we can vary and by measuring the intensities we can determine the real part of and hence the fringe contrast observed on the screen.
As an example, consider what happens when is a monochromatic field
In that case
and
So we get
where is the phase difference of and . In this case has modulus 1, as expected for a monochromatic field. The intensity on the screen becomes
So indeed we see interference fringes with maximum contrast 1 and hence the fields in and are fully coherent as one would expect for a monochromatic wave. If , then interference maxima occur for
Because , and , we find that maxima occur when
For large distance between the screen and the mask (in the Fraunhofer limit), these path length differences correspond to directions of the maxima given by the angles (see Figure 6):
where is the distance between the slits and is an integer[5].
Remarks.
The mutual coherence is the cross-correlation of the two signals and .
As remarked above, by moving the point of observation over the screen, one can obtain the real part of the complex degree of mutual coherence. To derive also the imaginary part, one can put a piece of glass behind one of the pinholes with thickness such that for the center frequency an additional phase difference of is obtained of the fields in and . If the frequency band is sufficiently narrow this phase difference applies in good approximations to all frequencies in the band.
6More on Spatial Coherence¶
We first consider the case that the source is so small (e.g. a single emitting atom) that it can be considered to be a point source . In that case it is the fields in two points , somewhere in space are coherent if and only if the difference in time that it takes for light to propagate from to and from to is less than the coherence time . Equivalently, for coherence the difference between the distances and must be less than the coherence length .
An extended classical light source consists of a large set of emitting point sources that emit by spontaneous emission. As we have explained in Section 3.1, the wave trains emitted by different atoms (point sources) in the source suffer random phase jumps due to e.g. collisions and therefore the fields emitted by different point sources in an extended classical light source can not interfere. Such a light source is called spatially incoherent. For a spatially incoherent light source, the spatial coherence in any two points and is determined by measuring the fringe contrast on a distance screen when a mask is used that is perpendicular to the mean direction of propagation of the light and which contains pinholes at and . The fringe contrast and hence the mutual coherence at and is determined by two effects:
First of all it is determined by how coherent the contributions to the total field in and is of the individual point sources in the extended source. This coherence is determined by the extent to which the difference between the distance of to and of to is smaller than the coherence length. If these differences in distances are for all point sources larger than the coherence length, the fringe contrast on the screen n Young’s experiment will be very low and hence the mutual coherence is very low.
The second effect is the size of the extended source. Even if for all point sources in the source the fields in and are coherent, the coherence of the total fields at and due to the entire source can be small. As we know, the contributions of different point sources can not interfere. Hence the intensity observed in Young’s experiment is the sum of the intensities due to the individual point sources in the extended source. The reason that the coherence of the total fields in and due to the entire extended source can be low even though for all point sources individually the mutual coherence in and is high, is that the fringe patterns due to the point sources are shifted with respect to each other which reduces the fringe contrast and hence the mutual coherence. The shift of the fringe patterns is due to the different positions in the extended source of the point sources which cause that the phase difference between the fields in and varies with the point sources.
We will show that when and have a large distance to the extended source, the two conditions mentioned above for the fields in and to be spatially mutually coherent are equivalently to the requirement that:
To show this we consider two mutually incoherent point sources and in the plane. Their mutual coherence function satisfies:
where is the self-coherence which we assume to be the same for both point sources. has width given by the coherence time of the source and on average decreases with (although not always monotonically). Eq. (62) expresses the fact that two point sources are mutually incoherent. Using that the long-time average does not depend on the origin of time which was based on the assumption that the source is stationary, we find:
Furthermore, for : , which is the intensity of either source.
We assume for convenience that the two points , to be at a large distance from the two point sources and that the line is parallel to the extended source as shown in Figure 7. We will compute the mutual coherence for zero time delay (we can also compute the mutual coherence for more general time delays , i.e. , but it will suffice for our purpose to take ). The fields in and are the sum of the fields emitted by and . Since and are point sources they emit spherical waves. Therefore, similarly to Eq. (45) we find that the field in is proportional to
and
where we omitted the constant factors in front of Eq. (45).

Figure 7:Two incoherent point sources , at a distance from each other and two points , in a plane at large distance from the point sources.
For sufficiently large all distances in the denominators may be replaced by and then these equal distances can be omitted. By substituting Eq. (65) and Eq. (66) into Eq. (49) with , we find for the mutual coherence of and :
Now we use Eq. (62) and Eq. (63) to get
Similarly,
Since the width of the self coherence function is the coherence time , result Eq. (68) confirms that for the fields in and to be coherent, the difference in distance of each of the source points to points and should be smaller than the coherence length . To express the result in the angle subtended by the source at the midpoint of we choose coordinates such that for . If the distance to the source is so large that and are almost parallel, we see from Figure 7 that
Similarly,

Figure 8:For very large, and are almost parallel and .
Hence, with , Eq. (68) becomes
We conclude that for the fields in and to be coherent, the product of the angle which the source subtends at the midpoint of and the distance of should be smaller than the coherence length . The smaller this product is the higher the degree of spatial coherence of and .
The angle decreases when the distance to the sources is increased and/or when the size of the source is decreased. Loosely speaking one can say that as the light propagates, it becomes more coherent. In both cases when the distance to the source increases and when the size of the source is decreased, the difference in distance of all point source to and decreases and will ultimately become smaller than the coherence length. Furthermore, for smaller the fringe patterns on the distant screen in Young’s experiment due to different point sources more strongly overlap which leads to a stronger overall fringe contrast.
As example consider quasi-monochromatic light for which (see Eq. (35)):
where is the center frequency. In this case the coherence length of the source is so large that the contributions to the total field of all individual point sources are coherent. Hence the only remaining criterion for coherence of the total fields in and is that the fringe patterns due to the different point sources in Young’s experiment sufficiently overlap. Indeed, in this case of very long coherence time we have
and hence the degree of mutual coherence is:
We see that when
the fields in and are at least partially mutually coherent.
6.1Example: Solar Coherence Length¶
We determine the maximum distance between two points on earth for which sun light is coherent. The sun subtends on earth the angle:
where and are the distance of the sun to the earth and the radius of the sun. Hence, for green light and by requiring
for appreciable mutual coherence, we find micron.
7Stellar Interferometry¶
The property that the spatial coherence of two points decreases for increasing angle which the source subtends halfway between the two points, is used in stellar interferometry. It works as follows: we want to know the size of a certain star. The size of the star, being an extended spatially incoherent source, determines the spatial coherence of the light we receive on earth. Thus, by measuring the interference of the light collected by two transversely separated telescopes, one can effectively create a double-slit experiment, with which the degree of spatial coherence of the star light on earth can be measured, and thereby the angle which the star subtends on earth. The resolution in retrieving the angle from the spatial coherence is larger when the distance between the telescopes is larger (see Eq. (75)). Then, if we know the distance of the star by independent means, e.g. from its spectral brightness, we can deduce its size from its angular size.
The method can also be used to derive the intensity distribution at the surface of the star. It can be shown that the degree of spatial coherence as function of the relative position of the telescopes is the Fourier transform of this intensity distribution. Hence, by moving the telescopes around and measuring the spatial coherence for many positions, the intensity distribution at the surface of the star can be derived from a back Fourier transform.

Figure 9:Left: a stellar interferometer with two telescopes that can be moved around to measure the interference at many relative positions. Right: single telescope with two outer movable mirrors. The telescope can move around it axis. The larger the distance the higher the resolution.
8Fringe contrast¶
We have seen that when the interference term vanishes, no fringes form, while when this term is nonzero, there are fringes. The fringe contrast is expressed directly in measurable intensities. Given some interference intensity pattern as in Figure 10, the fringe contrast is defined as
For example, if we have two perfectly coherent, monochromatic point sources emitting the fields , with intensities , , then the interference pattern is with Eq. (58):
We then get
so
In case , we find .
In contrast, when and are completely incoherent, we find
from which follows
which gives .

Figure 10:Illustration of and of an interference pattern that determines the fringe contrast.
9Fabry-Perot resonator¶
In interferometry two mutually coherent waves are added and the intensity of the sum of the two fields is measured. This intensity contains information about the phase difference of the waves from which for example a path length difference can be deduced. One distinguishes between two types of interferometers: wavefront splitting interferometers and amplitude splitting interferometers. Examples of the first type are Young’s two slit experiment and Lloyd’s mirror (Figure 11). Examples of amplitude splitting interferometers are the Michelson interferometer and the Fabry-Perot interferometer. The latter is not only a spectrometer of extremely high resolution but is also the resonance cavity in a laser.

Figure 11:Lloyd’s mirror as example of wavefront splitting interferometry.
A Fabry-Perot interferometer consists of two parallel highly reflecting surfaces with vacuum or a dielectric in between. These surfaces can be optical flats which have been coated by a metal like silver on one side. Consider a coordinate system as in Figure 12 such that the reflecting surfaces are at and . The refractive indices of the half spaces and are and , respectively, and the refractive index of the medium between the surfaces is . We will first assume that all refractive indices are real.
Let there be a plane wave with unit amplitude incident from under angle with the normal as shown in Figure 12. The incident wave is assumed to be either s- or p-polarized. There are a reflected plane wave in , two plane waves in medium 2 one propagating in the positive -direction and the other in the negative -direction and there is a transmitted plane wave in . It follows from the boundary conditions that the tangential component of the electric and magnetic field are continuous across the interfaces, that the tangential components of the wave vectors of all these plane waves are identical.
Let and be the reflection and transmission coefficient for a wave that is incident from medium on the interface with medium . When the wave is s-polarized, and are given by the Fresnel coefficients (see the Rayleigh-Sommerfeld Diffraction Integral section in the Diffraction chapter), whereas if the wave is p-polarized, they are given by the p-polarized Fresnel coefficients.

Figure 12:Fabry-Perot with 3 layers.The light comes from the bottom and is reflected by each interface.
The incident wave, which has amplitude 1 in point A, is partially reflected and partially transmitted by the interface . The reflected wave gets amplitude . The transmitted field propagates in medium to the interface at and is then partially reflected with reflection coefficient back to the interface . Because the path length inside medium 2 is , the complex amplitude B of this wave in point B after transmission by the interface is
where is the wave number in vacuum. To compute the interference of the directly reflected wave and the wave that has made one round trip in medium 2, the two fields should be evaluated at the same wavefront such as wavefront CB in Figure 12. The directly reflected field in C is obtained by propagating from B over the distance
where Snell’s law: has been used. Hence the total field due to the direct reflection at and one round trip Eq. (85)
The common phase factor in front of the brackets may be omitted since it does not influence the reflected intensity. We then obtain
where,
is the -component of the wave vector in medium 2 of the wave that propagates in the postive -direction.
Incorporating the contributions of waves having made two or more round trips in the slab leads to the reflection coefficient of the Fabry-Perot when the field is incident from medium 1:
where in the last step we used
Similarly, the amplitude of the transmitted field in gives the transmission coefficient of the Fabry-Perot when the field is incident from medium 1:

Figure 13:Transmission coefficient versus the phase change due to the Fabry-Perot. One can see the resonances occurring at every multiple of .
Finally, the electric field between the reflectors is given by
where the factor which gives the dependence on has been omitted.
Define
is called the coefficient of Finesse of the Fabry-Perot. It is large when the mirrors are very good reflectors. The reflected and transmitted powers, relative to the incident power are then
and
We define
which is the phase change due to one pass through the middle layer of the Fabry-Perot. Then Eq. (96) and Eq. (97) become
If the reflection by the mirrors is high: , , then is large. This implies
for all except when , i.e. when
for some positive integer . With this becomes in terms of wavelength:
The wavelengths correspond to the maximum values of the transmission:
and they are therefore called resonances. The width at a resonance is defined as the full width at half maximum (FWHM) of the transmission, i.e.
which implies with :
Using again and the fact that the width in terms of wavelength is small:
where Eq. (103) has been used. The resolution is defined as
The free spectral range is the distance between adjacent resonances:
With a similar derivation as for Eq. (107)
A Fabry-Perot can be used as a high resolution spectrometer. Eq. (108) implies that the resolution increases for higher order . However, can not be made arbitrary large because increasing means according to Eq. (110) that the free spectral range decreases. The ratio
should therefore be large.
9.1Example: Fabry-Perot Resolution¶
For a wavelength of and we have for normal incidence . Then, if the reflection coefficients satisfy , we have and . The resolution is more than one million which is better than the grating spectrometers, which will be discussed in the Fresnel and Fraunhofer examples section of the Diffraction chapter.
Remark. Although in the derivation we have assumed that all refractive indices are real, the final formulae also apply to the case that is complex. In that case and the reflection coefficients are complex.
10Interference and polarization¶
In the study of interference we have so far ignored the vectorial nature of light by assuming that all the fields have the same polarization. Suppose now that we have two real vector fields , . The (instantaneous) intensity of each field is (apart from a constant factor) given by
If the two fields interfere, the instantaneous intensity is given by
where is the interference term. Suppose the polarization of is orthogonal to the polarization of , e.g.
Then , which means the two fields can not interfere. This observation is the
Next we write the fields in terms of orthogonal components
This is always possible, whether the fields are polarized or randomly polarized. Then Eq. (113) becomes
If the fields are randomly polarized, the time average of the -part will equal the average of the -part, so the time-averaged intensity becomes
This is qualitatively the same as what we would get if the fields had parallel polarization, e.g.
This leads to the
This indicates that our initial assumption in the previous sections that all our fields have parallel polarization is not as limiting as it may have appeared at first.
Suppose now that we have some field
which is randomly polarized. Suppose we separate the two polarizations, and rotate one so that the two resulting fields are aligned, e.g.
These fields can not interfere because and are incoherent. This leads to the
11Chapter Summary¶
Interference occurs when two or more coherent waves overlap; the resulting intensity depends on their relative phase.
Temporal coherence measures how well a wave correlates with itself over time; related to bandwidth by .
Coherence length is the path difference over which fringes remain visible.
Spatial coherence measures correlation between different points in a wave field at the same time.
Young’s double-slit experiment: Fringe spacing , where is screen distance and is slit separation.
Michelson interferometer measures path differences and coherence length; used for spectroscopy and surface metrology.
Visibility (fringe contrast) quantifies interference quality.
Van Cittert-Zernike theorem: The degree of spatial coherence equals the Fourier transform of the source intensity distribution.
Fabry-Perot interferometer uses multiple-beam interference for high-resolution spectroscopy; resolution increases with mirror reflectivity.
Fresnel-Arago Laws: Orthogonal polarizations cannot interfere; parallel polarizations interfere like unpolarized light.
See Veritasium - The original double-slit experiment, starting at 2:15 - Demonstration of an interference pattern obtained with sunlight.
12References¶
MIT OCW - Fringe Contrast - Path Difference: Demonstration of how fringe contrast varies with propagation distance
MIT OCW - Coherence Length and Source Spectrum: Demonstration of how the coherence length depends on the spectrum of the laser light.
- Born, M., & Wolf, E. (1999). Principles of Optics (7th ed.). Cambridge University Press.
- Michelson, A. A. (1920). On the Application of Interference Methods to Astronomical Measurements. The Astrophysical Journal, 51, 257–262.
- Fresnel, A., & Arago, F. (1819). Mémoire sur l’action que les rayons de lumière polarisée exercent les uns sur les autres. Annales de Chimie et de Physique, 2, 288–314.
- Young, T. (1804). The Bakerian Lecture: Experiments and Calculations Relative to Physical Optics. Philosophical Transactions of the Royal Society of London, 94, 1–16.
- Mandel, L., & Wolf, E. (1995). Optical Coherence and Quantum Optics. Cambridge University Press.