# Appendix 2: The Principle of Least Squares

January 1, 2025
## Least Squares and Sample Means

Suppose we make $N$ measurements, $x_i$, of a quantity. To find the value $X$ whose deviations from our measurements are minimized according to the principle of least squares, we require

$$\sum (x_i - X)^2 = \text{minimum}$$

Let $\bar{x}$ denote the mean of the measurements. We can rewrite the sum of squared deviations as

$$\sum (x_i - X)^2 = \sum\left[(x_i - \bar{x}) + (\bar{x} - X)\right]^2$$

Expanding the squared term:

$$\sum (x_i - X)^2 = \sum\left[(x_i - \bar{x})^2 + (\bar{x} - X)^2 + 2(x_i - \bar{x})(\bar{x} - X)\right]$$

The cross term vanishes because $\sum (x_i - \bar{x}) = 0$ by definition of the mean, so

$$\sum (x_i - X)^2 = \sum (x_i - \bar{x})^2 + N(\bar{x} - X)^2$$

The first term on the right does not depend on $X$, and the second is never negative, so the sum is smallest when $X = \bar{x}$: the least-squares estimate of a repeatedly measured quantity is simply the sample mean.
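As a quick numerical check, here is a minimal Python sketch (NumPy assumed; the simulated data and the helper name `sum_sq_dev` are just for illustration) that scans candidate values of $X$ and confirms the minimum lands at the sample mean:

```python
import numpy as np

rng = np.random.default_rng(0)
x = rng.normal(loc=10.0, scale=2.0, size=50)  # N = 50 simulated measurements

def sum_sq_dev(X, x):
    """Sum of squared deviations of the measurements x from a candidate value X."""
    return np.sum((x - X) ** 2)

# Scan candidate values of X and pick the one minimizing the sum of squares.
candidates = np.linspace(x.min(), x.max(), 10001)
best = candidates[np.argmin([sum_sq_dev(X, x) for X in candidates])]

print(best, x.mean())  # the two values agree to within the scan resolution
```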
## Fitting a Straight Line Using Least Squares

Consider a set of observations $(x_i, y_i)$ that we wish to fit with a linear relationship:

$$y = mx + b$$

We'll assume that uncertainty exists only in the $y$ values, and that all measurements have equal weight (we'll address weighted least squares later).
For each observation, the deviation from our proposed line is:
$$\delta y_i = y_i - (m x_i + b)$$

According to the principle of least squares, we want to minimize the sum of the squares of these deviations:

$$\sum (\delta y_i)^2 = \sum\left[y_i - (m x_i + b)\right]^2$$

Expanding this expression:

$$\sum (\delta y_i)^2 = \sum\left[y_i^2 + m^2 x_i^2 + b^2 - 2 m x_i y_i - 2 b y_i + 2 m b x_i\right]$$

Or more compactly:

$$M = \sum y_i^2 + m^2 \sum x_i^2 + N b^2 + 2 m b \sum x_i - 2 m \sum x_i y_i - 2 b \sum y_i$$

where $M$ represents the sum of squared deviations that we want to minimize.
To minimize $M$, we set its partial derivatives with respect to $m$ and $b$ equal to zero:

$$\frac{\partial M}{\partial m} = 0, \qquad \frac{\partial M}{\partial b} = 0$$

From the first condition:
$$2 m \sum x_i^2 + 2 b \sum x_i - 2 \sum x_i y_i = 0$$

From the second condition:

$$2 N b + 2 m \sum x_i - 2 \sum y_i = 0$$

Solving these equations simultaneously gives us:
$$m = \frac{N \sum x_i y_i - \sum x_i \sum y_i}{N \sum x_i^2 - \left(\sum x_i\right)^2}$$

$$b = \frac{\sum x_i^2 \sum y_i - \sum x_i \sum x_i y_i}{N \sum x_i^2 - \left(\sum x_i\right)^2}$$
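As a numerical illustration, here is a minimal Python sketch (NumPy assumed; the data values and the helper name `line_fit` are invented for the example) that evaluates these closed-form expressions and checks them against NumPy's polynomial fit:

```python
import numpy as np

def line_fit(x, y):
    """Unweighted least-squares slope and intercept from the closed-form sums."""
    N = len(x)
    D = N * np.sum(x**2) - np.sum(x)**2          # common denominator
    m = (N * np.sum(x * y) - np.sum(x) * np.sum(y)) / D
    b = (np.sum(x**2) * np.sum(y) - np.sum(x) * np.sum(x * y)) / D
    return m, b

x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
y = np.array([2.1, 3.9, 6.2, 7.8, 10.1])         # roughly y = 2x
m, b = line_fit(x, y)
print(m, b)
print(np.polyfit(x, y, 1))                        # should agree: [slope, intercept]
```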
Having determined the "best fit" line, we need to quantify the uncertainty in our calculated parameters. Since $m$ and $b$ are computed from measurements with uncertainty, we can calculate their standard deviations. For the standard deviation of the $y_i$ values about our fitted line, we use:
$$S_y = \sqrt{\frac{\sum (\delta y_i)^2}{N - 2}}$$

The standard deviations of the slope and intercept are then:
$$S_m = S_y \sqrt{\frac{N}{N \sum x_i^2 - \left(\sum x_i\right)^2}}$$

$$S_b = S_y \sqrt{\frac{\sum x_i^2}{N \sum x_i^2 - \left(\sum x_i\right)^2}}$$

These expressions provide statistical measures of uncertainty in our fitted parameters. When reporting results, we typically state values as $m \pm S_m$ and $b \pm S_b$, indicating that the true parameter has about a 68% probability of falling within one standard deviation of our estimate.
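Continuing the sketch above, the parameter uncertainties follow directly from these formulas (again a sketch under the same assumptions, reusing the `x`, `y`, `m`, and `b` from the previous block):

```python
def line_fit_errors(x, y, m, b):
    """Standard deviations of slope and intercept for an unweighted fit."""
    N = len(x)
    resid = y - (m * x + b)                       # deviations from the fitted line
    S_y = np.sqrt(np.sum(resid**2) / (N - 2))     # scatter about the line
    D = N * np.sum(x**2) - np.sum(x)**2
    S_m = S_y * np.sqrt(N / D)
    S_b = S_y * np.sqrt(np.sum(x**2) / D)
    return S_y, S_m, S_b

S_y, S_m, S_b = line_fit_errors(x, y, m, b)
print(f"m = {m:.3f} +/- {S_m:.3f}, b = {b:.3f} +/- {S_b:.3f}")
```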
## Weighted Least Squares

### Weighted Mean of Observations

If we have independently measured quantities $x_i$, each with a standard deviation $S_i$, the weighted mean is:
$$\bar{x} = \frac{\sum (x_i / S_i^2)}{\sum (1 / S_i^2)}$$

The standard deviation of this weighted mean is:
$$S^2 = \frac{\sum \left[(x_i - \bar{x})^2 / S_i^2\right]}{(N - 1) \sum (1 / S_i^2)}$$
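A minimal sketch of these two formulas (NumPy assumed; the measurement values and the helper name `weighted_mean` are invented for the example), combining measurements of the same quantity with different quoted uncertainties:

```python
import numpy as np

def weighted_mean(x, S):
    """Weighted mean and its standard deviation for measurements x with errors S."""
    w = 1.0 / S**2                                # weights from quoted uncertainties
    xbar = np.sum(w * x) / np.sum(w)
    N = len(x)
    S2 = np.sum(w * (x - xbar)**2) / ((N - 1) * np.sum(w))
    return xbar, np.sqrt(S2)

x = np.array([10.1, 9.8, 10.4, 10.0])
S = np.array([0.2, 0.1, 0.4, 0.2])                # smaller error -> larger weight
xbar, S_mean = weighted_mean(x, S)
print(f"{xbar:.3f} +/- {S_mean:.3f}")
```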
### Straight-Line Fitting with Weighted Least Squares

For observations with unequal precision, we modify our least-squares approach by assigning weights. If the $y$ values have varying precision but the $x$ values are considered exact, the equations for the slope and intercept become:

$$m = \frac{\sum w_i \sum w_i x_i y_i - \sum w_i x_i \sum w_i y_i}{\sum w_i \sum w_i x_i^2 - \left(\sum w_i x_i\right)^2}$$

$$b = \frac{\sum w_i y_i \sum w_i x_i^2 - \sum w_i x_i \sum w_i x_i y_i}{\sum w_i \sum w_i x_i^2 - \left(\sum w_i x_i\right)^2}$$

where $w_i$ represents the weight of each observation, calculated as:
$$w_i = \frac{1}{S_{yi}^2}$$

The weighted standard deviation about the best-fit line is:
$$S_y = \sqrt{\frac{\sum w_i \delta_i^2}{N - 2}}$$

And the standard deviations of the slope and intercept are:
$$S_m^2 = \frac{S_y^2}{W}$$

$$S_b^2 = S_y^2 \left(\frac{1}{\sum w_i} + \frac{\bar{x}^2}{W}\right)$$

where:
$$W = \sum w_i (x_i - \bar{x})^2$$

and $\bar{x}$ is the weighted mean of the $x$ values:
$$\bar{x} = \frac{\sum w_i x_i}{\sum w_i}$$
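Pulling the weighted-fit formulas together, here is a minimal sketch (NumPy assumed; the data and the helper name `weighted_line_fit` are invented). Note that this weighted $S_y$ is dimensionless, since each residual is scaled by its own uncertainty:

```python
import numpy as np

def weighted_line_fit(x, y, S_yi):
    """Weighted least-squares line fit: x exact, each y_i with standard deviation S_yi."""
    w = 1.0 / S_yi**2
    D = np.sum(w) * np.sum(w * x**2) - np.sum(w * x)**2
    m = (np.sum(w) * np.sum(w * x * y) - np.sum(w * x) * np.sum(w * y)) / D
    b = (np.sum(w * y) * np.sum(w * x**2) - np.sum(w * x) * np.sum(w * x * y)) / D

    # Uncertainties in the fitted parameters.
    N = len(x)
    resid = y - (m * x + b)
    S_y = np.sqrt(np.sum(w * resid**2) / (N - 2))   # weighted scatter about the line
    xbar = np.sum(w * x) / np.sum(w)                # weighted mean of the x values
    W = np.sum(w * (x - xbar)**2)
    S_m = S_y * np.sqrt(1.0 / W)
    S_b = S_y * np.sqrt(1.0 / np.sum(w) + xbar**2 / W)
    return m, b, S_m, S_b

x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
y = np.array([2.2, 3.9, 6.1, 8.2, 9.8])
S_yi = np.array([0.3, 0.1, 0.2, 0.4, 0.2])          # per-point uncertainties in y
m, b, S_m, S_b = weighted_line_fit(x, y, S_yi)
print(f"m = {m:.3f} +/- {S_m:.3f}, b = {b:.3f} +/- {S_b:.3f}")
```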