WAPP+: Parameter Errors -- details

WAPP⁺: Parameter Errors -- details

As discussed elsewhere, in simple cases (and approximately in many other cases) χ² is a simple quadratic form in the parameters, for example:

= a A² + b AB + c B² + d A + e B + f

so the constant χ² curves are ellipses. Parameter error determination depends on understanding the region where χ² is near its minimum value, for example:

χ²(a,b) ≤ χ²_min + 1

In particular we need to find the maximum reach of these ellipses, for example δA and δB shown below:

If we shift the origin for the parameters a and b to be at the center of the ellipse, using:

Δb = B - B_center

the quadratic form can be simply expressed as a matrix equation:

We then generalize that a bit to include N re-origined parameters α_i put together into a column vector.

The symmetric matrix M is called the curvature matrix. In those cases where χ² is a simple quadratic form, this result is exact. More generally we can view this equation as the first non-zero term in a Taylor expansion of Δχ² at the minimum. M is then closely related to the matrix of second partial derivatives of χ² (which is known as the Hessian). We will soon have use for the inverse of the matrix M, which is known as the covariance matrix.

The points we seek (the extreme reach of these ellipses) may be identified by gradient of χ² (wrt the parameters α_i): at the extreme points the gradient points directly along the corresponding coordinate axis as shown below:

Note that because all the ellipses are self-similar, the extreme points all lie along a ray (shown in dotted red above).

The gradient of χ² is:

so the requirement that the gradient points directly along the p^th coordinate axis amounts to the equation:

where e_p is the unit vector in the p^th coordinate direction (e.g., e₁=(1,0,...,0), e₂=(0,1,0,...,0), etc.) and λ is the length of the vector. We can find the locations where the gradient is along the p^th coordinate direction, by inverting the above equation:

the result is a ray of locations parametrized by λ with larger λ corresponding to larger values of the associated Δχ² ellipse. If we substitute the equation for this ray into the equation for Δχ² we can associate the value of λ with the corresponding value of Δχ²:

and

As noted above, generally we are interested in Δχ²=1, hence the mantra: the error in a parameter is the square root of the corresponding diagonal element of the covariance matrix. However I reproduce below a table from Numerical Recipes which at least shows that sometimes other values of Δχ² may be of interest.