chi-squared tests

Consider a set of 10 measurements of leaf size: {x₁, x₂, ..., x₁₀}, where x₁ is the size of the first leaf, etc. According to some expert, leaf sizes are supposed to be "normally" distributed with mean µ and standard deviation σ. Knowing all these numbers, you could now calculate the quantity known as chi-square:

$$\chi^2 = \sum_{i=1}^{k} \frac{(x_i - \mu)^2}{\sigma^2}$$

where in this case there are 10 x values, so k=10. (This formula says: find how each x deviates from the mean µ, square each difference, add up all the squared-differences and divide by the standard deviation squared.) More general versions of this formula would allow different means and standard deviations for each measurement.
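
To make that last remark concrete, here is a sketch of the more general form, writing µ_i and σ_i (notation assumed here for illustration) for the mean and standard deviation that apply to the i-th measurement:

$$\chi^2 = \sum_{i=1}^{k} \frac{(x_i - \mu_i)^2}{\sigma_i^2}$$

Setting every µ_i equal to µ and every σ_i equal to σ recovers the simpler formula for the leaf measurements.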

Roughly speaking, we expect each measurement to deviate from the mean by about one standard deviation, so |x_i − µ| is about the same size as σ. Thus in calculating chi-square we'd end up adding up 10 numbers that would each be near 1. More generally, we expect χ² to approximately equal k, the number of data points. If chi-square is "a lot" bigger than expected, something is wrong. Thus one purpose of chi-square is to compare observed results with expected results and see whether the observed result is likely.
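
As a minimal sketch of this calculation, the Python below computes chi-square for ten leaf sizes and prints it next to k. The measurements and the values µ = 5.0 and σ = 0.5 are invented for illustration (they are not data from this page), and the function name chi_square is likewise just a label chosen here.

```python
def chi_square(xs, mu, sigma):
    """Sum the squared deviations from mu, then divide by sigma squared."""
    return sum((x - mu) ** 2 for x in xs) / sigma ** 2


# Hypothetical leaf sizes in cm; invented for illustration only.
leaf_sizes = [4.6, 5.3, 5.1, 4.2, 5.8, 4.9, 5.5, 4.4, 5.0, 5.6]
mu, sigma = 5.0, 0.5  # assumed "expert" mean and standard deviation

k = len(leaf_sizes)
chi2 = chi_square(leaf_sizes, mu, sigma)
print(f"k = {k}, chi-square = {chi2:.2f}")
```

For these made-up numbers chi-square comes out very close to k = 10, which is what we'd expect if the expert's µ and σ describe the leaves well.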

X²: a version of χ² to test an expected distribution

In biology the most common application for chi-squared is in comparing observed counts of particular cases to the expected counts. For example, the willow tree (Salix) is dioecious; that is, like people (and unlike most plants), a willow tree will have just male or just female sex organs. One might expect that half of the willows are male and half female. If you examine N willow trees and count that x₁ of them are male and x₂ of them are female, you will probably not find that exactly x₁=½N and x₂=½N. Is the difference significant enough to rule out the 50/50 hypothesis? We could almost calculate the chi-squared, but we don't know the standard deviation for each count. Never fear: most counts are distributed according to the Poisson distribution, and as such the standard deviation equals the square root of the expected count. Writing E_i for the expected count of case i and replacing σ² with E_i, we can calculate X²:

$$X^2 = \sum_{i=1}^{k} \frac{(x_i - E_i)^2}{E_i}$$

In our simple willow example there are just two cases, so k=2, and the expected results are E₁=½N and E₂=½N. Note that the E_i are generally not whole numbers even though the counts x_i must be whole numbers. If there were more cases (say k cases), we would need to know the probability p_i for each case, and then we could calculate each E_i = p_i N, where N is determined by finding the total of the counts:

$$N = \sum_{i=1}^{k} x_i$$

Finally it should be noted that the technical differences between a Poisson distribution and a normal distribution cause problems for small E_i. As a rule of thumb, avoid using X² if any E_i is less than 5. If k is large this technical difficulty is mitigated.
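
The whole recipe for counts can be sketched in a few lines of Python: total the counts to get N, form each E_i = p_i N, add up (x_i − E_i)²/E_i, and check the rule of thumb about small E_i. The function name x_squared, its min_expected parameter, and both sets of willow counts are assumptions made up for this illustration, not part of the original page.

```python
def x_squared(counts, probs, min_expected=5):
    """Return (X^2, expected counts, rule_ok) for a list of observed counts.

    rule_ok is False when any expected count falls below min_expected.
    """
    n = sum(counts)                      # N: the total of the counts
    expected = [p * n for p in probs]    # E_i = p_i * N
    stat = sum((x - e) ** 2 / e for x, e in zip(counts, expected))
    rule_ok = all(e >= min_expected for e in expected)
    return stat, expected, rule_ok


# Hypothetical survey of 100 willows: 57 male, 43 female (invented numbers).
stat, expected, rule_ok = x_squared([57, 43], [0.5, 0.5])
print(f"E = {expected}, X^2 = {stat:.2f}, rule of thumb satisfied: {rule_ok}")

# A much smaller hypothetical sample, where each E_i is only 4.
small_stat, small_expected, small_ok = x_squared([3, 5], [0.5, 0.5])
print(f"E = {small_expected}, X^2 = {small_stat:.2f}, rule of thumb satisfied: {small_ok}")
```

With these made-up counts, the 100-tree sample gives X² = 1.96 with both expected counts equal to 50, while the 8-tree sample is flagged because its expected counts fall below 5.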
