Creative Commons License Copyright © Michael Richmond. This work is licensed under a Creative Commons License.

How many free parameters are justified?

You will occasionally need to fit a model of some sort to some measurements. Models with more parameters will give a better fit, but is it kosher to keep adding more and more parameters? When is the fit "good enough" that adding more parameters isn't really justified?

There are no exact answers to these questions, but this little guide may provide some guidance.


An example: firing an electron into a cloud chamber

Joe has an electron gun and a cloud chamber. When he fires an electron into the cloud chamber, the electron ionizes air molecules, causing water vapor to condense into little droplets. The droplets trace out the path of the electron, so that Joe can measure its position at any time.


The curved path in this picture shows the track of a positron; see the full story from LBL .

Joe uses a high-speed camera to measure the track of an electron through his chamber. He wonders -- is there a signficant electric or magnetic field inside the chamber? Here are his measurements; note the uncertainty associated with each position.


# motion of an electron in a region which
#    may or may not have a significant electric field
# time(microsec)   position(cm)    pos_uncert(cm)
   0.0                 0               0.5
   1.0               12.3              0.5
   2.0               21.1              0.6
   3.0               31.9              0.5
   4.0               40.6              0.7
   5.0               48.7              0.9

If there are no fields inside the chamber, the electron should move with constant velocity. But if there is an electric or magnetic field, the electron's velocity may change. Joe plots the position versus time to look for curvature.

"Hmmm," thinks Joe, "that looks like a pretty straight line. Let me make a linear fit, using a model with 2 parameters, a and b, like this:"

Joe finds parameters (you can read how Joe used Gnuplot to perform the fit if you wish)


           a  =  1.54
           b  =  9.69

"Well, that's not bad," says Joe, "but it looks as if there might be some curvature there. Maybe I should try a quadratic fit with 3 parameters, like this:"

Joe finds parameters (you can read how Joe used Gnuplot to perform the fit if you wish)


           c  =   0.26
           d  =  11.60 
           e  =  -0.382

"Yes, that quadratic model is definitely better, but is it really appropriate to add one extra parameter?" wonders Joe.


There are several ways to answer that question. I'll mention two.


This page maintained by Michael Richmond. Last modified Apr 8, 2009.

Creative Commons License Copyright © Michael Richmond. This work is licensed under a Creative Commons License.