The choice in averaging method also affects intermediate calculations in the Green/Red contribution to the score. For example, the Red part (which is the number of links that include at least one stop) may be calculated with either formula
- Code: Select all
if(Stops>0,1,0)
or
- Code: Select all
min(Stops,1)
and with a separate column to accumulate the results along the artery.
But these two formulas have different behavior under Method B averaging. With Method B, the number of stops along a link, averaged over multiple runs, might be a fractional value. The first formula would always return 1 for any fraction (greater than 0, of course), while the second formula would return the actual fraction.
For example, suppose there are two travel time runs, one with 0 stops along a segment, the other with 1 stop. The average # of stops in this case is 0.5, and the first formula returns a Red number of 1, while the second formula returns 0.5. In either case, this value then affects the Green/Red ratio and then the corresponding scores.
So if we use Method B for averaging, then we must also choose between these two, now distinguishable, types of formulas for the Green/Red calculation. I'm leaning toward the second formula using the same argument as before: that it allows the CSPI to be the more sensitive because it postpones discarding information until a later stage in the calculation. Someone else may have an argument for preferring the first formula, or someone may even use this as a reason to prefer the Method A style of averaging. Comments most welcome.
Greg