Sailor_man, I sent you the spreadsheet I use. Let me know if you find it of any use. As you have noted, the formula to calculate the TOT multiplier reduces to the simple form when the constants (such as "scratch boat rating") are set and combined. The PHRF-LO formula form is identical to what is shown on the USSailing website. The original formula is given in the report so that guys like you and me can remember 15 or 20 years down the road where the numbers actually came from.
I totally agree that other factors cause the majority of variability in the performance data. Personally I think rating adjustments of less than 6 sec./mile are a waste of time since I'm sure that there is an inherent inaccuracy in our predictions of at least that amount. That being said however I still think the system works very well.
I totally agree that other factors cause the majority of variability in the performance data. Personally I think rating adjustments of less than 6 sec./mile are a waste of time since I'm sure that there is an inherent inaccuracy in our predictions of at least that amount. That being said however I still think the system works very well.
Steve,
Thanks for the spreadsheet. I've taken it and input races from my local summer series for this year. I saw that you had input races from your club, which helped get me going in the right direction.
I agree that +/- 6 sec per mile is probably about as good as one can expect for accuracy of TOD ratings. That type of variation has some impact on results but not much at the end of the day. Other stuff, such as windshifts, new sails, crew skill, etc., is more important and worth far more than +/- 6 sec per mi. In the same vein, I also think that the choice of a Q Factor probably has less impact than +/- 6 sec per mile and so although it clearly has some impact on results one shouldn't make it more important than it is. I find the Q Factor analysis interesting as an intellectual matter as much as anything.
Q Factor Analysis of Local Fleets
As I said, to get a handle on the Q Factor analysis, I input the races from our local summer series. I input three different classes sailing in 18 races for a total of 53 races (one class had no boats finish one race). The results were interesting. What I found was:
1. Of the 53 races I entered, it was impossible to calculate a Q factor for 10 of the races, almost 20%. In all 10 cases, positive infinity wouldn't compensate for the "bias" toward higher-rated (i.e., slower) boats. [I used 99,999 instead of infinity in my calculations.]
That a "Q Factor analysis" wouldn't work for almost 20% of the races I looked at raises a troubling question as to whether it is a valid method of analyzing bias in TOD ratings. As a check I did a similar type of analysis on the TOD ratings directly - plotting TOD corrected time vs. TOD rating and adjusting a "fudge" factor until the slope of the linear regression line was zero (I used the formula CT = ET - (Fudge * TOD * DIST). Using that method I was able to analyze all 53 races - in other words, whereas I could not come up with an "optimum" Q factor for 10 of the 53 races I was able to come up with an optimum TOD "fudge" factor for all 53 races. That suggests to me that the TOD "fudge" factor analysis is more valid than the Q factor analysis, even if in the end you come up with the same conclusion of bias toward lower-rated boats.
2. One of the classes (PH-B) that I analyzed has most of the talent at the faster end of the class. That class did indeed show a heavy bias toward lower-rated (i.e., faster) boats, as you would expect with most of the better-sailed boats at the lower end of the rating band. The optimum Q factor for that class for the 2010 season was -0.0246 average, -0.0523 median. The rating band was 36 sec/mi and the class averaged 12 boats per race.
3. The other two classes (PH-C and PH-D) have the talent spread fairly equally across the rating band (42 and 64 sec/mi, respectively; averaging 14 and 7 boats per race). Both classes showed a marked bias toward the higher-rated (i.e., slower) boats, which presumably shows that talent is weighted slightly toward the slower end of those classes even though it might not appear that way on inspection. For PH-C the optimum Q was 38,889 average and 1.258 median; for PH-D the optimum Q was 17,649 average and 0.457 median.
4. Combining all 53 races, I came up with an overall optimum Q of 18,869 average and 0.180 median. That translates into a fairly significant overall bias in favor of slower (i.e., higher-rated) boats (a Q factor showing no "bias" would be close to 0.09).
5. For the record, the "average" Q factors don't have much meaning and I include them only to show that the distribution is nothing like a "normal" distribution where the average and median are fairly close together. I did some histograms of my results for all 53 races, as shown below:

One thing to note is that my third histogram (showing the results of just 40 of the 53 races I analyzed) looks somewhat similar to the histogram from the November 2008 Report of the PHRF-LO Technical Advisory Committee, the so-called Time on Time Study (posted on the PHRF-LO website at http://www.phrf-lo.org/images/Documents/tot_q_study.pdf), in that it shows a long "tail" to the right:

PRELIMINARY CONCLUSIONS
It is a bit early yet but it seems to me that the Q Factor analysis is extremely sensitive to the makeup of the classes/races being analyzed. The classes I looked at had rating bands of 36, 42 and 84 sec/mi for an average of 54 sec/mi. That is very similar to the 60 sec/mi average rating band found in the 2008 Time on Time Study. So, from that perspective the classes I looked at seem to be the same kind as those from the Study.
In the class where all the talent was obviously concentrated at the lower end of the rating band, the Q Factor analysis showed bias for the lower-rated boats. That hardly seems surprising and may have nothing to do with "rating bias." I haven't yet thought of a way to try and separate the "rating bias" from the talent factor. The other two classes showed a bias toward the higher-rated boats, which is the opposite of what the PHRF-LO Time on Time Study showed. However, that may simply be a reflection of the fact that although the talent seems equally spread across the rating bands for those classes it actually is concentrated slightly with the higher-rated boats.
Overall, I am not yet convinced that this so-called "rating bias" is real. It very well may be. I have heard about it for years and have always kind of assumed it was real but now I am not so sure. In any evernt, the Q Factor analysis doesn't seem to be a very good way to prove that such bias actually exists as it seems to be incredibly sensitive to the makeup of the classes being analyzed. I analyzed as a single "race" only those boats that started together because for our evening races the wind does die and that could have a significant impact on the analysis. I wonder if there might be other systematic errors that I should be considering.
I think I might put in at least one more year of results and see if the Q Factor analysis shows something different.
sailor_man
[Update - After initially posting this I rechecked all of my calculations and I found that there was an optimum Q Factor for the case where I had found no solution using negative infinity. I have updated my histogram and the wording above to reflect that correction.]









