Dept of Maths and Computing Datasets for Statistical Analysis USQ Homepage

Datasets for Statistical Analysis: Errors in Hand et al.

Errors in Hand et al.

  • Dataset 271 (Windmill data): The original paper shows the variables in the reverse order; that is, the response variable is the DC output, and the explanatory variable is the wind velocity. This makes more sense than the definitions in Hand et al.
  • Dataset 302 (Skin Cancers): The age group 74 to 84 appears to be missing. I haven't chased up the original sources to explore this further.
  • Dataset 420 (Seed germination): The orignal paper has the two types of seed in reverse order (that is, the left-side table refers to O. aegyptiaca 75 and the right-side table to O. aegyptiaca 73). Perhaps the original paper is in error?
  • Page 443 (the index): The entry for Dogs has been misaligned; rather than appearing as a major heading, it appears as a sub-heading to Doctors (!).
  • Dataset 475 (multivariate analysis books): The source of the data is quoted as Gifi (1920), who gives the contents of multivariate statistics books, all of which are published after 1957!
And now some silly ones I have noted (I probably have made more on this very Web page):
  • A spelling error on p348 ("pateints" rather than "patients")
  • On p289, the dataset number is given as 254 rather than 354.


Constructed by Peter Dunn
Last change: 19 June 2002