Datasets for Statistical Analysis Home Techniques Index Application Index

Datasets for Statistical Analysis

Back to Datasets home

Smoking and respiratory function

Access full data file (fev.dat) | Access data file subset (fevsub.dat)
Keywords:
lung capacity
Categories:
regression;
health
Description:
The data give information on the health and smoking habits of a sample of 654 youths, aged 3 to 19, in the area of East Boston during middle to late 1970s.

In the full data set, there are 654 observations on 5 variables

The reduced data set contains the 65 observations for smokers only (and hence only 4 variables as Smoke is unnecessary)

Variables:
Age The age of the subject in completed years
FEV The forced expiratory volume, a measure of lung capacity, in litres
Ht Height (in inches)
Gender The gender of the subject: Females coded as 0, males as 1
Smoke The smoking status of the subject: 0 means a non-smoker; 1 means a smoker
Data Quality:
There are no missing values.
Source:
Kahn, Michael (2005). An Exhalent Problem for Teaching Statistics. The Journal of Statistical Education, 13(2). Available on-line
Notes:
References:
Kahn,M. (2003). Data Sleuth, STATS, 37, 24.