Posted by Lyn on Saturday, April 16, 2011 at 12:22am.
Tuyns et al. (1977) carried out a casecontrol study of esophageal cancer in the region known as IlleetVilaine in Brittany, France. The referring data set is oesoph_new.dta, and use logistic regression models to answer each of the following questions. For each question, carefully state the appropriate logistic regression model and relevant hypothesis, both in the contest of the problem and in terms of model parameters. Use both the Wald and likelihood ratio methods to carry out any hypothesis tests, and provide relevant estimated Odds Ratios (with 95% confidence intervals) where appropriate.
a. Investigate the relationship between alcohol consumption and incidence of esophageal cancer. Treat alcohol consumption as a dichotomous variable (> 80 g/day vs. < 80 g/day), ignoring age.
b. Investigate the relationship between alcohol consumption and incidence of oesophageal cancer, controlling for the potential confounding effects of age. Treat alcohol consumption as a dichotomous variable (> 80 g/day vs. < 80 g/day), and age as a dichotomous variable (25 to 54 years old or 55 to 75+ years old). Give your assessment of the extent of confounding by age using the models fit in (a) and (b).
c. Investigate the evidence of interaction between age and alcohol consumption in relation to incidence of esophageal cancer. Treat alcohol consumption and age as dichotomous variables as in (b).
d. Investigate the relationship between alcohol consumption and incidence of esophageal cancer. First, treat alcohol consumption as a categorical variable with four categories (0 to 39 g/day, 40 to 79 g/day, 80 to 119 g/day, and > 120 g/day), by using indicator variables for the various categories (select 0 to 39 g/day as the reference group); second, treat alcohol consumption as an ordered variable by appropriately coding the four categories of
2 consumption. Compare the two analyses and discuss whether an increasing trend in risk, as alcohol consumption increases, adequately fits the pattern of risks for the four categories.

Statistics  PsyDAG, Saturday, April 16, 2011 at 11:11am
Cannot find your data.