# Statistics

posted by .

Tuyns et al. (1977) carried out a case-control study of esophageal cancer in the region known as Ille-et-Vilaine in Brittany, France. The referring data set is oesoph_new.dta, and use logistic regression models to answer each of the following questions. For each question, carefully state the appropriate logistic regression model and relevant hypothesis, both in the contest of the problem and in terms of model parameters. Use both the Wald and likelihood ratio methods to carry out any hypothesis tests, and provide relevant estimated Odds Ratios (with 95% confidence intervals) where appropriate.

a. Investigate the relationship between alcohol consumption and incidence of esophageal cancer. Treat alcohol consumption as a dichotomous variable (> 80 g/day vs. < 80 g/day), ignoring age.

b. Investigate the relationship between alcohol consumption and incidence of oesophageal cancer, controlling for the potential confounding effects of age. Treat alcohol consumption as a dichotomous variable (> 80 g/day vs. < 80 g/day), and age as a dichotomous variable (25 to 54 years old or 55 to 75+ years old). Give your assessment of the extent of confounding by age using the models fit in (a) and (b).

c. Investigate the evidence of interaction between age and alcohol consumption in relation to incidence of esophageal cancer. Treat alcohol consumption and age as dichotomous variables as in (b).

d. Investigate the relationship between alcohol consumption and incidence of esophageal cancer. First, treat alcohol consumption as a categorical variable with four categories (0 to 39 g/day, 40 to 79 g/day, 80 to 119 g/day, and > 120 g/day), by using indicator variables for the various categories (select 0 to 39 g/day as the reference group); second, treat alcohol consumption as an ordered variable by appropriately coding the four categories of
2 consumption. Compare the two analyses and discuss whether an increasing trend in risk, as alcohol consumption increases, adequately fits the pattern of risks for the four categories.

• Statistics -

## Similar Questions

1. ### statistics

In a population-based cohort study, an entire community was interviewed regarding smoking habits and then followed for one year. All lung cancer deaths were ascertained and the following data were available: • Fifteen (15) lung cancer …
2. ### statistics

In a population-based cohort study, an entire community was interviewed regarding smoking habits and then followed for one year. All lung cancer deaths were ascertained and the following data were available: Fifteen (15) lung cancer …
3. ### Statistics

2. Any given data set consists of a set of numerical values. Please indicate by stating yes or no for each of the following statements whether or not it could be correct for any data set. (This question is not referring to the data …
4. ### statistics

Job Sat. INTR. EXTR. Benefits 5.2 5.5 6.8 1.4 5.1 5.5 5.5 5.4 5.8 5.2 4.6 6.2 5.5 5.3 5.7 2.3 3.2 4.7 5.6 4.5 5.2 5.5 5.5 5.4 5.1 5.2 4.6 6.2 5.8 5.3 5.7 2.3 5.3 4.7 5.6 4.5 5.9 5.4 5.6 5.4 3.7 6.2 5.5 6.2 5.5 5.2 4.6 6.2 5.8 5.3 5.7 …
5. ### statistics

Select the most appropriate study design for each of the following questions. (Note: All study design options may not be used and each design option can be used more than once. 1. A study is done to examine the association between …
6. ### STATISTICS

Researchers conducted a case-control study to identify risk factors for kidney cancer. They asked 50 cases and 50 controls about 100 different exposures and personal characteristics, and calculated the odds ratio for kidney cancer …
7. ### STATISTICS

Researchers conducted a case-control study to identify risk factors for kidney cancer. They asked 50 cases and 50 controls about 100 different exposures and personal characteristics, and calculated the odds ratio for kidney cancer …
8. ### Statistics

Please help! I know that E is incorrect. Researchers conducted a case-control study to identify risk factors for kidney cancer. They asked 50 cases and 50 controls about 100 different exposures and personal characteristics, and calculated …
9. ### Epidemiology

Evidence of an increased risk of lung cancer associated with cigarette smoking was sought by Doll and Hill. In one study, 649 lung cancer cases were matched by age, and gender to 649 controls; 647 of the cases and 622 of the control …
10. ### statistics

For a given set of data, it is known that x = 10 and y = 4. The gradient of the regression line y on x is 0.6. Find the equation of the regression line and use it to estimate y when x = 8.

More Similar Questions