Data Sets
Mack, et al. (Breslow-Day)
BresMack.text : is the documentation to Appendix III of Breslow and Day (1980) that presents data from the matched case-control study of endometrial cancer described in Mack et al. (1976). This file was downloaded in the fall of 1999 from Norman Breslow's web site at the Department of Biostatistics of the University of Washington. It describes the variables and corrections to the descriptions in Breslow and Day.
BresMack.txt : is the data set downloaded from Norman Breslow's web site. The programs described in Chapters 5 and 7 use this data set.
Hosmer-Lemeshow Low Birthweight Data
HosLem.sas : generates the data set used as the basis for the analyses in the text, Tables 7.8 and 7.9. The original source of the data was a table in the SAS Technical Report P-229, p. 465-6. The data were scanned and used in this program to generate the data set HLData.dat that is used in the program HLBwt.sas for the analyses shown in Chapter 7. Because the data were scanned from a secondary source, the data and analyses may differ from those shown by Hosmer and Lemeshow.
DCCT Hypoglycemia
dccthypo.txt : is a flat (text) file used in Examples 8.1-8.3. This is used as the basis for the data presented in Table 8.1, see the program Table81.sas in Chapter 8.
Fleming-Harrington CGD Data
FH-CGD.txt . Fleming and Harrington (1991) present the data from a clinical trial of gamma interferon versus placebo in the treatment of children with chronic granulamatous disease (CGD) to reduce the incidence of recurrent pyogenic infections. The data set includes multiple records for each subject to record the time of each successive infection or the date of right censoring. The variables include
|
  |
|
Note: FHcnt.txt is a data set created by FHcnt.sas that has one record per subject contianing the additonal variables nevents: number of severe infections experienced, and futime: the number of days of follow-up. This is used for analyses using Poisson regression in Chapter 8.
Lagakos Squamous Cell Carcinoma
Lagakos.sas reads the data from Lagakos (1978) and creates a SAS data set that is used for the analyses in Chapter 9. This job should be run on your platform to create the SAS data set. The data set was originally used by Lagakos to describe an approach to the analysis of competing risks, there being two modes or causes of failure (spread of disease) - metastatic versus not. For the analyses herein, however, a single outcome is employed - spread of disease of any cause.
DCCT Nephropathy (Microalbuminuria) Data
nephdata.txt contains data related to the onset of microalbuminuria in the DCCT. These data are used for simple survival analyses as presented in Example 9.2. The data set, however, contains additional variables that could be used for supplemental exercises. See DCCTneph.sas. The variables in the data set are
|
  |
|
DCCT Hypoglycemia Recurrent Event Data
Due to their size, the four data sets in this section are provided as a single SAS export file. You can download this file as an uncompressed file (17.6 Mbytes), as a gzip-compressed file (952 Kbytes), or as a zip-compressed file (938 Kbytes). Please run the program impthypo.sas to generate the following SAS data sets on your platform.
Dataset hyevents
Contains one record per hypoglycemia event for each subject. The variables are
|
  |
|
Dataset hytimes
This data set contains a single observation with six sets of array variables:
|
  |
|
Dataset hypomimi
Contains DCCT intensive group recurrent hypoglycemia event observations with time dependent covariate data as described in Example 9.12. Each observation is defined in terms of start and stop times, the associated time dependent covariate (mhba) and the number of events at the stop time, if any. The covariates in the data set are
|
  |
|
Dataset hypomimc
Contains DCCT conventional group recurrent hypoglycemia event observations with time dependent covariate data as described in Example 9.12. Each observation is defined in terms of start and stop times, the associated time dependent covariate (mhba) and the number of events at the stop time, if any. The variables in the data set are the same as those described above.
Veterans Administration Cooperative Urological Research Group
VACURG85.txt presents the data from the VACURG study of prostate cancer described by Byar in the book edited by Andrews and Herzberg (1985) which gives the variable descriptions. These data have been used by many, including Thall and Lachin (1986). The data are also available from StatLib in a slightly different format as Table46.dat of the file Andrews. The variables included are
|
|