Analysis of multiple source/multiple informant data in Stata
We describe regression-based methods for analyzing multiple-source data arising from complex sample survey designs in Stata. We use the term multiple-source data to encompass all cases where data are simultaneously obtained from multiple informants, or raters (e.g., self-reports, family members, health care providers, administrators) or via different/parallel instruments, indicators or methods (e.g., symptom rating scales, standardized diagnostic interviews, or clinical diagnoses). We review regression models for analyzing multiple source risk factors or multiple source outcomes and show that they can be considered special cases of generalized linear models, albeit with correlated outcomes. We show how these methods can be extended to handle the common survey features of stratification, clustering, and sampling weights as well as missing reports, and how they can be fit within Stata. The methods are illustrated using data from the Stirling County Study, a longitudinal community study of psychopathology and mortality.
Year of publication: |
2005-07-12
|
---|---|
Authors: | Horton, Nicholas ; Fitzmaurice, Garrett |
Institutions: | Stata User Group |
Saved in:
freely available
Saved in favorites
Similar items by person
-
Fitting Generalized Estimating Equation (GEE) Regression Models in Stata
Horton, Nicholas, (2001)
-
Variance estimation in complex survey sampling for generalized linear models
Natarajan, Sundar, (2008)
-
Generalized linear models with a coarsened covariate
Lipsitz, Stuart, (2004)
- More ...