Skip to main content

Citation

Weiskopf, Nicole G.; Dorr, David A.; Jackson, Christie; Lehmann, Harold P.; & Thompson, Caroline A. (2023). Healthcare Utilization is a Collider: An Introduction to Collider Bias in Ehr Data Reuse. Journal of the American Medical Informatics Association, 30(5), 971-977. PMCID: PMC10114115

Abstract

OBJECTIVES: Collider bias is a common threat to internal validity in clinical research but is rarely mentioned in informatics education or literature. Conditioning on a collider, which is a variable that is the shared causal descendant of an exposure and outcome, may result in spurious associations between the exposure and outcome. Our objective is to introduce readers to collider bias and its corollaries in the retrospective analysis of electronic health record (EHR) data.
TARGET AUDIENCE: Collider bias is likely to arise in the reuse of EHR data, due to data-generating mechanisms and the nature of healthcare access and utilization in the United States. Therefore, this tutorial is aimed at informaticians and other EHR data consumers without a background in epidemiological methods or causal inference.
SCOPE: We focus specifically on problems that may arise from conditioning on forms of healthcare utilization, a common collider that is an implicit selection criterion when one reuses EHR data. Directed acyclic graphs (DAGs) are introduced as a tool for identifying potential sources of bias during study design and planning. References for additional resources on causal inference and DAG construction are provided.

URL

http://dx.doi.org/10.1093/jamia/ocad013

Reference Type

Journal Article

Year Published

2023

Journal Title

Journal of the American Medical Informatics Association

Author(s)

Weiskopf, Nicole G.
Dorr, David A.
Jackson, Christie
Lehmann, Harold P.
Thompson, Caroline A.

Article Type

Regular

PMCID

PMC10114115

Continent/Country

United States

State

Nonspecific

ORCiD

Thompson, C. - 0000-0001-9990-9756