TY - JOUR
T1 - Imputing missing time-dependent covariate values for the discrete time Cox model
AU - Murad, Havi
AU - Dankner, Rachel
AU - Berlin, Alla
AU - Olmer, Liraz
AU - Freedman, Laurence S.
N1 - Publisher Copyright:
© The Author(s) 2019.
PY - 2020/8/1
Y1 - 2020/8/1
N2 - We describe a procedure for imputing missing values of time-dependent covariates in a discrete time Cox model using the chained equations method. The procedure multiply imputes the missing values for each time-period in a time-sequential manner, using covariates from the current and previous time-periods as well as the survival outcome. The form of the outcome variable used in the imputation model depends on the functional form of the time-dependent covariate(s) and differs from the case of Cox regression with only baseline covariates. This time-sequential approach provides an approximation to a fully conditional approach. We illustrate the procedure with data on diabetics, evaluating the association of their glucose control with the risk of selected cancers. Using simulations we show that the suggested estimator performed well (in terms of bias and coverage) for completely missing at random, missing at random and moderate non-missing-at-random patterns. However, for very strong non-missing-at-random patterns, the estimator was seriously biased and the coverage was too low. The procedure can be implemented using multiple imputation with the Fully conditional Specification (FCS) method (MI procedure in SAS with FCS statement or similar packages in other software, e.g. MICE in R). For use with event times on a continuous scale, the events would need to be grouped into time-intervals.
AB - We describe a procedure for imputing missing values of time-dependent covariates in a discrete time Cox model using the chained equations method. The procedure multiply imputes the missing values for each time-period in a time-sequential manner, using covariates from the current and previous time-periods as well as the survival outcome. The form of the outcome variable used in the imputation model depends on the functional form of the time-dependent covariate(s) and differs from the case of Cox regression with only baseline covariates. This time-sequential approach provides an approximation to a fully conditional approach. We illustrate the procedure with data on diabetics, evaluating the association of their glucose control with the risk of selected cancers. Using simulations we show that the suggested estimator performed well (in terms of bias and coverage) for completely missing at random, missing at random and moderate non-missing-at-random patterns. However, for very strong non-missing-at-random patterns, the estimator was seriously biased and the coverage was too low. The procedure can be implemented using multiple imputation with the Fully conditional Specification (FCS) method (MI procedure in SAS with FCS statement or similar packages in other software, e.g. MICE in R). For use with event times on a continuous scale, the events would need to be grouped into time-intervals.
KW - MICE imputation
KW - Missing data
KW - fully conditional specification imputation
KW - incomplete covariate
KW - missing covariate
KW - multiple imputation
UR - http://www.scopus.com/inward/record.url?scp=85074817373&partnerID=8YFLogxK
U2 - 10.1177/0962280219881168
DO - 10.1177/0962280219881168
M3 - ???researchoutput.researchoutputtypes.contributiontojournal.article???
C2 - 31680633
AN - SCOPUS:85074817373
SN - 0962-2802
VL - 29
SP - 2074
EP - 2086
JO - Statistical Methods in Medical Research
JF - Statistical Methods in Medical Research
IS - 8
ER -