Rationale, aims, and objectives: Missing data represent a challenge in longitudinal studies. The aim of the study is to compare the performance of the multivariate normal imputation and the fully conditional specification methods, using real data set with missing data partially completed 2 years later. Method: The data used came from an ongoing randomized controlled trial with 5-year follow-up. At a certain time, we observed a number of patients with missing data and a number of patients whose data were unobserved because they were not yet eligible for a given follow-up. Both unobserved and missing data were imputed. The imputed unobserved data were compared with the corresponding real information obtained 2 years later. Results: Both imputation methods showed similar performance on the accuracy measures and produced minimally biased estimates. Conclusion: Despite the large number of repeated measures with intermittent missing data and the non-normal multivariate distribution of data, both methods performed well and was not possible to determine which was better.
Missing data in longitudinal studies: Comparison of multiple imputation methods in a real clinical setting
Rosato R.
First
;Pagano E.;Testa S.;Zola P.;di Cuonzo D.Last
2021-01-01
Abstract
Rationale, aims, and objectives: Missing data represent a challenge in longitudinal studies. The aim of the study is to compare the performance of the multivariate normal imputation and the fully conditional specification methods, using real data set with missing data partially completed 2 years later. Method: The data used came from an ongoing randomized controlled trial with 5-year follow-up. At a certain time, we observed a number of patients with missing data and a number of patients whose data were unobserved because they were not yet eligible for a given follow-up. Both unobserved and missing data were imputed. The imputed unobserved data were compared with the corresponding real information obtained 2 years later. Results: Both imputation methods showed similar performance on the accuracy measures and produced minimally biased estimates. Conclusion: Despite the large number of repeated measures with intermittent missing data and the non-normal multivariate distribution of data, both methods performed well and was not possible to determine which was better.File | Dimensione | Formato | |
---|---|---|---|
PREPRINT_ROSATO.pdf
Open Access dal 05/02/2021
Tipo di file:
PREPRINT (PRIMA BOZZA)
Dimensione
572.43 kB
Formato
Adobe PDF
|
572.43 kB | Adobe PDF | Visualizza/Apri |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.