A computationally efficient sequential regression imputation algorithm for multilevel data


Akkaya Hocagil T., Yucel R. M.

Journal of Applied Statistics, cilt.51, sa.11, ss.2258-2278, 2024 (SCI-Expanded) identifier

  • Yayın Türü: Makale / Tam Makale
  • Cilt numarası: 51 Sayı: 11
  • Basım Tarihi: 2024
  • Doi Numarası: 10.1080/02664763.2023.2277669
  • Dergi Adı: Journal of Applied Statistics
  • Derginin Tarandığı İndeksler: Science Citation Index Expanded (SCI-EXPANDED), Scopus, Academic Search Premier, ABI/INFORM, Aerospace Database, Applied Science & Technology Source, Business Source Elite, Business Source Premier, CAB Abstracts, Computer & Applied Sciences, Veterinary Science Database, zbMATH
  • Sayfa Sayıları: ss.2258-2278
  • Anahtar Kelimeler: computational efficiency, fast variable by variable imputation, multilevel data, multiple imputation by chained equations, Sequential regression imputation
  • Ankara Üniversitesi Adresli: Hayır

Özet

Due to the computational burden, especially in high-dimensional settings, sequential imputation may not be practical. In this paper, we adopt computationally advantageous methods by sampling the missing data from their perspective predictive distributions, which leads to significantly improved computation time in the class of variable-by-variable imputation algorithms. We assess the computational performance in a comprehensive simulation study. We then compare and contrast the performance of our algorithm with commonly used alternatives. The results show that our method has a significant advantage over the commonly used alternatives with respect to computational efficiency and inferential quality. Finally, we demonstrate our methods in a substantive problem aimed at investigating the effects of area-level behavioral, socioeconomic, and demographic characteristics on poor birth outcomes in New York State among singleton births.