Evaluating Left-Censored Data Through Substitution, Parametric, Semi-parametric, and Nonparametric Methods: A Simulation Study

Tekindal, Mustafa; DOĞANAY ERDOĞAN, BEYZA; YAVUZ, YASEMİN

doi:10.1007/s12539-015-0132-9

Evaluating Left-Censored Data Through Substitution, Parametric, Semi-parametric, and Nonparametric Methods: A Simulation Study

Tekindal M. A., DOĞANAY ERDOĞAN B., YAVUZ Y.

INTERDISCIPLINARY SCIENCES-COMPUTATIONAL LIFE SCIENCES, cilt.9, sa.2, ss.153-172, 2017 (SCI-Expanded, Scopus)

Yayın Türü: Makale / Tam Makale
Cilt numarası: 9 Sayı: 2
Basım Tarihi: 2017
Doi Numarası: 10.1007/s12539-015-0132-9
Dergi Adı: INTERDISCIPLINARY SCIENCES-COMPUTATIONAL LIFE SCIENCES
Derginin Tarandığı İndeksler: Science Citation Index Expanded (SCI-EXPANDED), Scopus
Sayfa Sayıları: ss.153-172
Anahtar Kelimeler: Left-censor, Kaplan-Meier estimator, Measurement uncertainty, Regression on order statistics, Limit of detection, Limit of quantification, WATER-QUALITY DATA, DETECTION LIMITS, STATISTICS, EXPOSURE
Ankara Üniversitesi Adresli: Evet

Özet

In this study, an attempt was made to determine the degrees of bias in particular sampling sizes and methods. The aim of the study was to determine deviations from the median, the mean, and the standard deviation (SD) in different sample sizes and at different censoring rates for log-normal, exponential, and Weibull distributions in the case of full and censored data sampling. Thus, the concept of "censoring'' and censoring types was handled in the first place. Then substitution, parametric (MLE), nonparametric (KM), and semi-parametric (ROS) methods were introduced for the evaluation of left-censored observations. Within the scope of the present study, the data were produced uncensored based on the different parameters of each distribution. Then the datasets were left-censored at the ratios of 5, 25, 45, and 65 %. The censored data were estimated through substitution (LOD and LOD/root 2), parametric (MLE), semiparametric (ROS), and nonparametric (KM) methods. In addition, evaluation was made by increasing the sample size from 20 to 300 by tens. Performance comparison was made between the uncensored dataset and the censored dataset on the basis of deviations from the median, the mean, and the SD. The results of simulation studies show that LOD/root 2 and ROS methods give better results than other methods in deviation from the mean in different sample sizes and at different censoring rates, while ROS gives better results than other methods in deviation from the median in almost all sample sizes and at almost all censoring rates.