A Comparison of Robust Model Choice Criteria Within a Metalearning Study


Vidnerová P., Kalina J., GÜNEY Y.

3rd Workshop on Analytical Methods in Statistics, AMISTAT 2019, Liberec, Çek Cumhuriyeti, 16 - 19 Eylül 2019, cilt.329, ss.125-141 identifier

  • Yayın Türü: Bildiri / Tam Metin Bildiri
  • Cilt numarası: 329
  • Doi Numarası: 10.1007/978-3-030-48814-7_7
  • Basıldığı Şehir: Liberec
  • Basıldığı Ülke: Çek Cumhuriyeti
  • Sayfa Sayıları: ss.125-141
  • Anahtar Kelimeler: Automatic method selection, Linear regression, Robust Akaike criterion, Robust estimation
  • Ankara Üniversitesi Adresli: Evet

Özet

© 2020, Springer Nature Switzerland AG.The methodology of automatic method selection (metalearning) allows to recommend the most suitable method (e.g. algorithm or statistical estimator) from several alternatives for a given dataset, based on information learned over a training database of datasets. Practitioners have become accustomed to using metalearning in the context of regression modeling, which is useful in a variety of applications in different fields. Still, none of previous metalearning studies on regression targeted at regression complexity issues and the majority of available metalearning studies for regression considered the standard mean square error as the prediction error measure. In this paper, a metalearning study focused on comparing different method selection criteria for the regression task is presented. A prediction rule, recommending the best regression estimator (possibly robust), is constructed over 31 training datasets. These are publicly available datasets, in which the linear model was carefully examined to be suitable. The results with the highest classification accuracy are obtained if the choice of the best estimator is based on robust versions of Akaike information criterion, particularly the version derived from MM-estimators. The work also advocates an implicitly weighted robust prediction mean square error.