Modified fuzzy regression functions with a noise cluster against outlier contamination


Chakravarty S., Demirhan H., BAŞER F.

EXPERT SYSTEMS WITH APPLICATIONS, cilt.205, 2022 (SCI-Expanded) identifier identifier

  • Yayın Türü: Makale / Tam Makale
  • Cilt numarası: 205
  • Basım Tarihi: 2022
  • Doi Numarası: 10.1016/j.eswa.2022.117717
  • Dergi Adı: EXPERT SYSTEMS WITH APPLICATIONS
  • Derginin Tarandığı İndeksler: Science Citation Index Expanded (SCI-EXPANDED), Scopus, Academic Search Premier, PASCAL, Aerospace Database, Applied Science & Technology Source, Communication Abstracts, Computer & Applied Sciences, INSPEC, Metadex, Public Affairs Index, Civil Engineering Abstracts
  • Anahtar Kelimeler: Artificial neural networks, Noise cluster, Outlier, Possibilistic clustering, Robustness, Support vector machines, C-MEANS, ALGORITHM
  • Ankara Üniversitesi Adresli: Evet

Özet

Outliers significantly impact the reliability of the analyses informing the expert systems. Depending on the context in which data is collected, the traditional approaches of eliminating such outliers altogether or replacing them with pre-defined values could lead to the loss of genuine information. Machine learning techniques need to be tuned up to give suitable weights to unusual observations such as outliers. Robust methods are needed to minimise the influence of outliers on the reliability of such methods. In this study, we develop a fuzzy regression functions approach by the use of a noise cluster within fuzzy clustering algorithms that are potentially more robust against outlier contamination than the classical fuzzy c-means algorithm. The accuracy of the proposed approach is compared with those of traditional techniques such as artificial neural networks utilising various training algorithms, support vector machines using various kernel functions, and Fuzzy Regression Functions with a noise cluster and the fuzzy c-means algorithm. In total, 91 distinct implementations have been applied against 30 benchmark datasets with outliers present in both the independent and dependent features. The relative performances of these implementations are examined and the robustness of the proposed modified fuzzy regression functions framework is found superior to the considered alternatives.