Prediction of vegetation indices from downsampled hyperspectral data using machine learning: A novel framework for olive crop monitoring


Estrada J., ÇETİN N., SAÇILIK K., Guo Z., Cheein F. A.

PLOS ONE, cilt.21, sa.3 March, 2026 (SCI-Expanded, Scopus) identifier identifier

  • Yayın Türü: Makale / Tam Makale
  • Cilt numarası: 21 Sayı: 3 March
  • Basım Tarihi: 2026
  • Doi Numarası: 10.1371/journal.pone.0323158
  • Dergi Adı: PLOS ONE
  • Derginin Tarandığı İndeksler: Science Citation Index Expanded (SCI-EXPANDED), Scopus, BIOSIS, Chemical Abstracts Core, EMBASE, Index Islamicus, Linguistic Bibliography, MEDLINE, Psycinfo, zbMATH, Directory of Open Access Journals
  • Ankara Üniversitesi Adresli: Evet

Özet

Accurate plant health monitoring relies on hyperspectral imagery to extract vegetation spectral signatures and compute vegetation indices (VIs), which are critical for phenotyping and crop condition assessment. However, the requirement for high spectral resolution significantly increases the cost and complexity of data acquisition. In this study, we proposed a novel machine learning-based framework for predicting VIs from down-sampled hyperspectral reflectance data. The aim was to reduce the dependency on high-resolution spectral imagery without compromising prediction accuracy. The framework integrated correlation-based feature selection with four regression models to identify and utilize the most informative spectral bands from coarsely sampled data. The system was trained and validated using a data set consisting of 555 spectral signatures collected from olive leaves at five stages of dehydration, with spectral resolutions ranging from 1 to 100 nm. A total of 25 vegetation indices, commonly used in the estimation of water stress, chlorophyll, and nitrogen, were predicted on various sampling scales. Experimental results show that even with 100 nm spectral resolution, the proposed framework achieves high prediction accuracy, with coefficients of determination reaching 0.99 for RVSI, VOPT, and SPADI indices. These findings demonstrate that accurate vegetation index estimation is achievable with significantly fewer spectral bands, offering a cost-effective solution for large-scale plant health monitoring. This framework lays the groundwork for the development of low-cost, data-efficient remote sensing systems for precision agriculture, especially in crops such as olives, where health dynamics are sensitive to water and nutrient status.