Early stage lung cancer detection from speech sounds in natural environments


ANKIŞHAN H., Ulucanlar H., AKTÜRK İ., ALPHAN KAVAK K., Bağcı U., Mustafa Yenigün B.

Biomedical Signal Processing and Control, cilt.96, 2024 (SCI-Expanded) identifier

  • Yayın Türü: Makale / Tam Makale
  • Cilt numarası: 96
  • Basım Tarihi: 2024
  • Doi Numarası: 10.1016/j.bspc.2024.106628
  • Dergi Adı: Biomedical Signal Processing and Control
  • Derginin Tarandığı İndeksler: Science Citation Index Expanded (SCI-EXPANDED), Scopus, Compendex, EMBASE, INSPEC
  • Anahtar Kelimeler: Early-stage lung cancer detection, Relationship between speech patterns and lung cancer, Speech analysis in cancer diagnosis, Voice biomarkers for lung cancer detection
  • Ankara Üniversitesi Adresli: Evet

Özet

In the diagnosis of early-stage lung cancer, conventional methods often rely on periodic imaging techniques using medical devices. However, recent studies suggest that speech sounds could offer valuable insights into the diagnosis of disease. This study investigates the different characteristics of speech sounds recorded in natural environments between individuals diagnosed with lung cancer and those who are healthy. Using signal processing techniques and a self-supervised contrastive learning approach, we investigate the classification of these speech sounds for the diagnosis of early-stage lung cancer. Our results show that it is possible to utilize naturally recorded speech sounds. Using the Graph Attention Transformer Fine-Tuning Contrastive Learning (GAT-ftCL) model, which leverages graph neural networks to capture complex relationships in data and fine-tunes the learning process through contrastive learning, we achieve an accuracy of 90.90% in distinguishing individuals diagnosed with lung cancer from healthy individuals. Furthermore, the model achieves a remarkable accuracy of 92.85% in specifically identifying individuals with early stage (stage 1) lung cancer within the stage 1 group and healthy individuals. These results underline the diagnostic potential of natural speech sounds, especially in the detection of early-stage lung cancer.