Agriculture (Switzerland), cilt.15, sa.21, 2025 (SCI-Expanded, Scopus)
Wool traits such as fiber diameter, fiber length, and greasy fleece yield are economically significant characteristics in sheep breeding programs. Traditional genome-wide association studies (GWAS) have identified relevant genomic regions but often fail to capture the non-linear and polygenic architecture underlying these traits. In this study, we implemented a two-stage machine learning (ML)-based GWAS framework to dissect the genetic basis of wool traits in Central Anatolian Merino sheep. Phenotypic records were collected from 228 animals, genotyped with the Illumina OvineSNP50 BeadChip. In the first stage, feature selection was conducted using LASSO, Ridge Regression, and Elastic Net, generating a consensus SNP panel per trait. In the second stage, association modeling with Random Forest and Support Vector Regression (SVR) identified the most predictive models (R2 up to 0.86). Candidate gene annotation highlighted biologically relevant loci: MTHFD2L and EPGN (folate metabolism and keratinocyte proliferation) for fiber diameter; COL5A2, COL3A1, ITFG1, and ELMO1 (extracellular matrix integrity and actin remodeling) for staple length; and FAP, DPP4, PLCH1, and NPTX1 (extracellular matrix remodeling, proteolysis, and sebaceous gland function) for greasy fleece yield. These findings demonstrate the utility of ML-enhanced GWAS pipelines in identifying biologically meaningful markers and propose novel targets for genomic selection strategies to improve wool quality and yield in indigenous sheep populations.