DSpace Repository

Enhancing crop yield prediction in Senegal using advanced machine learning techniques and synthetic data

Show simple item record

dc.contributor.author Razavi, M.A.
dc.contributor.author Pouyan Nejadhashemi, A.
dc.contributor.author Majidi, B.
dc.contributor.author Razavi, H.S.
dc.contributor.author Kpodo, J.
dc.contributor.author Eeswaran, R.
dc.contributor.author Ciampitti, I.
dc.contributor.author Vara Prasad, P.V.
dc.date.accessioned 2025-01-27T02:40:58Z
dc.date.available 2025-01-27T02:40:58Z
dc.date.issued 2024
dc.identifier.uri http://repo.lib.jfn.ac.lk/ujrr/handle/123456789/10998
dc.description.abstract In this study, we employ advanced data-driven techniques to investigate the complex relationships between the yields of five major crops and various geographical and spatiotemporal features in Senegal. We analyze how these features influence crop yields by utilizing remotely sensed data. Our methodology incorporates clustering algorithms and correlation matrix analysis to identify significant patterns and dependencies, offering a compre- hensive understanding of the factors affecting agricultural productivity in Senegal. To optimize the model's performance and identify the optimal hyperparameters, we implemented a comprehensive grid search across four distinct machine learning regressors: Random Forest, Extreme Gradient Boosting (XGBoost), Categorical Boosting (CatBoost), and Light Gradient-Boosting Machine (LightGBM). Each regressor offers unique functional- ities, enhancing our exploration of potential model configurations. The top-performing models were selected based on evaluating multiple performance metrics, ensuring robust and accurate predictive capabilities. The re- sults demonstrated that XGBoost and CatBoost perform better than the other two. We introduce synthetic crop data generated using a Variational Auto Encoder to address the challenges posed by limited agricultural datasets. By achieving high similarity scores with real-world data, our synthetic samples enhance model robustness, mitigate overfitting, and provide a viable solution for small dataset issues in agriculture. Our approach distin- guishes itself by creating a flexible model applicable to various crops together. By integrating five crop datasets and generating high-quality synthetic data, we improve model performance, reduce overfitting, and enhance re- alism. Our findings provide crucial insights for productivity drivers in key cropping systems, enabling robust recommendations and strengthening the decision-making capabilities of policymakers and farmers in data- scarce regions. en_US
dc.language.iso en en_US
dc.publisher Elsevier en_US
dc.subject Crop yield prediction en_US
dc.subject Variational auto encoder en_US
dc.subject Pattern recognition on spatiotemporal en_US
dc.subject Physiographical variables en_US
dc.subject Synthetic tabular data generation en_US
dc.subject Ensemble learning en_US
dc.title Enhancing crop yield prediction in Senegal using advanced machine learning techniques and synthetic data en_US
dc.type Article en_US
dc.identifier.doi https://doi.org/10.1016/j.aiia.2024.11.005 en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record