Tree-Based Modeling for Large-Scale Management in Agriculture: Explaining Organic Matter Content in Soil

Lee, Woosik and Lee, Juhwan (2024) Tree-Based Modeling for Large-Scale Management in Agriculture: Explaining Organic Matter Content in Soil. Applied Sciences, 14 (5). p. 1811. ISSN 2076-3417

[thumbnail of applsci-14-01811.pdf] Text
applsci-14-01811.pdf - Published Version

Download (1MB)

Abstract

Machine learning (ML) has become more prevalent as a tool used for biogeochemical analysis in agricultural management. However, a common drawback of ML models is the lack of interpretability, as they are black boxes that provide little insight into agricultural management. To overcome this limitation, we compared three tree-based models (decision tree, random forest, and gradient boosting) to explain soil organic matter content through Shapley additive explanations (SHAP). Here, we used nationwide data on field crops, soil, terrain, and climate across South Korea (n = 9584). Using the SHAP method, we identified common primary controls of the models, for example, regions with precipitation levels above 1400 mm and exchangeable potassium levels exceeding 1 cmol+ kg−1, which favor enhanced organic matter in the soil. Different models identified different impacts of macronutrients on the organic matter content in the soil. The SHAP method is practical for assessing whether different ML models yield consistent findings in addressing these inquiries. Increasing the explainability of these models means determining essential variables related to soil organic matter management and understanding their associations for specific instances.

Item Type: Article
Subjects: Asian STM > Multidisciplinary
Depositing User: Managing Editor
Date Deposited: 23 Feb 2024 05:03
Last Modified: 23 Feb 2024 05:03
URI: http://journal.send2sub.com/id/eprint/3132

Actions (login required)

View Item
View Item