Gupta, Ankita, Kaur, Lakhwinder, and Kaur, Gurmeet
PeerJ Computer Science. May 19, 2023, Vol. 9, e1268
Data warehousing/data mining, Algorithm, Aquatic resources -- Taiwan -- India, Droughts -- India -- Taiwan, Wheat industry -- Methods, Machine learning -- Methods, Data mining -- Methods, and Algorithms -- Methods
The workflow of this research is based on numerous hypotheses involving the usage of pre-processing methods, wheat canopy segmentation methods, and whether the existing models from the past research can be adapted to classify wheat crop water stress. Hence, to construct an automation model for water stress detection, it was found that pre-processing operations known as total variation with L1 data fidelity term (TV-L1) denoising with a Primal-Dual algorithm and min-max contrast stretching are most useful. For wheat canopy segmentation curve fit based K-means algorithm (Cfit-kmeans) was also validated for the most accurate segmentation using intersection over union metric. For automated water stress detection, rapid prototyping of machine learning models revealed that there is a need only to explore nine models. After extensive grid search-based hyper-parameter tuning of machine learning algorithms and 10 K fold cross validation it was found that out of nine different machine algorithms tested, the random forest algorithm has the highest global diagnostic accuracy of 91.164% and is the most suitable for constructing water stress detection models.