Kumar, G.P.Dwarakish, G.S.2026-02-032025Water Conservation Science and Engineering, 2025, 10, 2, pp. -23663340https://doi.org/10.1007/s41101-025-00403-xhttps://idr.nitk.ac.in/handle/123456789/20147Accurate rainfall data is crucial for agriculture, hydrology, and climate research as it guides water management, crop planning, and disaster preparedness. Missing data affects reliability, requiring effective imputation. The purpose of this study is to address the critical challenge of imputing missing daily rainfall data, which is especially important given rainfall’s nonlinear distribution and variability in missingness patterns. This research aims to develop and evaluate advanced imputation algorithms to improve data completeness and integrity in humid tropical regions. This study evaluates ten imputation algorithms: K-nearest neighbors (KNN), classification and regression trees (CART), predictive mean matching (pmm), random forest (rf), mean method, and Bayesian methods (norm. boot, lasso. norm, norm, norm. nob, midastouch) for addressing missing daily rainfall data. Using 37 years of data from thirteen stations in the Kali River Basin, the methods leverage descriptive statistics to enhance accuracy in humid tropical regions. The proposed algorithms incorporate descriptive statistics of the rainfall time series and are evaluated using 37 years of daily data from thirteen selected rainfall stations in the Kali River Basin, a humid tropical region. Model performance was assessed at four missingness levels (1%, 5%, 10%, and 20%) and evaluated with accuracy metrics root mean square error (RMSE), mean absolute error (MAE), index of agreement (d), and RMSE-observations standard deviation ratio (RSR). Among the evaluated methods, KNN consistently demonstrated superior performance across all levels of missingness (RMSE = 13.22 to 15.42; MAE = 4.68 to 6.08; d = 0.87 to 0.90; RSR = 0.57 to 0.61), followed closely by CART (RMSE = 16.48 to 20.77; MAE = 6.20 to 8.31; d = 0.81 to 0.86; RSR = 0.71 to 0.79). Overall, KNN, CART, pmm, and rf emerged as reliable methods for imputing missing rainfall data of varying lengths, contributing to more accurate weather forecasting and climate change analyses. © The Author(s), under exclusive licence to Springer Nature Singapore Pte Ltd. 2025.Bayesian networksCropsData accuracyData reductionDecision treesDisaster preventionInformation managementNearest neighbor searchRainRandom forestsRegression analysisRiversTime seriesTropical engineeringTropicsClassification treesData qualityImputation methodsK-near neighborMissing dataNearest-neighbourPerformancePerformance scoreRainfall dataRegression treesMean square erroralgorithmcomparative studydata qualitymachine learningnearest neighbor analysisrainfallriver basinspatial distributiontropical regionSharda BasinComparison of the Multiple Imputation Approaches for Imputing Rainfall Data: A Humid Tropical River Basin Case Study