Spatial and temporal variations in river water quality of the Middle Ganga Basin using unsupervised machine learning techniques

dc.contributor.authorKrishnaraj, A.
dc.contributor.authorDeka, P.C.
dc.date.accessioned2026-02-05T09:27:48Z
dc.date.issued2020
dc.description.abstractIn this study, cluster analysis (CA), principal component analysis (PCA) and correlation were applied to access the river water quality status and to understand spatiotemporal patterns in the Ganga River Basin, Uttara Pradesh. The study was carried out using data collected over 12 years (2005–2017) regarding 20 water quality parameters (WQPs) covering spatially from upstream to downstream Ankinghat to Chopan, respectively (20 stations under CWC Middle Ganga Basin). The temporal variations of river water quality were established using the Spearman non-parametric correlation coefficient test (Spearman R). The highest Spearman R (?0.866) was observed for temperature with the season and a very significant p value of (0.0000). The parameters EC, pH, TDS, T, Ca, Cl, HCO<inf>3</inf>, Mg, NO<inf>2</inf> + NO<inf>3</inf>, SiO<inf>2</inf> and DO had a significant correlation with the season (p < 0. 05). K-means clustering algorithm grouped the stations into four different clusters in dry and wet seasons. Based on these clusters, box and whisker plots were generated to study individual clusters in different seasons. The spatial patterns of river WQ on both seasons were examined. PCA was applied to screen out the most significant water quality parameters due to spatial and seasonal variations out of a large data set. It is a data reduction process and a more conventional way of speeding up any machine learning algorithms. A reduced number of three principal components (PCs) were drawn for 20 WQPs with an explained total variance of 75.84% and 80.57% is observed in the dry and wet season, respectively. The parameters DO, EC_ Gen, P-Tot, SO<inf>4</inf> are the most dominating parameters with PC score more than 0.8 in the dry season; similarly, TDS, K, COD, Cl, Na, SiO<inf>2</inf> in the wet season. The different components of water quality monitoring, such as spatiotemporal patterns, scrutinize the most relevant water quality parameters and monitoring stations are well addressed in this study and could be used for the better management of the Ganga River Basin. © 2020, Springer Nature Switzerland AG.
dc.identifier.citationEnvironmental Monitoring and Assessment, 2020, 192, 12, pp. -
dc.identifier.issn1676369
dc.identifier.urihttps://doi.org/10.1007/s10661-020-08624-4
dc.identifier.urihttps://idr.nitk.ac.in/handle/123456789/23556
dc.publisherSpringer Science and Business Media Deutschland GmbH
dc.subjectCalcium compounds
dc.subjectChlorine compounds
dc.subjectCluster analysis
dc.subjectK-means clustering
dc.subjectLearning algorithms
dc.subjectMachine learning
dc.subjectMagnesium compounds
dc.subjectMicrocomputers
dc.subjectQuality control
dc.subjectSilica
dc.subjectSilicon
dc.subjectSodium compounds
dc.subjectWater quality
dc.subjectWatersheds
dc.subjectBox and Whisker plots
dc.subjectCorrelation coefficient
dc.subjectPrincipal Components
dc.subjectSpatial and temporal variation
dc.subjectSpatiotemporal patterns
dc.subjectUnsupervised machine learning
dc.subjectWater quality monitoring
dc.subjectWater quality parameters
dc.subjectRivers
dc.subjectbicarbonate
dc.subjectcalcium
dc.subjectchloride
dc.subjectdissolved oxygen
dc.subjectmagnesium
dc.subjectnitrogen
dc.subjectoxygen
dc.subjectpotassium
dc.subjectriver water
dc.subjectsilicon dioxide
dc.subjectsodium
dc.subjectcluster analysis
dc.subjectcorrelation
dc.subjectmachine learning
dc.subjectprincipal component analysis
dc.subjectspatiotemporal analysis
dc.subjectwater quality
dc.subjectArticle
dc.subjectbiochemical oxygen demand
dc.subjectchemical oxygen demand
dc.subjectcorrelation analysis
dc.subjectdry season
dc.subjectelectric conductivity
dc.subjectenvironmental monitoring
dc.subjectenvironmental temperature
dc.subjectfeature selection
dc.subjectIndia
dc.subjectk means clustering
dc.subjectlearning algorithm
dc.subjectpH
dc.subjectrainy season
dc.subjectriver basin
dc.subjectseasonal variation
dc.subjectunsupervised machine learning
dc.subjectriver
dc.subjectseason
dc.subjectwater pollutant
dc.subjectGanges Basin
dc.subjectEnvironmental Monitoring
dc.subjectSeasons
dc.subjectSilicon Dioxide
dc.subjectUnsupervised Machine Learning
dc.subjectWater Pollutants, Chemical
dc.subjectWater Quality
dc.titleSpatial and temporal variations in river water quality of the Middle Ganga Basin using unsupervised machine learning techniques

Files

Collections