Filling streamflow data gaps in Indian catchments using machine learning

DR Home
→
Earth Sciences
→
Conference Papers
→
View Item

dc.contributor.author	Solanki, Hiren
dc.contributor.author	Mishra, Vimal
dc.coverage.spatial	Austria
dc.date.accessioned	2025-03-28T15:38:36Z
dc.date.available	2025-03-28T15:38:36Z
dc.date.issued	2025-04-27
dc.identifier.citation	Solanki, Hiren and Mishra, Vimal, "Filling streamflow data gaps in Indian catchments using machine learning", in the EGU General Assembly 2025, Vienna, AT, Apr. 27-May 02, 2025.
dc.identifier.uri	https://meetingorganizer.copernicus.org/EGU25/EGU25-15030.html
dc.identifier.uri	https://repository.iitgn.ac.in/handle/123456789/11149
dc.description.abstract	Complete hydrological time series are critical for effective water resource management, flood and drought forecasting, hydroelectric power optimization, irrigation planning, ecological preservation, and climate change impact assessments. However, significant data gaps in streamflow and water level observations, compounded by extreme hydroclimatic events and quality control issues, hinder accurate modeling and informed decision-making in Indian catchments. The current challenges are particularly pronounced in regions with high climatic variability, where missing data spans 6 to 12 months. To address this, we employed geomorphological, meteorological, and hydrological parameters in combination with the Random Forest method to gap-fill streamflow data at 352 stations across India, except the transboundary basins. To enhance model accuracy and training, we categorized stations into similar-behaving classes using a k-means clustering algorithm based on catchment characteristics. This clustering increased the availability of training data for machine learning models. Streamflow data from each class was trained with 80% of the available data and validated on the remaining 20%. Our results indicate that clustering significantly improves performance, with over 100 stations reporting a >25% increase in Nash-Sutcliffe Efficiency (NSE). Model performance was evaluated for continuous data gaps of 1 week, 1 month, 3 months, 6 months, and 1 year, revealing a decline in accuracy with longer gaps. Despite this, the mean NSE exceeded 0.85 across all clusters. The gap-filled datasets provide robust hydrographs, enabling precise streamflow variability modeling, climate-hydrology interaction evaluation, and improved water resource management strategies.
dc.description.statementofresponsibility	by Hiren Solanki and Vimal Mishra
dc.language.iso	en_US
dc.title	Filling streamflow data gaps in Indian catchments using machine learning
dc.type	Conference Paper
dc.relation.journal	EGU General Assembly 2025

Files in this item

Files	Size	Format	View
There are no files associated with this item.

This item appears in the following Collection(s)

Conference Papers [61]

Show simple item record

Search Digital Repository

Browse

All of DSpace
This Collection
- Titles
- Authors
- By Advisor
- By Issue Date
- Subjects
- By Type
- By Degree
- By Department

Filling streamflow data gaps in Indian catchments using machine learning

Files in this item

This item appears in the following Collection(s)

Search Digital Repository

Browse

All of DSpace

This Collection

My Account