Filling streamflow data gaps in Indian catchments using machine learning

Show simple item record

dc.contributor.author Solanki, Hiren
dc.contributor.author Mishra, Vimal
dc.coverage.spatial Austria
dc.date.accessioned 2025-03-28T15:38:36Z
dc.date.available 2025-03-28T15:38:36Z
dc.date.issued 2025-04-27
dc.identifier.citation Solanki, Hiren and Mishra, Vimal, "Filling streamflow data gaps in Indian catchments using machine learning", in the EGU General Assembly 2025, Vienna, AT, Apr. 27-May 02, 2025.
dc.identifier.uri https://meetingorganizer.copernicus.org/EGU25/EGU25-15030.html
dc.identifier.uri https://repository.iitgn.ac.in/handle/123456789/11149
dc.description.abstract Complete hydrological time series are critical for effective water resource management, flood and drought forecasting, hydroelectric power optimization, irrigation planning, ecological preservation, and climate change impact assessments. However, significant data gaps in streamflow and water level observations, compounded by extreme hydroclimatic events and quality control issues, hinder accurate modeling and informed decision-making in Indian catchments. The current challenges are particularly pronounced in regions with high climatic variability, where missing data spans 6 to 12 months. To address this, we employed geomorphological, meteorological, and hydrological parameters in combination with the Random Forest method to gap-fill streamflow data at 352 stations across India, except the transboundary basins. To enhance model accuracy and training, we categorized stations into similar-behaving classes using a k-means clustering algorithm based on catchment characteristics. This clustering increased the availability of training data for machine learning models. Streamflow data from each class was trained with 80% of the available data and validated on the remaining 20%. Our results indicate that clustering significantly improves performance, with over 100 stations reporting a >25% increase in Nash-Sutcliffe Efficiency (NSE). Model performance was evaluated for continuous data gaps of 1 week, 1 month, 3 months, 6 months, and 1 year, revealing a decline in accuracy with longer gaps. Despite this, the mean NSE exceeded 0.85 across all clusters. The gap-filled datasets provide robust hydrographs, enabling precise streamflow variability modeling, climate-hydrology interaction evaluation, and improved water resource management strategies.
dc.description.statementofresponsibility by Hiren Solanki and Vimal Mishra
dc.language.iso en_US
dc.title Filling streamflow data gaps in Indian catchments using machine learning
dc.type Conference Paper
dc.relation.journal EGU General Assembly 2025


Files in this item

Files Size Format View

There are no files associated with this item.

This item appears in the following Collection(s)

Show simple item record

Search Digital Repository


Browse

My Account