Robust Inverse Framework using Knowledge-guided Self-Supervised Learning: An application to Hydrology

Published in KDD 22: Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2022

In this paper, we propose a novel Knowledge-guided Self-Supervised Learning (KGSSL) inverse framework to extract system characteristics from driver(input) and response(output) data

Download paper here

BibTeX:

@inproceedings{10.1145/3534678.3539448,
author = {Ghosh, Rahul and Renganathan, Arvind and Tayal, Kshitij and Li, Xiang and Khandelwal, Ankush and Jia, Xiaowei and Duffy, Christopher and Nieber, John and Kumar, Vipin},
title = {Robust Inverse Framework Using Knowledge-Guided Self-Supervised Learning: An Application to Hydrology},
year = {2022},
isbn = {9781450393850},
publisher = {Association for Computing Machinery},
address = {New York, NY, USA},
url = {https://doi.org/10.1145/3534678.3539448},
doi = {10.1145/3534678.3539448},
abstract = {Machine Learning is beginning to provide state-of-the-art performance in a range of environmental applications such as streamflow prediction in a hydrologic basin. However, building accurate broad-scale models for streamflow remains challenging in practice due to the variability in the dominant hydrologic processes, which are best captured by sets of process-related basin characteristics. Existing basin characteristics suffer from noise and uncertainty, among many other things, which adversely impact model performance. To tackle the above challenges, in this paper, we propose a novel Knowledge-guided Self-Supervised Learning (KGSSL) inverse framework to extract system characteristics from driver(input) and response(output) data. This first-of-its-kind framework achieves robust performance even when characteristics are corrupted or missing. We evaluate the KGSSL framework in the context of stream flow modeling using CAMELS (Catchment Attributes and MEteorology for Large-sample Studies) which is a widely used hydrology benchmark dataset. Specifically, KGSSL outperforms baseline by 16% in predicting missing characteristics. Furthermore, in the context of forward modelling, KGSSL inferred characteristics provide a 35% improvement in performance over a standard baseline when the static characteristic are unknown.},
booktitle = {Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining},
pages = {465–474},
numpages = {10},
keywords = {forward modeling, inverse modeling, self-supervised learning},
location = {Washington DC, USA},
series = {KDD '22}
}