Rain Data Validation

This document explains the rain data validation procedure as we started to apply within the FLUSSHYGIENE project.

Rain Data Files

The rain data provided by Berliner Wasserbetriebe (BWB) come in the form of Excel files (rain data files) that store for each five minute period the cumulative rain height measured by different rain gauges at the end of the period. The data are raw, i.e. they represent what was logged by the rain gauges.

Rain Correction Files

In order to calibrate the rain gauges BWB staff visits the rain gauges on a regular basis and applies a certain amount of water to them. This amount of water that appears as additional rain height in the data but does not represent actual rain needs to be excluded from the rain data. Therefore, BWB provides another set of Excel files (rain correction files) that contain for each day and each gauge the expected actual total rain height at that gauge and day.

Rain Data Validation

The task to be performed during “rain data validation” is to

Method

We follow a semi-automatic approach in which we use

The first script reads

The user can then check the CSV files created by the second script, edit them appropriately and save them under a different name (userdiff_*.csv in the userdiffs folder).

It is important to save them out of the folder autodiffs because the files within this folder are always overwritten by the second script.

After editing the userdiff-files, the script

macos_desktopdir/R_Development/RPackages/kwb.rain/inst/extdata/user_validation.R

can be used to apply the corrections defined by the user to the raw data and to store the resulting validated data.