# Changes between Version 108 and Version 109 of udg/ecoms/dataserver/catalog

Ignore:
Timestamp:
Nov 8, 2016 11:38:26 AM (6 years ago)
Comment:

--

### Legend:

Unmodified
 v108 The '''ECOMS UDG''' collects and provides information (mainly at 6-hourly and/or daily resolution) for a reduced number of variables from a number of datasets (seasonal hindcasts, reanalysis and observations) obtained from different data providers. The following list of variables has been identified according to the user's needs, receiving feedback from EUPORIAS WP22 (climate information indices, CIIs), WP23 (impact models), WP21 (calibration and downscaling) and SPECS WP61 (pilot applications) and WP52 (calibration and downscaling). See  the section on the [wiki:../../EndUserNeeds assessment of user's needs] for more details. The '''ECOMS UDG''' collects and provides information (mainly at 6-hourly and/or daily resolution, but also some monthly data) for a reduced number of variables from a number of datasets (seasonal hindcasts, reanalysis and observations) obtained from different data providers. The following list of variables has been identified according to the user's needs, receiving feedback from EUPORIAS WP22 (climate information indices, CIIs), WP23 (impact models), WP21 (calibration and downscaling) and SPECS WP61 (pilot applications) and WP52 (calibration and downscaling). See  the section on the [wiki:../../EndUserNeeds assessment of user's needs] for more details. Note that the '''R names''' below correspond to the vocabulary names used in the [wiki:RPackage R data access package] for homogenization purposes. Note that, data homogenization and aggregation (i.e. daily means from 6h data) is only provided through the R data access package. Note that the '''R names''' below correspond to the vocabulary names used in the [wiki:RPackage R data access package] for harmonization purposes. Data harmonization and temporal aggregation (i.e. daily means from 6h data) is '''only''' provided through the R data access package. In order to specify the particular '''temporal frequency/aggregation''' available for the variables in the different datasets, the following codes are used in the table below: '''6h''' (6-hourly instantaneous data). '''12h''' (12-hourly instantaneous data). '''24h''' (24-hourly instantaneous data). '''DM''' (daily mean value). '''DX''' (daily maximum value). '''DN''' (daily minimum value).  '''DA''' (daily accumulated data). '''DAr''' (accumulated since the initialization time –runtime) NOTE: The R package performs deaccumulation on a daily basis to match the standard definition. '''fx''' (static field) The original '''temporal frequency/aggregation''' of the variables stored at the UDG, the following codes are used in the table below: '''6h''' (6-hourly instantaneous data). '''12h''' (12-hourly instantaneous data). '''24h''' (24-hourly instantaneous data). '''DM''' (daily mean value). '''DX''' (daily maximum value). '''DN''' (daily minimum value).  '''DA''' (daily accumulated data). '''DAr''' (accumulated since the initialization time –runtime) NOTE: The R package performs deaccumulation on a daily basis to match the standard definition. '''fx''' (static field). Similarly, the prefix M indicates monthly data in the following way: '''MM''' (monthly mean), '''MA''' (monthly accumulated). In the table below, boldface codes (e.g. '''6h''') indicate variables already available through the ECOMS UDG. '''Italics''' are used for work in progress (variables to be included in the next update). '''e''' indicates that a variable exists in the original dataset but it is not planned to be included yet in ECOMS-UDG; '''blanks''' indicate that the variables do not exist in the original dataset. Codes ended by '''(*)''' indicate variables which do NOT exist in the dataset, but are derived/approximated from other available ones through the [wiki:RPackage R  data access package]. For more details on the particular approximations used see the [https://github.com/SantanderMetGroup/loadeR/blob/master/R/conversion.R conversion formulae]. Variables ended by '''(#)''' indicate daily aggregated values obtained from the corresponding original 3-hourly data. In the table below, boldface codes (e.g. '''6h''') indicate variables already available through the ECOMS UDG. '''Italics''' are used for work in progress (variables to be included in the next update). '''e''' indicates that a variable exists in the original dataset but it is not planned to be included yet in ECOMS-UDG; '''blanks''' indicate that the variable does not exist in the original dataset. Codes ended by '''(*)''' indicate variables which do NOT exist in the dataset, but are derived/approximated from other available ones through the [wiki:RPackage R  data access package]. For more details on the particular approximations used see the [https://github.com/SantanderMetGroup/loadeR/blob/master/R/conversion.R conversion formulae]. Variables ended by '''(#)''' indicate daily aggregated values obtained from the corresponding original 3-hourly data. || || ||  '''Observations:'''   ||||  '''Reanalysis:'''  ||||||||||  '''Seasonal forecasting models:'''  || '''Data Homogeneization:''' The different nature of the datasets, and the idiosyncratic naming and storage conventions often applied by the modelling centres, makes necessary an homogenization across datasets in order to implement a truly user-friendly toolbox for data access.  To this aim, the [wiki:RPackage R package for data access] has been developed. Data homogenization is achieved through the creation of a common vocabulary. The particular variables of each dataset are then translated -and transformed if necessary- into the common vocabulary by means of a ''dictionary''. Both features -vocabulary and dictionary- are described [wiki:RPackage/homogeneization here]. In particular, some typical transformations performed by the loadECOMS interface are deaccumulation of initialization-accumulated variables to daily accumulated (i.e.: '''DAr''' --> '''DA''') and scaling and/or offset of variables to match standard units (e.g. -273.15 for conversion K --> ºC). '''Data Harmonization:''' The different nature of the datasets, and the idiosyncratic naming and storage conventions often applied by the modelling centres, makes necessary an harmonization across datasets in order to implement a truly user-friendly toolbox for data access.  To this aim, the [wiki:RPackage R package for data access] has been developed. Data harmonization is achieved through the creation of a common vocabulary. The particular variables of each dataset are then translated -and transformed if necessary- into the common vocabulary by means of a ''dictionary''. Both features -vocabulary and dictionary- are described [wiki:RPackage/homogeneization here]. In particular, some typical transformations performed by the loadECOMS interface are deaccumulation of initialization-accumulated variables to daily accumulated (i.e.: '''DAr''' --> '''DA''') and scaling and/or offset of variables to match standard units (e.g. -273.15 for conversion K --> ºC). [=#datasets]