Version 9 (modified by gutierjm, 7 years ago) (diff)


Although the ECOMS-UDG provides a web interface to explore and access the datasets (shown in this section), it is strongly recommented the use of OPeNDAP client libraries to remotely access the data from scientific computing environments (see the R package or the Python and Matlab functions). Note that the different datasets are stored in ECOMS-UDG in its original format, with their particular conventions and temporal aggregations for the variables. The above packages allow accessing the different datasets in an homogeneous form, using a single table of variables.

Any of the datasets included in the UDG can be explored via Web by clicking in the corresponding name (e.g. System4_annual_15):

Currently, only the OPeNDAP access service is fully operative. Therefore, in this example, we will illustrate the use of this service, which allows selecting time/spatial data slices from the OPeNDAP data access form and downloading the resulting data in both ASCII and Binary formats. First of all, an authentication dialog will request a valid user name and password (see user registration for details).

Note that the variables provided (e.g. minimum temperature) are stored as gridsets. Thus, in addition to these variables, also auxiliary coordinate variables (lat, lon, run, time, member) should be handled for geo-temporal data referencing. Moreover, three time coordinates are included as referece for different grid variables because they are defined for different forecast times (one extra time for precipitation and different temporal resolution for mean sea level pressure). Note that this highly complicates the direct analysis of the data and, hence, this options is only recommend for data exploration.

In the following we show how to use this service to explore the structure of the datasets and to obtain simple pieces of information in ASCII format.

By default, if no specifications are given in the different subsetting boxes of the OpenDAP form, the whole data on the whole spatio/temporal and member ranges of the dataset would be accessed. However, this option will raise an error due to the large size of the request (the maximum size of a single request has been set to 100 Mbytes for the sake of multi-connection efficiency). The basic steps to retrieve subsets of data are the following:

  1. Select a variable click on the checkbox to its left.
  2. Constrain the variable: Edit the information that appears in the text boxes below the variable. This is a vector of integers indicating index positions of length three, with the following order: [start:stride:end].
  3. To get ASCII or binary values for the selected variables, click on the Get ASCII or Get Binary buttons of the Action field. Note that the URL displayed in the Data URL field is updated as you select and/or constrain variables. The URL in this field can be cut and pasted in various OPeNDAP clients.

The main disadvantage of the OPeNDAP service from the end-user point of view is that the specifications for subsetting dimensions are not given in their original magnitudes (i.e., latitudes and longitudes are not given in decimal degrees), but by the indexes of their position along their respective axes (note that first index value is always 0). Thus, to find out the indexes for the desired selection, we need to dump and analyze the particular values defined in the coordinate variable. For instance, this figure shows the 241 values defined for the lat (latitude) coordinate, as provided by the Get ASCII option (selecting the corresponding check-box).

Using these facilities it can be obtained after some calculations that the closest lat and lon coordinates for a particular location of interest (e.g. Madrid) are 66 and 475, respectively. Thus, the time series for Madrid corresponding to the example described in the previous section (minimum temperature forecasts for January with one-month lead time, i.e. from the simulations started the first of December) could be requested as shown in Figure

Note that the indexes selected for the run coordinate correspond to the December initializations (index positions 11, 23,..., and that indexes start in 0) and for the time coordinate correspond to January (positions, 31 to 62, in days after the run time). The proper use of this service requires a full understanding of the data structure and, therefore, it is only advised for data exploration.

Attachments (6)

Download all attachments as: .zip