| 2 | |
| 3 | == WRF4G Components == |
| 4 | |
| 5 | WRF4G is a software that has 2 main components, one runs in the '''user's interface (UI)''' (usually their PCs) and manages the experiments and the resources used by the experiment. This component is in charge of the design, execution and monitoring of experiments. Once submitted to a resource, WRF4G start to run in a '''Computing Node'''. |
| 6 | |
| 7 | A user can use the following resources as Computing Nodes: |
| 8 | * The same UI where their are running the WRF4G framework. |
| 9 | * Workstation or stand-alone server. |
| 10 | * Cluster (PBS, SGE, SLURM) |
| 11 | * Mainframe HPC. |
| 12 | * Grid infrastructures (currently supported globus and gLite) |
| 13 | |
| 14 | In the examples shown bellow, we will use only a Computing Node that will be the UI where the framework is running. |
| 15 | |
57 | | * wrf4g_ps returns the following information: |
58 | | * Realization: Realization name. It is taken from the field experiment_name in experiment.wrf4g. |
59 | | * Status: It can be take the following values: Prepared, Submitted, Running, Failed and Done). |
60 | | * Chunks [Chunk running/Total Chunks]: A realization is split into chunks. Each chunk is sent as a job. |
61 | | * Computer resource: Resource (cluster) where the job is running. |
62 | | * WN: Computing node where the job is running. |
63 | | * Run.Sta: Job status in the WN (Downloading data, running ungrib, real, wrf, ...) |
64 | | * ext: Exit Code. If exit code is different from 0, there has been an error. Error codes are explained in $WRF4G_LOCATION/lib/bash/wrf4g_exit_codes.sh |
65 | | * % : percentage of simulation finished. |
67 | | |
68 | | * Check the simulation output and log files in $WRF4G_LOCATION/repository/output |
69 | | |
70 | | '''To run a different experiment, you only need to modify the following files according to your needs:''' |
71 | | |
72 | | * experiment.wrf4g: Experiment configuration: type of experment, simulation dates, restarts, physics,... |
73 | | * resources.wrf4g: With this file the resources used by a WRF experiment are configured: data repository, number of processes for MPI, version of WRF to run,.. By default there is a file that stores a configuration that affects to every experiment ($WRF4G_LOCATION/etc/resources.wrf4g). If a file with this name (resources.wrf4g) is found in the folder where experiment.wrf4g is stored, this file will be used instead of the application one ($WRF4G_LOCATION/etc/resources.wrf4g) |
74 | | * $WRF4G_LOCATION/etc/framework4g.conf: Database and computing resources configuration. After modifying this file, '''wrf4g_framework restart''' must be run. |
75 | | |
76 | | For example, to add a new PBS cluster to run jobs: |
77 | | |
78 | | * First modify the file $WRF4G_LOCATION/etc/framework4g.conf to add more resources ([https://meteo.macc.unican.es/trac/WRF4G/wiki/DRM4G More information about how to add resources]): |
79 | | {{{ |
80 | | mycomputer local://localhost?LRMS_TYPE=fork;NODECOUNT=1 |
81 | | ui01 local://localhost?LRMS_TYPE=pbs;QUEUE_NAME=estadistica |
82 | | }}} |
83 | | |
84 | | * Change the following fields in $WRF4G_LOCATION/etc/resources.wrf4g or copy $WRF4G_LOCATION/etc/resources.wrf4g where experiment.wrf4g is located: |
85 | | * The repository data if the cluster do not access them directly. |
86 | | * NP to the number of MPI processess you want to use |
87 | | * REQUIREMENTS if you want to choose the resource that will run the experiment. REQUIREMENTS ='HOSTNAME = "*ui01*";' |
88 | | |
89 | | * Restart the wrf4g_framework: '''wrf4g_framework restart''' |
90 | | * Submit the experiment with a different name (change the name in experiment.wrf4g) |
91 | | {{{ |
92 | | cd $WRF4G_LOCATION/experiments/single/snd50c |
93 | | wrf4g_submitter.sh |
94 | | }}} |
95 | | |
96 | | == Veryfing installation == |
97 | | * List the computing resources available for running jobs. |
98 | | |
99 | | {{{ |
100 | | $ ~/wrf4g$ gwhost |
101 | | HID PRIO OS ARCH MHZ %CPU MEM(F/T) DISK(F/T) N(U/F/T) LRMS HOSTNAME |
102 | | 0 1 GNU/Linux2.6.32 unkno 1000 0 1822/3270 2139/8662 0/1/1 FORK mycomputer |
103 | | }}} |
104 | | |