2. Global Workflow Components

The Global Workflow is an end-to-end workflow includes several components stitched together to prepare, analyze, produce, and post-process forecast data.

The major components of the GW system are:

  • Workflow

  • Pre-processing

  • Analysis

  • Forecast

  • Post-processing

  • Verification

  • Archiving

The GW repository contains the workflow and script layers. External components will be checked out as git submodules. All of the submodules of the system reside in their respective repositories on GitHub.

2.1. Component Repositories

Components included as submodules:

Note

When running the system in forecast-only mode the DA components are not needed and are hence not built.

2.2. Additional Resources for New Users

Additional documentation and resources relevant to GW is shown in the table below for new users.

Documentation

Location

UFS_WM

https://ufs-weather-model.readthedocs.io/en/develop/

UFS UTILS

https://noaa-emcufs-utils.readthedocs.io/en/latest/

GFS UTILS

https://www.emc.ncep.noaa.gov/emc/pages/numerical_forecast_systems/gfs/documentation.php

WaveWatchIII

https://github.com/NOAA-EMC/WW3/wiki/WAVEWATCH-III-User-Guide

GDAS

https://www.ncei.noaa.gov/products/weather-climate-models/global-data-assimilation

GSI

https://github.com/NOAA-EMC/GSI/tree/develop/doc

wxflow

https://wxflow.readthedocs.io/en/latest/

Verif Global

https://github.com/NOAA-EMC/EMC_verif-global/wiki

2.3. External dependencies

2.3.1. Libraries

All the libraries that are needed to run the end-to-end GW are built using a package manager. These are served via spack-stack. These libraries are already installed on supported NOAA HPC platforms.

For more information:

https://github.com/JCSDA/spack-stack/wiki/Porting-spack-stack-to-a-new-system

2.3.2. Observation data (OBSPROC/prep)

2.3.2.1. Data

Observation data, also known as dump data, is prepared in production and then archived in the Global Dump Archive (GDA) for use in cycled experiments. The GDA is available on supported platforms and is identified as $DMPDIR in the workflow. The workflow automatically knows where to find GDA data on supported platforms:

  • Ursa: /scratch3/NCEPDEV/global/role.glopara/dump

  • Orion/Hercules: /work/noaa/rstprod/dump

  • WCOSS2: /lfs/h2/emc/global/noscrub/emc.global/dump

  • Derecho: /lustre/desc1/p/nral0032/global/dump_nr (unrestricted)

2.3.2.1.1. GDA Structure

The GDA mimics its production layout:

  • GDAS/GFS: DMPDIR/gdas[gfs].PDY/CC/atmos/FILES

  • Real‑Time Ocean Forecast System (RTOFS): DMPDIR/rtofs.PDY/FILES

The GDA also contains special versions of some datasets and experimental data that is being evaluated ahead of implementation into production. The special subfolder suffixes are:

SUFFIX

MEANING

nr

Non-restricted versions of restricted files in production. Produced in production. Restricted data is fully stripped from files. These files remain as is.

ur

Un-restricted versions of restricted files in production. Produced and archived on a 48hrs delay. Some restricted datasets are unrestricted. Data amounts: restricted > un-restricted > non-restricted Limited availability. Discontinued in mid-2023.

x

Experimental global datasets under evaluation for production. Dates and types vary depending on upcoming upgrades.

y

Similar to “x” but only used when there is a duplicate experimental file in the x subfolder with the same name. Dfferent from both the production versions, if exists, and the x versions. This suffix is rarely used.

p

Pre-production dataset copy, as produced by National Centers for Environmental Prediction (NCEP) Central Operations (NCO) during final 30-day parallel ahead of implementation. Not always archived.

2.3.2.2. Data processing

Upstream observation handling (collection, quality control, and packaging) of global workflow is performed using Observation Processing (OBSPROC) group’s codes and scripts. The workflow uses two packages from OBSPROC to prepare observation (dump) data for use by the analysis system:

  1. https://github.com/NOAA-EMC/obsproc

  2. https://github.com/NOAA-EMC/prepobs

Package versions and paths on supported platforms are defined in the GW system configuration file and module files.