Xarray 8: GRIB Data¶

Overview¶

Explore the differences between GRIB and NetCDF formats
Work with GRIB output from the HRRR regional model
Visualize data that is non-global in extent

Prerequisites¶

Concepts	Importance	Notes
Xarray Lessons 1-7	Necessary

Time to learn: 30 minutes

Imports¶

import numpy as np
from datetime import datetime
import xarray as xr
import pandas as pd
import matplotlib.pyplot as plt
from matplotlib import colors
import cartopy.crs as ccrs
import cartopy.feature as cfeat
from metpy.plots import USCOUNTIES

GRIB vs NetCDF formats¶

Advantages of GRIB:

WMO-certifed format. Almost all NWP output that is generated by a national meteorological service will be in GRIB format.
Compresses well (GRIB version 2 only)
Still used for initial / boundary conditions in WRF.
“Well-behaved” GRIB data can be fairly easily translated into self-describing NetCDF format. A THREDDS server performs this translation on demand, thus presenting the user with data that looks to be in NetCDF format even if the actual data is in GRIB.

Disadvantages of GRIB:

Not self-describing (unlike NetCDF). A GRIB data file must be accompanied by a set of tables that decodes fields into their actual physicall-relevant names.
Despte being a WMO standard, there is no central source to maintain such tables. Often times, as new models get released, the existing tables that one might be using may not be updated with new parameters. Best strategy: update your GRIB software libraries often (and hope other things don’t break as a result).
As tables change, there is no guarantee that a GRIB file you read one year might decode the same way another year.
Grids are written out sequentially to an output GRIB file. As a result, performing temporal and especially spatial subsetting, such as we can do in Xarray with NetCDF or Zarr data, is very difficult.

Make the map¶

tl1 = "HRRR Composite Reflectivity (dBZ)"
tl2 = str('Valid at: '+ timeStr)
title_line = (tl1 + '\n' + tl2 + '\n')

First draw a radar map that covers the entire domain of the HRRR.

Warning: The HRRR stands for High Resolution Rapid Refresh. The horizontal dimensions are 1059 x 1799 ... approximately 3 km between gridpoints. As a result, the maps will take longer to render.

Tip: Omitting some of the cartographic features, especially the filled ones like *ocean*, *land*, and *lakes*, will speed things up.

res = '50m'
fig = plt.figure(figsize=(18,12))
ax = plt.subplot(1,1,1,projection=proj_data)
ax.set_extent((lonW,lonE,latS,latN))
#ax.add_feature (cfeat.LAND.with_scale(res))
#ax.add_feature (cfeat.OCEAN.with_scale(res))
ax.add_feature(cfeat.COASTLINE.with_scale(res))
#ax.add_feature (cfeat.LAKES.with_scale(res), alpha = 0.5)
ax.add_feature (cfeat.STATES.with_scale(res))
# don't include county lines here
#ax.add_feature(USCOUNTIES,edgecolor='grey', linewidth=1, zorder = 3 );
CF = ax.contourf(refc.longitude,refc.latitude,refc[idx],levels=refl_range,cmap=cmap,transform=ccrs.PlateCarree())
cbar = plt.colorbar(CF,fraction=0.046, pad=0.03,shrink=0.5)
cbar.ax.tick_params(labelsize=10)
cbar.ax.set_ylabel("Reflectivity (dBZ)",fontsize=10)
title = ax.set_title(title_line,fontsize=16)

../../_images/c8e1ff4edc48df8dc85d7117b13376eea53037d359c3425aebe844ec8c1e6309.png

Now, plot the map over NYS.

res = '50m'
fig = plt.figure(figsize=(18,12))
ax = plt.subplot(1,1,1,projection=proj_sub)
ax.set_extent((lonW_sub,lonE_sub,latS_sub,latN_sub),crs=ccrs.PlateCarree())
#ax.add_feature (cfeat.LAND.with_scale(res))
#ax.add_feature (cfeat.OCEAN.with_scale(res))
ax.add_feature(cfeat.COASTLINE.with_scale(res))
#ax.add_feature (cfeat.LAKES.with_scale(res), alpha = 0.5)
ax.add_feature (cfeat.STATES.with_scale(res))
ax.add_feature(USCOUNTIES,edgecolor='grey', linewidth=1 );
CF = ax.contourf(refc.longitude,refc.latitude,refc[idx],levels=refl_range,cmap=cmap,transform=ccrs.PlateCarree())
cbar = plt.colorbar(CF,fraction=0.046, pad=0.03,shrink=0.5)
cbar.ax.tick_params(labelsize=10)
cbar.ax.set_ylabel("Reflectivity (dBZ)",fontsize=10)
title = ax.set_title(title_line,fontsize=16)

../../_images/9fb2cc476aabee2910af772867b6a41f946b907ad10aa8342e4f347fb49bec7e.png

To think about:
1. How would you plot different times? Is there just a single cell whose value needs to be change?
2. What if you wanted to loop over all seven times? Why does a Jupyter notebook make it a bit more difficult to do this?
3. Try loading and plotting the WIND Dataset. It consists of u- and v- components of wind at 10 and 80 m. Can you use the same call to xarray.open_dataset that you did for reflectivity?

Summary¶

Using Xarray’s cfgrib data engine, we can analyze and display data in GRIB format.
In general, GRIB data is “messier” than data in a self-describing format, such as NetCDF.

What’s Next?¶

In the next notebook, we’ll explore HRRR data that is stored “in the cloud”, in a format called Zarr.

ATM433/533 Fall 2023

Xarray 8: GRIB Data

Contents