Tutorial 7: Other Computational Tools in Xarray

Tutorial 7: Other Computational Tools in Xarray#

Week 1, Day 1, Climate System Overview

Content creators: Sloane Garelick, Julia Kent

Content reviewers: Katrina Dobson, Younkap Nina Duplex, Danika Gupta, Maria Gonzalez, Will Gregory, Nahid Hasan, Paul Heubel, Sherry Mi, Beatriz Cosenza Muralles, Jenna Pearson, Agustina Pesce, Chi Zhang, Ohad Zivan

Content editors: Paul Heubel, Jenna Pearson, Chi Zhang, Ohad Zivan

Production editors: Wesley Banfield, Paul Heubel, Jenna Pearson, Konstantine Tsafatinos, Chi Zhang, Ohad Zivan

Our 2024 Sponsors: CMIP, NFDI4Earth

#

Pythia credit: Rose, B. E. J., Kent, J., Tyle, K., Clyne, J., Banihirwe, A., Camron, D., May, R., Grover, M., Ford, R. R., Paul, K., Morley, J., Eroglu, O., Kailyn, L., & Zacharias, A. (2023). Pythia Foundations (Version v2023.05.01) https://zenodo.org/record/8065851

#

Tutorial Objectives#

Estimated timing of tutorial: 15 minutes

Thus far, we’ve learned about various climate processes in the videos, and we’ve explored tools in Xarray that are useful for analyzing and interpreting climate data in the tutorials.

In this tutorial, you’ll continue using the SST data from CESM2 and practice using some additional computational tools in Xarray to resample your data, which can help with data comparison and analysis. The functions you will use are:

.resample(): Groupby-like functionality specifically for time dimensions. Can be used for temporal upsampling and downsampling. Additional information about resampling in Xarray can be found here.
.rolling(): Useful for computing aggregations on moving windows of your dataset e.g. computing moving averages. Additional information about resampling in Xarray can be found here.
.coarsen(): Generic functionality for downsampling data. Additional information about resampling in Xarray can be found here.

Setup#

# installations ( uncomment and run this cell ONLY when using google colab or kaggle )
#!pip install pythia_datasets cftime nc-time-axis

# imports
import matplotlib.pyplot as plt
import xarray as xr
from pythia_datasets import DATASETS

/opt/hostedtoolcache/Python/3.9.18/x64/lib/python3.9/site-packages/pythia_datasets/__init__.py:4: UserWarning: pkg_resources is deprecated as an API. See https://setuptools.pypa.io/en/latest/pkg_resources.html. The pkg_resources package is slated for removal as early as 2025-11-30. Refrain from using this package or pin to Setuptools<81.
  from pkg_resources import DistributionNotFound, get_distribution

Install and import feedback gadget#

Figure Settings#

Video 1: Carbon Cycle and the Greenhouse Effect#

Submit your feedback#

If you want to download the slides: https://osf.io/download/sb3n5/

Submit your feedback#

Section 1: High-level Computation Functionality#

In this tutorial, you will learn about several methods for dealing with the resolution of data. Here are some links for quick reference, and we will go into detail in each of them in the sections below.

.resample(): Groupby-like functionality especially for time dimensions. Can be used for temporal upsampling and downsampling
.rolling(): Useful for computing aggregations on moving windows of your dataset e.g. computing moving averages
.coarsen(): Generic functionality for downsampling data

First, let’s load the same data that we used in the previous tutorials (monthly SST data from CESM2):

Section 1.1: Resampling Data#

For upsampling or downsampling temporal resolutions, we can use the .resample() method in Xarray. For example, you can use this function to downsample a dataset from hourly to 6-hourly resolution.

Our original SST data is monthly resolution. Let’s use .resample() to downsample to annual frequency:

# resample from a monthly to an annual frequency
tos_yearly = ds.tos.resample(time="AS")
tos_yearly

<string>:6: FutureWarning: 'AS' is deprecated and will be removed in a future version. Please use 'YS' instead of 'AS'.

DataArrayResample, grouped over '__resample_dim__'
15 groups with labels 2000-01-01, 00:00:00, ..., 201....

# calculate the global mean of the resampled data
annual_mean = tos_yearly.mean()
annual_mean_global = annual_mean.mean(dim=["lat", "lon"])
annual_mean_global.plot()

[<matplotlib.lines.Line2D at 0x7f6ed946f7c0>]

../../../_images/b8650176c4ef0852827a1c58f7ff8bf61f13fb76a1e5317aa219be1fa231685e.png

Section 1.4: Compare the Resampling Methods#

Now that we’ve tried multiple resampling methods on different temporal resolutions, we can compare the resampled datasets to the original.

original_global = ds.mean(dim=["lat", "lon"])

original_global.tos.plot(size=6)
coarse_data.tos.plot()
tos_m_avg_global.plot()
annual_mean_global.plot()


plt.legend(
    [
        "original data (monthly)",
        "coarsened (4 months)",
        "moving average (6 months)",
        "annually resampled (12 months)",
    ]
)

<matplotlib.legend.Legend at 0x7f6ec7f7a220>

../../../_images/8293a321baf8ee6a83d1de2ac633ed83528aa41dce493d16995ffcddd32569a4.png

Questions 1.4: Climate Connection#

What type of information can you obtain from each time series?
In what scenarios would you use different temporal resolutions?

Click for solution

Submit your feedback#

Summary#

In this tutorial, we’ve explored Xarray tools to simplify and understand climate data better. Given the complexity and variability of climate data, tools like .resample(), .rolling(), and .coarsen() come in handy to make the data easier to compare and find long-term trends. You’ve also looked at valuable techniques like calculating moving averages.

Resources#

Code and data for this tutorial is based on existing content from Project Pythia.

Tutorial 7: Other Computational Tools in Xarray

Contents

Tutorial 7: Other Computational Tools in Xarray#

#

#

Tutorial Objectives#

Setup#

Install and import feedback gadget#

Figure Settings#

Video 1: Carbon Cycle and the Greenhouse Effect#

Submit your feedback#

Submit your feedback#

Section 1: High-level Computation Functionality#

Section 1.1: Resampling Data#

Section 1.2: Moving Average#

Section 1.3: Coarsening the Data#

Section 1.4: Compare the Resampling Methods#

Questions 1.4: Climate Connection#

Submit your feedback#

Summary#

Resources#