Model-Data Confrontation in the Spectral Domain#

In this notebook, we demonstrate the spectral analysis features of Pyleoclim by reproducing the results of Zhu et al. (2019). The goal is to compare the climate variability across scales simulated by climate models with that observed in proxy records.

Data Exploration#

Let’s start by importing the packages that we will need:

import pandas as pd
import numpy as np
import xarray as xr
import pyleoclim as pyleo
pyleo.set_style('web')  # set the visual style

PMIP3 last millennium simulations#

The PMIP3 (Braconnot et al. 2012) simulations of global mean surface temperature (GMST) over the past millennium (past1000) are stored in a text file and can be conveniently imported with Pandas.

# load the raw data
df = pd.read_table('./data/PMIP3_GMST.txt')

# display the raw data
df
Year bcc_csm1_1 CCSM4 FGOALS_gl FGOALS_s2 IPSL_CM5A_LR MPI_ESM_P CSIRO GISS-E2-R_r1i1p121 GISS-E2-R_r1i1p127 ... CESM_member_1 CESM_member_2 CESM_member_3 CESM_member_4 CESM_member_5 CESM_member_6 CESM_member_7 CESM_member_8 CESM_member_9 CESM_member_10
0 850 -0.570693 -0.431830 NaN -0.620995 -0.475963 -0.170230 NaN 0.116333 0.155407 ... 0.036672 0.067692 0.085340 -0.000616 0.157021 0.048458 0.038173 -0.027151 0.143404 -0.053464
1 851 -0.698903 -0.411177 NaN -0.753160 -0.742970 -0.303124 -0.398695 0.068174 0.210337 ... 0.246816 0.181400 0.251417 0.170710 0.165139 0.324856 0.191677 0.120951 0.216921 0.068698
2 852 -0.575440 -0.404802 NaN -0.743508 -0.758939 -0.422623 -0.406343 0.060088 0.240585 ... 0.187429 0.065922 0.190229 0.264551 0.092629 0.386593 0.068904 0.292246 0.101564 0.200259
3 853 -0.724757 -0.552719 NaN -0.869331 -0.746460 -0.335177 -0.353557 -0.074396 0.030596 ... 0.202443 0.089054 -0.031298 0.205805 0.049447 0.023312 -0.041356 0.206064 0.212954 0.288272
4 854 -0.724328 -0.734938 NaN -0.826238 -0.684093 -0.650792 -0.416140 -0.402800 -0.330589 ... 0.062795 0.137882 -0.233049 -0.227240 -0.156577 -0.339176 -0.103825 0.058420 -0.006102 -0.006619
... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ... ...
1161 2011 1.013544 NaN NaN NaN NaN NaN NaN NaN NaN ... NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN
1162 2012 NaN NaN NaN NaN NaN NaN NaN NaN NaN ... NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN
1163 2013 NaN NaN NaN NaN NaN NaN NaN NaN NaN ... NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN
1164 2014 NaN NaN NaN NaN NaN NaN NaN NaN NaN ... NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN
1165 2015 NaN NaN NaN NaN NaN NaN NaN NaN NaN ... NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN

1166 rows × 22 columns

The file includes several ensemble members for the CESM and GISS simulations; we substitute the ensemble-mean series for each of them.

# create a new pandas.DataFrame to store the processed data
df_new = df.copy()

# remove the data columns for CESM and GISS ensemble members
for i in range(10):
    df_new = df_new.drop([f'CESM_member_{i+1}'], axis=1)
    
df_new = df_new.drop(['GISS-E2-R_r1i1p127.1'], axis=1)
df_new = df_new.drop(['GISS-E2-R_r1i1p127'], axis=1)
df_new = df_new.drop(['GISS-E2-R_r1i1p121'], axis=1)

# calculate the ensemble mean for CESM and GISS, and add the results into the table
df_new['CESM'] = df[[f'CESM_member_{i+1}' for i in range(10)]].mean(axis=1)

df_new['GISS'] = df[[
    'GISS-E2-R_r1i1p127.1',   
    'GISS-E2-R_r1i1p127',
    'GISS-E2-R_r1i1p121',
]].mean(axis=1)

# display the processed data
df_new
Year bcc_csm1_1 CCSM4 FGOALS_gl FGOALS_s2 IPSL_CM5A_LR MPI_ESM_P CSIRO HadCM3 CESM GISS
0 850 -0.570693 -0.431830 NaN -0.620995 -0.475963 -0.170230 NaN -0.620517 0.049553 0.127429
1 851 -0.698903 -0.411177 NaN -0.753160 -0.742970 -0.303124 -0.398695 -0.553043 0.193858 0.138796
2 852 -0.575440 -0.404802 NaN -0.743508 -0.758939 -0.422623 -0.406343 -0.560791 0.185033 0.098170
3 853 -0.724757 -0.552719 NaN -0.869331 -0.746460 -0.335177 -0.353557 -0.438949 0.120470 -0.054552
4 854 -0.724328 -0.734938 NaN -0.826238 -0.684093 -0.650792 -0.416140 -0.812194 -0.081349 -0.407169
... ... ... ... ... ... ... ... ... ... ... ...
1161 2011 1.013544 NaN NaN NaN NaN NaN NaN NaN NaN NaN
1162 2012 NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN
1163 2013 NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN
1164 2014 NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN
1165 2015 NaN NaN NaN NaN NaN NaN NaN NaN NaN NaN

1166 rows × 11 columns
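
As an aside, the same result can be obtained more compactly with pandas column filtering. The sketch below is equivalent to the loop above (df_alt is a hypothetical name):

# a sketch: build the ensemble means via column filtering instead of explicit drops
cesm_cols = [c for c in df.columns if c.startswith('CESM_member_')]
giss_cols = [c for c in df.columns if c.startswith('GISS-E2-R')]
df_alt = df.drop(columns=cesm_cols + giss_cols)
df_alt['CESM'] = df[cesm_cols].mean(axis=1)  # ensemble mean over the 10 CESM members
df_alt['GISS'] = df[giss_cols].mean(axis=1)  # ensemble mean over the 3 GISS members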

Pyleoclim capabilities can now be leveraged. We define a pyleoclim.Series object for each simulated GMST time series. A pyleoclim.Series represents a time series and comes with a collection of methods for spectral analysis, wavelet analysis, interpolation, plotting, and so on.

For details, see the documentation.

# store each pyleoclim.Series() object into a dictionary
ts_dict = {}
for name in df_new.columns[1:]:
    ts_dict[name] = pyleo.Series(
        time=df_new['Year'].values,  # the time axis
        value=df_new[name].values,   # the value axis
        label=name,                  # optional metadata: the nickname of the series
        time_name='Time',            # optional metadata: the name of the time axis
        time_unit='yrs',             # optional metadata: the unit of the time axis
        value_name='GMST anom.',     # optional metadata: the name of the value axis
        value_unit='K',              # optional metadata: the unit of the value axis
        verbose = False,             # suppresses warnings
    ) 

Once a pyleoclim.Series is defined, we can easily visualize it by calling the pyleoclim.Series.plot() method. For instance, we plot the CCSM4 GMST below:

fig, ax = ts_dict['CCSM4'].plot()

Note that the plot() method returns a tuple containing a matplotlib.figure.Figure and a matplotlib.axes.Axes, so all the usual matplotlib manipulations can follow. For instance, let’s change the y-axis limits and the legend label below.

fig, ax = ts_dict['CCSM4'].plot(label='CCSM4 series')
ax.set_ylim([-4, 2])
(-4.0, 2.0)

With the same mechanism, we may plot two time series in the same figure as follows, where the argument ax=ax specifies that we’d like to plot the GISS series into the same matplotlib.axes.Axes.

fig, ax = ts_dict['CCSM4'].plot()
ts_dict['GISS'].plot(ax=ax)  # the argument "ax=ax" indicates we'd like to plot into the "ax" we got from the previous line of code 
ax.set_ylim([-4, 2])
(-4.0, 2.0)

To plot a collection of time series at once, we define a pyleoclim.MultipleSeries object, which takes a list of pyleoclim.Series objects as input.

Since our pyleoclim.Series objects are stored in a dictionary, we first convert the dictionary into a list, and then use that list to define a pyleoclim.MultipleSeries object.

ts_list = list(ts_dict.values())  # gather the pyleo.Series objects from the dictionary into a list
ms_pmip = pyleo.MultipleSeries(ts_list)

Now that the pyleoclim.MultipleSeries called “ms_pmip” is defined, we can visualize all the time series at once by calling the pyleoclim.MultipleSeries.plot() method.

fig, ax = ms_pmip.plot()

You may notice that the legend is not in the best place, and we may want to move it to the right side. We can achieve that by passing a dictionary of arguments for matplotlib.axes.Axes.legend() (see the matplotlib documentation for details), as below:

fig, ax = ms_pmip.plot(lgd_kwargs={'bbox_to_anchor': (1, 1)})  # move the legend to the right side

Now that the loading of the PMIP3 simulations is complete, let’s move on to the proxies, the Last Millennium Reanalysis (LMR; Hakim et al. 2016; Tardif et al. 2019), and the deglacial simulations.

Proxies, LMR, and deglacial simulations#

We’ve preprocessed the data for proxies, LMR (Hakim et al., 2016; Tardif et al., 2019), and deglacial simulations, and stored them in a NetCDF file.

ds = xr.open_dataset('./data/PNAS19_data.nc')
ds
<xarray.Dataset> Size: 5MB
Dimensions:         (time: 48845)
Coordinates:
  * time            (time) float64 391kB -4.998e+06 -4.993e+06 ... 2.018e+03
Data variables:
    EDC             (time) float64 391kB ...
    HadCRUT4        (time) float64 391kB ...
    GAST            (time) float64 391kB ...
    ProbStack       (time) float64 391kB ...
    LMR             (time) float64 391kB ...
    trace21ka_full  (time) float64 391kB ...
    trace21ka_mwf   (time) float64 391kB ...
    trace21ka_orb   (time) float64 391kB ...
    trace21ka_ghg   (time) float64 391kB ...
    trace21ka_ice   (time) float64 391kB ...
    DGns            (time) float64 391kB ...
    SIM2bl          (time) float64 391kB ...

Next, we create two dictionaries called ts and vs to store the data extracted from the NetCDF file. ts includes the time axis for each dataset, and vs includes the GMST for each dataset.

We may print out the dictionary keys to see how many datasets we have.

ts, vs = {}, {}

for name in ds.variables:
    if name != 'time':
        ts[name] = ds[name].time
        vs[name] = ds[name].values

print(vs.keys())
dict_keys(['EDC', 'HadCRUT4', 'GAST', 'ProbStack', 'LMR', 'trace21ka_full', 'trace21ka_mwf', 'trace21ka_orb', 'trace21ka_ghg', 'trace21ka_ice', 'DGns', 'SIM2bl'])
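
As an aside, xarray distinguishes data variables from coordinates, so the same dictionaries can be built without the explicit check against 'time' (a minimal, equivalent sketch):

ts = {name: ds[name].time for name in ds.data_vars}   # ds.data_vars excludes the 'time' coordinate
vs = {name: ds[name].values for name in ds.data_vars}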

As before, we extract the data and organize them in Series objects:

for name in vs.keys():
    # assign dataset-specific metadata with the if-clauses
    if name == 'LMR':
        value_name = 'GMST anom.'
        value_unit = 'K'
    elif name in ['trace21ka_full', 'DGns', 'SIM2bl']:
        value_name = 'GMST'
        value_unit = 'K'
    else:
        value_name = 'Proxy Value'
        value_unit = None
        
    if name == 'trace21ka_full':
        label = 'TraCE-21ka'
    elif name in ['trace21ka_mwf', 'trace21ka_orb', 'trace21ka_ghg', 'trace21ka_ice']:
        continue
    else:
        label = name
        
    ts_dict[name] = pyleo.Series(
        time=ts[name],
        value=vs[name],
        label=label,
        time_name='Time',
        time_unit='yrs',
        value_name=value_name,
        value_unit=value_unit,
        verbose = False,
    )

Now we define a MultipleSeries object for each group of datasets:

ms_obs = pyleo.MultipleSeries(
    [ts_dict[name] for name in ['EDC', 'HadCRUT4', 'GAST', 'ProbStack']]
)
ms_deglacial = pyleo.MultipleSeries(
    [ts_dict[name] for name in ['trace21ka_full', 'DGns', 'SIM2bl']]
)

Now we visualize what we have. LMR first.

Note that, for simplicity and calculation speed, we use the median of the LMR ensemble for our analysis, whereas all ensemble members were analyzed in the original paper; the scaling slope value we estimate later will therefore differ slightly from the published one.

fig, ax = ts_dict['LMR'].plot()

Then the proxies.

fig, ax = ms_obs.plot()

We notice that the time axis is in units of years by default, which is odd for paleo-records that extend well before the Common Era. We can easily convert the time unit by calling the pyleoclim.MultipleSeries.convert_time_unit() method; below we convert to “ma” (millions of years ago):

ms_obs = ms_obs.convert_time_unit('ma')
fig, ax = ms_obs.plot()

Now the time unit is “myrs BP”, and the numerical time values are ascending. What if we wanted the right-hand side to represent more recent times? We can manipulate the returned matplotlib.axes.Axes object as mentioned earlier, or simply use the invert_xaxis=True argument of the pyleoclim.MultipleSeries.plot() method, as below:

fig, ax = ms_obs.plot(invert_xaxis=True)
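
For reference, the pure-matplotlib route mentioned above amounts to one extra call on the returned axes (an equivalent sketch):

fig, ax = ms_obs.plot()
ax.invert_xaxis()  # flip the x-axis so that more recent times sit on the right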

Similarly, we convert the time unit of the deglacial simulations to “kyrs BP” and then visualize them with the x-axis inverted and the legend moved to the right side.

ms_deglacial = ms_deglacial.convert_time_unit('kyrs BP')
fig, ax = ms_deglacial.plot(
    lgd_kwargs={
        'loc': 'upper right',         # put the legend anchor to the upper right corner
        'bbox_to_anchor': (1.25, 1),  # move the legend to the right side
    },
    invert_xaxis=True,
)

Now that all the needed data have been loaded, let’s perform spectral analysis using the Weighted Wavelet Z-transform method (WWZ; Foster 1996; Kirchner & Neal 2013), which can handle unevenly-spaced data without interpolation.

Spectral analysis#

A WWZ perspective#

We may perform spectral analysis on a time series by calling the pyleoclim.Series.spectral() method. Its method argument specifies which technique to use; it is set to wwz by default. Its freq argument specifies how the frequency vector for the analysis is generated; it is set to log by default, which generates the frequency vector in log space. Here, we set it to nfft so that we can reproduce the results of the original paper, Zhu et al. (2019). Other arguments specific to each spectral analysis method can be passed in through the settings argument. Since WWZ is originally a wavelet analysis method, we may pass tau, the evenly-spaced time points at which the wavelet transform is evaluated (the temporal resolution). Since our purpose here is spectral analysis, the temporal resolution need not be high, and small values accelerate the calculation. Please see the documentation of pyleoclim.Series.spectral and the wwz_psd function it calls for details.

The method returns a pyleoclim.PSD object, which includes the estimated power spectral density (PSD) along with its frequency axis; the object itself is intended for later operations such as visualization, scaling slope estimation, and significance testing.
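
To illustrate the claim that WWZ requires no interpolation, here is a minimal sketch on synthetic, unevenly sampled data (not part of the original analysis):

# a sketch: WWZ applied to an unevenly sampled sine wave
rng = np.random.default_rng(42)
t = np.sort(rng.choice(np.arange(2000), size=500, replace=False))  # uneven time axis
y = np.sin(2 * np.pi * t / 50) + rng.normal(0, 0.3, size=t.size)   # 50-yr cycle plus noise
ts_demo = pyleo.Series(time=t, value=y, label='synthetic', verbose=False)
psd_demo = ts_demo.spectral(method='wwz')  # works directly on the uneven time axis
fig, ax = psd_demo.plot()                  # expect a spectral peak near the 50-yr period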

Note that reproducing the exact results in the paper requires the settings in the cell below, which is slow to run (> 5 mins); we therefore save the output to a NetCDF file, so that for the sake of time the pre-calculated results can be loaded instead.

%%time

psd_wwz = {}
for name, ts in ts_dict.items():
    print(f'Processing {name} ...')
    print(f'Data length: {np.size(ts.time)}')
    if name in ['DGns', 'SIM2bl']:
        ntau = 51  # to accelerate the calculation; the smaller, the faster
    else:
        ntau = 501
    tau = np.linspace(np.min(ts.time), np.max(ts.time), ntau)
    psd_wwz[name] = ts.spectral(method='wwz', freq='nfft', settings={'tau': tau, 'standardize':False})
Processing bcc_csm1_1 ...
Data length: 1162
Processing CCSM4 ...
Data length: 1155
Processing FGOALS_gl ...
Data length: 999
Processing FGOALS_s2 ...
Data length: 1155
Processing IPSL_CM5A_LR ...
Data length: 1155
Processing MPI_ESM_P ...
Data length: 1155
Processing CSIRO ...
Data length: 1150
Processing HadCM3 ...
Data length: 1151
Processing CESM ...
Data length: 1155
Processing GISS ...
Data length: 1155
Processing EDC ...
Data length: 5785
Processing HadCRUT4 ...
Data length: 2015
Processing GAST ...
Data length: 2000
Processing ProbStack ...
Data length: 2051
Processing LMR ...
Data length: 2001
Processing trace21ka_full ...
Data length: 2204
Processing DGns ...
Data length: 15000
Processing SIM2bl ...
Data length: 21000
CPU times: user 19min 51s, sys: 1min 27s, total: 21min 19s
Wall time: 2min 39s
# we will store the result in a NetCDF file with the dataset names as variable names
import xarray as xr
da = {}
for name, psd in psd_wwz.items():
    da[name] = xr.DataArray(psd.amplitude, coords={f'freq_{name}': psd.frequency})

ds = xr.Dataset(da)
ds.to_netcdf('./data/PNAS19_psd.nc')
# quick loading of the pre-calculated data
ds = xr.open_dataset('./data/PNAS19_psd.nc')
ds
<xarray.Dataset> Size: 507kB
Dimensions:              (freq_bcc_csm1_1: 580, freq_CCSM4: 577,
                          freq_FGOALS_gl: 499, freq_FGOALS_s2: 577,
                          freq_IPSL_CM5A_LR: 577, freq_MPI_ESM_P: 577,
                          freq_CSIRO: 574, freq_HadCM3: 575, freq_CESM: 577,
                          freq_GISS: 577, freq_EDC: 2892, freq_HadCRUT4: 1007,
                          freq_GAST: 999, freq_ProbStack: 1025, freq_LMR: 1000,
                          freq_trace21ka_full: 1101, freq_DGns: 7499,
                          freq_SIM2bl: 10499)
Coordinates: (12/18)
  * freq_bcc_csm1_1      (freq_bcc_csm1_1) float64 5kB 0.001721 0.002582 ... 0.5
  * freq_CCSM4           (freq_CCSM4) float64 5kB 0.0008666 0.001733 ... 0.5
  * freq_FGOALS_gl       (freq_FGOALS_gl) float64 4kB 0.001002 0.002004 ... 0.5
  * freq_FGOALS_s2       (freq_FGOALS_s2) float64 5kB 0.0008666 0.001733 ... 0.5
  * freq_IPSL_CM5A_LR    (freq_IPSL_CM5A_LR) float64 5kB 0.0008666 ... 0.5
  * freq_MPI_ESM_P       (freq_MPI_ESM_P) float64 5kB 0.0008666 0.001733 ... 0.5
    ...                   ...
  * freq_GAST            (freq_GAST) float64 8kB 1e-06 1.5e-06 ... 0.0005
  * freq_ProbStack       (freq_ProbStack) float64 8kB 2.439e-07 ... 0.00025
  * freq_LMR             (freq_LMR) float64 8kB 0.0005 0.001 ... 0.4995 0.5
  * freq_trace21ka_full  (freq_trace21ka_full) float64 9kB 9.074e-05 ... 0.05
  * freq_DGns            (freq_DGns) float64 60kB 0.0001333 0.0002 ... 0.5
  * freq_SIM2bl          (freq_SIM2bl) float64 84kB 9.524e-05 0.0001429 ... 0.5
Data variables: (12/18)
    bcc_csm1_1           (freq_bcc_csm1_1) float64 5kB ...
    CCSM4                (freq_CCSM4) float64 5kB ...
    FGOALS_gl            (freq_FGOALS_gl) float64 4kB ...
    FGOALS_s2            (freq_FGOALS_s2) float64 5kB ...
    IPSL_CM5A_LR         (freq_IPSL_CM5A_LR) float64 5kB ...
    MPI_ESM_P            (freq_MPI_ESM_P) float64 5kB ...
    ...                   ...
    GAST                 (freq_GAST) float64 8kB ...
    ProbStack            (freq_ProbStack) float64 8kB ...
    LMR                  (freq_LMR) float64 8kB ...
    trace21ka_full       (freq_trace21ka_full) float64 9kB ...
    DGns                 (freq_DGns) float64 60kB ...
    SIM2bl               (freq_SIM2bl) float64 84kB ...
import pyleoclim as pyleo
# quick creation of the pyleoclim.PSD objects
psd_wwz = {}
for name in ds.variables:
    if 'freq' not in name:
        psd_wwz[name] = pyleo.PSD(
            frequency=ds[name][f'freq_{name}'],
            amplitude=ds[name].values,
            label=name,
        )

We may, however, run the WWZ method with the default settings for the frequency vector, which makes the calculation faster, and compare with the pre-calculated results. Note that the series would be standardized by default, but we don’t want that here, since we’d like to piece together the PSD curves in the same figure to investigate the scaling behavior.

%%time
psd_wwz_new = {}
for name, ts in ts_dict.items():
    print(f'Processing {name} ...')
    print(f'Data length: {np.size(ts.time)}')
    psd_wwz_new[name] = ts.spectral(method='wwz', settings={'standardize': False})
Processing bcc_csm1_1 ...
Data length: 1162
Processing CCSM4 ...
Data length: 1155
Processing FGOALS_gl ...
Data length: 999
Processing FGOALS_s2 ...
Data length: 1155
Processing IPSL_CM5A_LR ...
Data length: 1155
Processing MPI_ESM_P ...
Data length: 1155
Processing CSIRO ...
Data length: 1150
Processing HadCM3 ...
Data length: 1151
Processing CESM ...
Data length: 1155
Processing GISS ...
Data length: 1155
Processing EDC ...
Data length: 5785
Processing HadCRUT4 ...
Data length: 2015
Processing GAST ...
Data length: 2000
Processing ProbStack ...
Data length: 2051
Processing LMR ...
Data length: 2001
Processing trace21ka_full ...
Data length: 2204
Processing DGns ...
Data length: 15000
Processing SIM2bl ...
Data length: 21000
CPU times: user 3min 15s, sys: 24.2 s, total: 3min 39s
Wall time: 1min 1s

Now we compare the results.

import matplotlib.pyplot as plt
for k in psd_wwz_new.keys():
    fig, ax = psd_wwz_new[k].plot(figsize=[8, 3], label='default settings')
    psd_wwz[k].plot(ax=ax, label='Zhu et al. [2019]', color='red', alpha=1)
    ax.set_title(k)

We see that the differences are overall rather small. They are mainly caused by the frequency vector: the results of the paper used a linearly spaced frequency vector, while the current default settings use a log-spaced one. For the following presentation, we will stick with the pre-calculated results of the paper.
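
To see the difference concretely, the two flavors of frequency vector can be sketched with numpy, under the assumption that nfft mirrors np.fft.rfftfreq and log is roughly log-spaced between the Rayleigh and Nyquist frequencies:

# a sketch of the two frequency vectors for an annual series of length n
n, dt = 1155, 1.0
freq_lin = np.fft.rfftfreq(n, d=dt)[1:]                  # nfft-like: evenly spaced up to Nyquist
freq_log = np.geomspace(1 / (n * dt), 1 / (2 * dt), 60)  # log-like: evenly spaced in log(f)
print(freq_lin[:3])  # the linear grid puts most of its points in the highest frequency decade
print(freq_log[:3])  # the log grid spreads its points evenly across the decades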

Visualization of the PSDs#

Now let’s visualize the results. In the cells below, we first define a colormap, then specify a color for each pyleoclim.PSD object. Similar to pyleoclim.MultipleSeries, we may define a pyleoclim.MultiplePSD object for a collection of pyleoclim.PSD objects, enabling operations on all of them at once.

# define the tableau20 colors
tableau20 = [(31, 119, 180), (174, 199, 232), (255, 127, 14), (255, 187, 120),    
             (44, 160, 44), (152, 223, 138), (214, 39, 40), (255, 152, 150),    
             (148, 103, 189), (197, 176, 213), (140, 86, 75), (196, 156, 148),    
             (227, 119, 194), (247, 182, 210), (127, 127, 127), (199, 199, 199),    
             (188, 189, 34), (219, 219, 141), (23, 190, 207), (158, 218, 229)]    
  
# scale the RGB values to the [0, 1] range, which is the format matplotlib accepts.    
for i in range(len(tableau20)):    
    r, g, b = tableau20[i]    
    tableau20[i] = (r / 255., g / 255., b / 255.) 
# define a dictionary for the colors
clr_dict = {
    'EDC': tableau20[0],
    'HadCRUT4': tableau20[3],
    'GAST': tableau20[4],
    'ProbStack': tableau20[5],
    'LMR': tableau20[6],
}

# specify color for each pyleoclim.PSD objects
for k, v in clr_dict.items():
    psd_wwz[k].plot_kwargs = {'color': v}
    
# for the period axis customization later
period_ticks = [0.5, 1, 2, 5, 10, 20, 100, 1e3, 1e4, 1e5, 1e6]
period_ticklabels = ['0.5', '1', '2', '5', '10', '20', '100', '1 k', '10 k', '100 k', '1 m']

# define the pyleoclim.MultiplePSD object and visualize the several pyleoclim.PSD objects at once
mpsd_obs = pyleo.MultiplePSD([psd_wwz[name] for name in ['EDC', 'HadCRUT4', 'GAST', 'ProbStack', 'LMR']])
fig, ax = mpsd_obs.plot(figsize=[8, 4])
ax.set_xlim([1e7, 0.1])
ax.set_ylim([1e-4, 1e8])
ax.set_xticks(period_ticks)
ax.set_xticklabels(period_ticklabels)
ax.set_ylabel('Spectral Density')
fig.savefig('./figs/pnas19_fig1.png', dpi=300, bbox_inches='tight', facecolor='white')
# pyleo.savefig(fig, './figs/pnas19_fig1.pdf')

We have reproduced Fig. 1 of the original paper above.

To reproduce the upper panel of Fig. 2, we reset the colors for the observations to grey, and set the opacity via alpha as well as the line width via linewidth below. Note that the colors for the PMIP3 simulations follow the 'tab10' colormap specified in the plotting call.

clr_dict = {
    'EDC': 'grey',
    'HadCRUT4': 'grey',
    'GAST': 'grey',
    'ProbStack': 'grey',
    'LMR': 'grey'
}
for k, v in clr_dict.items():
    psd_wwz[k].plot_kwargs = {'color': v, 'alpha': 0.3, 'linewidth': 1.5}
    
mpsd_obs = pyleo.MultiplePSD([psd_wwz[name] for name in ['EDC', 'HadCRUT4', 'GAST', 'ProbStack', 'LMR']])

period_ticks = [0.5, 1, 2, 5, 10, 20, 100, 1000, 10000, 100000]
period_ticklabels = ['0.5', '1', '2', '5', '10', '20', '100', '1 k', '10 k', '100 k']

pmip_names = ['bcc_csm1_1', 'CCSM4', 'FGOALS_gl', 'FGOALS_s2', 'IPSL_CM5A_LR', 'MPI_ESM_P', 'CSIRO', 'HadCM3', 'CESM', 'GISS']
    
mpsd_pmip = pyleo.MultiplePSD([psd_wwz[name] for name in pmip_names])
fig, ax = mpsd_pmip.plot(figsize=[8, 4], cmap='tab10', lgd_kwargs={'bbox_to_anchor': (1, 1)})
mpsd_obs.plot(ax=ax, legend=False)
ax.set_xlim([1e7, 0.1])
ax.set_ylim([1e-4, 1e8])
ax.set_xticks(period_ticks)
ax.set_xticklabels(period_ticklabels)
ax.set_ylabel('Spectral Density')
Text(0, 0.5, 'Spectral Density')

Similarly, we reproduce the lower panel of Fig. 2 of the original paper as below:

clr_deglacial_dict = {
    'trace21ka_full': tableau20[6],
    'DGns': tableau20[4],
    'SIM2bl': tableau20[0],
}
for k, v in clr_deglacial_dict.items():
    psd_wwz[k].plot_kwargs = {'color': v}


period_ticks = [0.5, 1, 2, 5, 10, 20, 100, 1000, 10000, 100000]
period_ticklabels = ['0.5', '1', '2', '5', '10', '20', '100', '1 k', '10 k', '100 k']

mpsd_deglacial = pyleo.MultiplePSD([psd_wwz[name] for name in ['trace21ka_full', 'DGns', 'SIM2bl']])
fig, ax = mpsd_deglacial.plot(figsize=[8, 4])
mpsd_obs.plot(ax=ax, legend=False)
ax.set_xlim([1e7, 0.1])
ax.set_ylim([1e-4, 1e8])
ax.set_xticks(period_ticks)
ax.set_xticklabels(period_ticklabels)
ax.set_ylabel('Spectral Density')
Text(0, 0.5, 'Spectral Density')

Estimation of the scaling exponents#

You may notice that something is missing when comparing our reproduced figures to those in the original paper – the scaling exponents.

Below, we use the pyleoclim.PSD.beta_est() method to estimate the scaling exponent for each dataset. To do that, we need to specify the frequency range over which to estimate; the estimation is performed by linear regression in log-log space. Since the frequency vector we used is nfft, which is defined in linear space, the frequency points are denser over the high-frequency band and coarser over the low-frequency band, so binning is needed prior to the linear regression; this is controlled by the logf_binning_step argument. While its default is max, which means using the largest spacing of the frequency vector for binning, here we use the first spacing, as per the original paper.

Note that we estimate exponents over two scaling regimes, with a break at 400 yrs, for the deglacial simulations.
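
To make the procedure concrete, here is a minimal numpy sketch of a log-binned, log-log regression on a synthetic power law; it illustrates the idea rather than Pyleoclim's exact implementation:

# a sketch: recover beta from a synthetic PSD obeying S(f) ~ f^(-beta), with true beta = 1
rng = np.random.default_rng(0)
freq = np.linspace(1/1000, 0.5, 500)                      # linearly spaced, as with nfft
psd = freq**-1.0 * np.exp(rng.normal(0, 0.1, freq.size))  # power law with multiplicative noise

# bin onto a log-spaced grid first: a linear grid over-weights the high-frequency band
bins = np.linspace(np.log10(freq[0]), np.log10(freq[-1]), 30)
idx = np.digitize(np.log10(freq), bins)
logf = np.array([np.log10(freq[idx == i]).mean() for i in np.unique(idx)])
logS = np.array([np.log10(psd[idx == i]).mean() for i in np.unique(idx)])

slope, _ = np.polyfit(logf, logS, 1)     # linear regression in log-log space
print(f'estimated beta = {-slope:.2f}')  # should be close to 1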

# define frequency range for the exponent estimation
franges = {
    'EDC': [1/50000, 1/1500],
    'HadCRUT4': [1/50, 6],
    'GAST': [1/100000, 1/2000],
    'ProbStack': [1/100000, 1/10000],
    'LMR': [1/1000, 1/2],
}

# for PMIP simulations, we estimate the scaling slope over 2-500 yrs
for name in pmip_names:
    franges[name] = [1/500, 1/2]

psd_wwz_beta = {}
for name, frange in franges.items():
    psd_wwz_beta[name] = psd_wwz[name].beta_est(fmin=frange[0], fmax=frange[-1], logf_binning_step='first')
    
# for deglacial model simulations, we have two scaling regimes, one over 20-400 yrs, and another over 400-2000 yrs
s_break = 400
franges_s = {
    'trace21ka_full': [1/s_break, 1/21],  # note that for TraCE-21ka, the slope is estimated over 21-400 yrs due to its temporal resolution 
    'DGns': [1/s_break, 1/20],
    'SIM2bl': [1/s_break, 1/20],
}
franges_l = {
    'trace21ka_full': [1/2000, 1/s_break],
    'DGns': [1/2000, 1/s_break],
    'SIM2bl': [1/2000, 1/s_break],
}

psd_wwz_beta_s = {}
for name, frange in franges_s.items():
    psd_wwz_beta_s[name] = psd_wwz[name].beta_est(fmin=frange[0], fmax=frange[-1], logf_binning_step='first')
    
psd_wwz_beta_l = {}
for name, frange in franges_l.items():
    psd_wwz_beta_l[name] = psd_wwz[name].beta_est(fmin=frange[0], fmax=frange[-1], logf_binning_step='first')

Now we re-plot the figures with the estimated scaling exponents displayed in the legend and visualized via straight lines in the figure. Below is for Fig. 1.

clr_dict = {
    'EDC': tableau20[0],
    'HadCRUT4': tableau20[3],
    'GAST': tableau20[4],
    'ProbStack': tableau20[5],
    'LMR': tableau20[6],
}

for k, v in clr_dict.items():
    psd_wwz_beta[k].plot_kwargs = {'color': v}
    
period_ticks = [0.5, 1, 2, 5, 10, 20, 100, 1e3, 1e4, 1e5, 1e6]
period_ticklabels = ['0.5', '1', '2', '5', '10', '20', '100', '1 k', '10 k', '100 k', '1 m']

mpsd_obs = pyleo.MultiplePSD([psd_wwz_beta[name] for name in ['EDC', 'HadCRUT4', 'GAST', 'ProbStack', 'LMR']])
fig, ax = mpsd_obs.plot(figsize=[8, 4])
ax.legend(bbox_to_anchor=(1.1, 1))
ax.set_xlim([1e7, 0.1])
ax.set_ylim([1e-4, 1e8])
ax.set_xticks(period_ticks)
ax.set_xticklabels(period_ticklabels)
ax.set_ylabel('Spectral Density')
Text(0, 0.5, 'Spectral Density')

Then the upper panel of Fig. 2.

clr_dict = {
    'EDC': 'grey',
    'HadCRUT4': 'grey',
    'GAST': 'grey',
    'ProbStack': 'grey',
    'LMR': 'grey'
}
period_ticks = [0.5, 1, 2, 5, 10, 20, 100, 1e3, 1e4, 1e5]
period_ticklabels = ['0.5', '1', '2', '5', '10', '20', '100', '1 k', '10 k', '100 k']

for k, v in clr_dict.items():
    psd_wwz[k].plot_kwargs = {'color': v, 'alpha': 0.2, 'linewidth': 1.5}
    
mpsd_obs = pyleo.MultiplePSD([psd_wwz[name] for name in ['EDC', 'HadCRUT4', 'GAST', 'ProbStack', 'LMR']])


mpsd_pmip = pyleo.MultiplePSD([psd_wwz_beta[name] for name in pmip_names])
fig, ax = mpsd_pmip.plot(figsize=[10, 6], cmap='tab10')
ax.legend(bbox_to_anchor=(1, 1))
mpsd_obs.plot(ax=ax, legend=False)

ax.set_xlim([1e6, 0.1])
ax.set_ylim([1e-4, 1e8])
ax.set_xticks(period_ticks)
ax.set_xticklabels(period_ticklabels)
ax.set_ylabel('Spectral Density')
Text(0, 0.5, 'Spectral Density')

… and the lower panel of Fig. 2. Note that since here we visualize two scaling regimes, we have to do it manually with the PSD.beta_est_res attribute.

clr_dict = {
    'EDC': 'grey',
    'HadCRUT4': 'grey',
    'GAST': 'grey',
    'ProbStack': 'grey',
    'LMR': 'grey'
}
for k, v in clr_dict.items():
    psd_wwz[k].plot_kwargs = {'color': v, 'alpha': 0.2, 'linewidth': 1.5}
    
mpsd_obs = pyleo.MultiplePSD([psd_wwz[name] for name in ['EDC', 'HadCRUT4', 'GAST', 'ProbStack', 'LMR']])

clr_deglacial_dict = {
    'trace21ka_full': tableau20[6],
    'DGns': tableau20[4],
    'SIM2bl': tableau20[0],
}
for k, v in clr_deglacial_dict.items():
    psd_wwz[k].plot_kwargs = {'color': v}

period_ticks = [0.5, 1, 2, 5, 10, 20, 100, 1e3, 1e4, 1e5]
period_ticklabels = ['0.5', '1', '2', '5', '10', '20', '100', '1 k', '10 k', '100 k']

fig, ax = mpsd_deglacial.plot(figsize=[10, 6])
mpsd_obs.plot(ax=ax, legend=False)
ax.set_xlim([1e6, 0.1])
ax.set_ylim([1e-4, 1e8])
ax.set_xticks(period_ticks)
ax.set_xticklabels(period_ticklabels)

labels = ax.get_legend_handles_labels()[-1]
new_labels = []
i = 0
for name in ['trace21ka_full', 'DGns', 'SIM2bl']:
    res_s = psd_wwz_beta_s[name].beta_est_res
    res_l = psd_wwz_beta_l[name].beta_est_res
    ax.plot(1/res_s['f_binned'], res_s['Y_reg'], linestyle='--', color='k', linewidth=1, zorder=99)
    ax.plot(1/res_l['f_binned'], res_l['Y_reg'], linestyle='--', color='k', linewidth=1, zorder=99)
    beta_s_str = r'$\beta_{DC}$'
    beta_s = res_s['beta']
    err_s = res_s['std_err']
    beta_l_str = r'$\beta_{CM}$'
    beta_l = res_l['beta']
    err_l = res_l['std_err']
    new_labels.append(fr'{labels[i]} ({beta_l_str}$=${beta_l:.2f}$\pm${err_l:.2f}; {beta_s_str}$=${beta_s:.2f}$\pm${err_s:.2f})')
    i += 1

ax.legend(labels=new_labels, loc='upper right', bbox_to_anchor=(1.1, 1))
ax.set_ylabel('Spectral Density')
Text(0, 0.5, 'Spectral Density')

References#

  • Braconnot, P., Harrison, S. P., Kageyama, M., Bartlein, P. J., Masson-Delmotte, V., Abe-Ouchi, A., et al. (2012). Evaluation of climate models using palaeoclimatic data. Nature Climate Change, 2(6), 417–424. https://doi.org/10.1038/nclimate1456

  • Foster, G. (1996). Wavelets for period analysis of unevenly sampled time series. The Astronomical Journal, 112, 1709. https://doi.org/10.1086/118137

  • Hakim, G. J., Emile-Geay, J., Steig, E. J., Noone, D., Anderson, D. M., Tardif, R., et al. (2016). The last millennium climate reanalysis project: Framework and first results. Journal of Geophysical Research: Atmospheres, 121(12), 6745–6764. https://doi.org/10.1002/2016JD024751

  • Kirchner, J. W., & Neal, C. (2013). Universal fractal scaling in stream chemistry and its implications for solute transport and water quality trend detection. Proceedings of the National Academy of Sciences, 110(30), 12213–12218. https://doi.org/10.1073/pnas.1304328110

  • Tardif, R., Hakim, G. J., Perkins, W. A., Horlick, K. A., Erb, M. P., Emile-Geay, J., et al. (2019). Last Millennium Reanalysis with an expanded proxy database and seasonal proxy modeling. Climate of the Past, 15(4), 1251–1273. https://doi.org/10.5194/cp-15-1251-2019

  • Zhu, F., Emile-Geay, J., McKay, N. P., Hakim, G. J., Khider, D., Ault, T. R., Steig, E. J., Dee, S., Kirchner, J. W. (2019). Climate models can correctly simulate the continuum of global-average temperature variability. Proceedings of the National Academy of Sciences, 201809959. https://doi.org/10.1073/pnas.1809959116