Future Climate Analogue Mapping -- Notes | ||||||||||||||||||||||
Data
There are two basic data sets used in the analogue mapping—present
and future. The present climate is represented by the CRU CL
2.0 data set (Climate Research Unit, University of East Anglia
http://www.cru.uea.ac.uk/cru/data/hrg/).
These data consist of 1961-1990 long-term averages on a
10-min grid. The data
were regridded onto a 10-km grid (Lambert Azimuthal Equal-Area
projection, centered at 100E and 50N) coregistered with the USGS
Seasonal Land Cover 1-km data set for North America.
The interpolation was done via locally weighted trend-surface
regression, with elevation as a covariate (thereby producing a
topographically corrected interpolated values).
There are 218,882 non-ice-covered grid cells for North
America.
Average monthly temperature and precipitation were used in the
analogue calculations, and, along with monthly percent-possible
sunshine, were also used to create a set of 40 “bioclimatic”
variables (e.g. growing degree-days, the Priestley-Taylor moisture
index "alpha" (the ratio of actual equilibrium evapotranspiration to
potential equilibrium evapotranspiration or AE/PE), etc.) using the
Cramer-Prentice approach for the moisture-balance calculations.
Future climates are represented by the WCRP CMIP-3 climate
simulations done as part of the IPCC Fourth Assessment (http://www-pcmdi.llnl.gov/ipcc/about_ipcc.php) For this
demonstration, output was used from two models, the NCAR Community
Climate System Model 3 (CCSM3) and UK Met Office Hadley Center
Climate Model 3 (HadCM3) for the SRES A2 emissions scenarios.
Simulated “anomalies,” or the differences between the
1961-1990 “20th-century control” simulation averages and decadal
averages for two 21st-century intervals (2040-2049 and 2090-2099)
were calculated over each model’s “native” grid. These
anomalies were then
interpolated onto the North American 10-km grid, and added to the regridded CRU CL 2.0 long-term averages.
This procedure produces 10-km data sets for the middle and
end of the 21-st century for each emissions scenario/climate model
combination.
Bioclimatic variables for the future climate data sets were obtained
in the same fashion as for the “present” climate data set.
Climate simulations for other SRES emissions scenarios will
be included later.
The climate data were stored as netCDF files (http://www.unidata.ucar.edu/software/netcdf/), which can be opened and displayed using Panoply (http://www.giss.nasa.gov/tools/panoply/
). A single monthly
temperature or precipitation netCDF file is 62 Mbytes, while one
containing the values for 40 bioclimatic variables is 173 Mbytes. |
||||||||||||||||||||||
Analogue Calculations
Analogues are displayed here using statistical distance or dissimilarity
measures, where low distances or dissimilarities indicate similar or
analogous climates. For each
particular target point, four sets of analogues were obtained for each
combination of climate scenario (and time) and choice of
analogue-calculation parameters (see below):
1) “present vs. future”
analogues that show the dissimilarity between the present climate at a
target point and the future climates over the “field” of grid points;
these show where the present climate of the target point will occur in
the future; 2) “future vs. present” analogues that show the
dissimilarity between the future climate at a target point and the
present climate over the field of grid points; these show where the
future climate at the target point occurs at present; 3) “present vs.
present” analogues that show the locations with present-day climates
similar to those at the target point; and 4) “future vs. future”
analogues that show the same thing under a particular future climate
scenario. These last two
analogue patterns describe how unique or common the climate at a target
point is at present, and how that pattern may change in the future.
Each set of four dissimilarity-value maps were also stored as
netCDF files, about 31 Mbytes in size. | ||||||||||||||||||||||
Analogue Bases
The calculation of dissimilarities between climates at different
locations or times requires the specification of a particular set of
climate variables to use. Analogues
could be expressed, for example, in terms of temperature alone, moisture
alone, temperature and moisture, and so on, where the specific set of
variables used is referred to here as an “analogue basis.” Six analogue bases are used here:
|
||||||||||||||||||||||
Transformation of Variables
The individual climate variables
have several different of kinds distributions, ranging from those that
are nearly normal (e.g. temperature variables) to those that are
positively skewed (long right tail, e.g. precipitation), to those with
unusually shaped distributions (e.g. AE/PE, which is negatively skewed,
i.e., with a long left tail).
Skewness influences the calculation of analogues by giving
observations in the tails of skewed distributions disproportionally large
(e.g. in the case of the upper tail of positively skewed distributions) contributions to the
dissimilarity values, and those in the opposite tail disproportionally
small contributions.
Individual dissimilarity values may therefore be influenced more by
where an observation of a particular climate variable falls under its
distribution than by practical differences in the climates of two
locations.
Consequently, the Box-Cox
transformation, a variance-stabilizing power transformation, was used to
transform the individual variables.
The transformation parameter, lambda, was estimated by maximum
likelihood for each variable; this has the practical interpretation of
attempting to transform the distribution of each variable toward the
normal distribution. Lambda
values of 1.0 involve no transformation, 0.5 and 0.3333
amount to the square-root and
cube-root transformation, and a value of 0.0 essentially gives the
logarithmic transformation.
Negatively skewed distributions, like those of AE/PE, are transformed
toward the normal by lambda values > 1.0.
As is common practice, we adopted easily interpretable values,
like 0.5 or 0.3333, in effect “rounding” the maximum likelihood values.
The histogram on the left below
shows the distribution of January precipitation, while that on the right
shows that for transformed January precipitation with lambda = 0.3333,
(i.e. the commonly used “cube-root” transformation for precipitation).
For comparison, analogues were also calculated using
untransformed variables. | ||||||||||||||||||||||
Dissimilarity Measures
Two dissimilarity measures were
used in this demonstration:
1) the widely used Euclidian-distance measure, and 2) the Mahalanobis
distance, a statistical distance measure that takes into account the
covariance among the variables.
Many of the variables (e.g. the monthly temperature variables, or
GDD5 and MTWA), are highly correlated, and in a sense contribute
redundant information to dissimilarity measures like the Euclidian
distance. The Mahalanobis
distance can be thought of as an Euclidian-distance like measure, where
the contributions of the individual variables to the distance are
weighted by the elements of the inverse of the covariance matrix.
The scatterplot on the left below shows the values of January and
July temperature, with the Euclidian distance between each point and the
centroid of the two variables indicated by the size of circle
representing each point, while the scatterplot on the right shows the
same thing for the Mahalanobis distances.
(Note; the obvious moiré
pattern on the scatterplot on the left is created by the
rasterization of the image.) The Mahalanobis distances can be thought of
as the distance to the centroid measured across the isoprobability
contours of a bivariate normal distribution fit to the data (shown in
red). Other dissimilarity
measures could also be considered, like the Minkowski, or city-block
distance. |
||||||||||||||||||||||
Analogue Maps
An important issue is determining
what constitutes an “analogue.”
One way of skirting this issue is to plot the analogues
(dissimilarity values) on a continuous scale, but this would still
require a user to choose some kind of intuitive threshold value to avoid
distraction by low-analogue medium-dissimilarity value points.
The alternative of adopting some kind of single-value threshold
is also unsatisfactory, because information on potential gradients in
dissimilarities will be lost.
For this demonstration, a strict a and more liberal threshold was
used in creating the maps.
The distribution of dissimilarity values created by comparing
present-day observed (CRU) climate at each
point with those of all of the other points was estimated by 5 million
random comparisons between the climate values at individual points (there are 48 x 10^9 total potential comparisons) for
each analogue basis and transformation selection. The 1st
and 5th percentile values were selected as indicators of strong and
weak (or less-strong) analogues.
The histogram below shows the random comparisons within the CRU
10-km data set for a set of bioclimatic variables (i.e. analogue-basis
4), with the 1st
and 5th percentile values shaded as dark and light red, respectively.
These values were used in creating the analogue maps. |
||||||||||||||||||||||
[home] [directions] [Analogue Selector Page] | ||||||||||||||||||||||
|
||||||||||||||||||||||