Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Heat wave magnitude indicator #1926

Merged
merged 59 commits into from
Oct 7, 2024
Merged
Show file tree
Hide file tree
Changes from 45 commits
Commits
Show all changes
59 commits
Select commit Hold shift + click to select a range
9c77c4f
Add heat wave magnitude indicator definition
seblehner Sep 16, 2024
cdd1e05
Add heat wave magnitude indicator function
seblehner Sep 16, 2024
f73936c
Add windowed_run_sum and rse function
seblehner Sep 16, 2024
d35ca73
Add wrapping functions for ufunc implementation of windowed_run_sum
seblehner Sep 16, 2024
0b55cc6
Add test for heat_wave_magnitude
seblehner Sep 16, 2024
1e0e799
Add new functions to CHANGELOG.rst
seblehner Sep 16, 2024
6c5e3e2
Rename windowed_run_sum to windowed_max_run_sum for explicity
seblehner Sep 16, 2024
394e73f
Add credentials to AUTHORS.rst and .zenodo.json
seblehner Sep 16, 2024
904c32d
Add heat_wave_magnitude to __all__
seblehner Sep 16, 2024
302fe27
Add more tests for heat_wave_magnitude
seblehner Sep 16, 2024
b49d644
Add tests for rse and windowed_max_run_sum
seblehner Sep 16, 2024
9568734
Add heat wave magnitude indicator definition
seblehner Sep 16, 2024
56693c1
Add heat wave magnitude indicator function
seblehner Sep 16, 2024
ccbb3a2
Add windowed_run_sum and rse function
seblehner Sep 16, 2024
95c57a8
Add wrapping functions for ufunc implementation of windowed_run_sum
seblehner Sep 16, 2024
51dd852
Add test for heat_wave_magnitude
seblehner Sep 16, 2024
d5ab1f6
Add new functions to CHANGELOG.rst
seblehner Sep 16, 2024
eb1628d
Rename windowed_run_sum to windowed_max_run_sum for explicity
seblehner Sep 16, 2024
a1b59bd
Add credentials to AUTHORS.rst and .zenodo.json
seblehner Sep 16, 2024
6eb6394
Add heat_wave_magnitude to __all__
seblehner Sep 16, 2024
532da97
Add more tests for heat_wave_magnitude
seblehner Sep 16, 2024
1aa9c32
Add tests for rse and windowed_max_run_sum
seblehner Sep 16, 2024
8b6d13e
Merge branch 'heat-wave-magnitude-indicator' of github.com:seblehner/…
seblehner Sep 16, 2024
9f25215
Change heat_wave_magnitude output unit handling
seblehner Sep 17, 2024
6387cb7
Fix incorrect units for heat_wave_magnitude
seblehner Sep 17, 2024
acdb62a
Change out unit handling to integral
seblehner Sep 17, 2024
64a167c
Add PR number to changelog
seblehner Sep 17, 2024
0850be1
Fix expected value for TestHeatWaveMagnitude test_simple
seblehner Sep 17, 2024
9c6be58
Add TODO entry for french translation to fr.json
seblehner Sep 17, 2024
5915b8a
Replace multi line where call with clip
seblehner Sep 17, 2024
dda8d1c
Shorten rle call input
seblehner Sep 18, 2024
4ae225e
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Sep 18, 2024
c078d7c
Fix windowed run sum for window == 1
seblehner Sep 18, 2024
b31c10d
Add tests for small window lengths
seblehner Sep 18, 2024
729aae2
Remove ufunc and 1d placeholder code
seblehner Sep 18, 2024
1cfe7b1
Add if to rle dtype conversion
seblehner Sep 18, 2024
26663a1
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Sep 18, 2024
8c4f91b
Merge branch 'main' into heat-wave-magnitude-indicator
Zeitsperre Sep 18, 2024
e80f698
Fix error in description
seblehner Sep 18, 2024
62b24d6
Enhance indicator description
seblehner Sep 18, 2024
8d9ab44
Generalize rle and delete rse
seblehner Sep 18, 2024
2cffc3d
Merge branch 'heat-wave-magnitude-indicator' of github.com:seblehner/…
seblehner Sep 18, 2024
bec0296
Add french translation
seblehner Sep 18, 2024
232e68f
Expand indicator description
seblehner Sep 18, 2024
0a57618
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Sep 18, 2024
f61ab29
Update xclim/indicators/atmos/_temperature.py
Zeitsperre Sep 20, 2024
35b868a
[pre-commit.ci] auto fixes from pre-commit.com hooks
pre-commit-ci[bot] Sep 20, 2024
7b254ce
Merge branch 'main' into heat-wave-magnitude-indicator
coxipi Sep 20, 2024
c6f98a6
heat_wave_magnitude -> hot_spell_max_magnitude
coxipi Sep 23, 2024
bcbabc2
remove abbreviation in indicator title
coxipi Sep 23, 2024
f632944
change test names
coxipi Sep 23, 2024
774b3ae
Remove rse tests
seblehner Sep 28, 2024
a148027
Merge branch 'main' into heat-wave-magnitude-indicator
Zeitsperre Sep 28, 2024
82254d8
Doc to better explain what rle does
aulemahal Oct 1, 2024
11c733e
Merge branch 'main' into heat-wave-magnitude-indicator
aulemahal Oct 2, 2024
b72ef37
add references
coxipi Oct 3, 2024
aa6f74c
remove ufunc_1dim option
coxipi Oct 3, 2024
e28db54
Merge branch 'main' into heat-wave-magnitude-indicator
coxipi Oct 3, 2024
bfa2576
Merge branch 'main' into heat-wave-magnitude-indicator
coxipi Oct 5, 2024
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
5 changes: 5 additions & 0 deletions .zenodo.json
Original file line number Diff line number Diff line change
Expand Up @@ -148,6 +148,11 @@
"name": "Wang, Hui-Min",
"affiliation": "National University of Singapore, Singapore, Singapore",
"orcid": "0000-0002-5878-7542"
},
{
"name": "Lehner, Sebastian",
"affiliation": "GeoSphere Austria, Vienna, Austria",
"orcid": "0000-0002-7562-8172"
}
],
"keywords": [
Expand Down
1 change: 1 addition & 0 deletions AUTHORS.rst
Original file line number Diff line number Diff line change
Expand Up @@ -46,3 +46,4 @@ Contributors
* Javier Diez-Sierra <[email protected]> `@JavierDiezSierra <https://github.com/JavierDiezSierra>`_
* Hui-Min Wang `@Hem-W <https://github.com/Hem-W>`_
* Adrien Lamarche `@LamAdr <https://github.com/LamAdr>`_
* Sebastian Lehner <[email protected]> `@seblehner <https://github.com/seblehner>`_
2 changes: 2 additions & 0 deletions CHANGELOG.rst
Original file line number Diff line number Diff line change
Expand Up @@ -14,11 +14,13 @@ Announcements
New indicators
^^^^^^^^^^^^^^
* New ``heat_spell_frequency``, ``heat_spell_max_length`` and ``heat_spell_total_length`` : spell length statistics on a bivariate condition that uses the average over a window by default. (:pull:`1885`).
* New ``heat_wave_magnitude`` : yields the magnitude of the most intensive heat wave. (:pull:`1926`).

New features and enhancements
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
* New generic ``xclim.indices.generic.spell_mask`` that returns a mask of which days are part of a spell. Supports multivariate conditions and weights. Used in new generic index ``xclim.indices.generic.bivariate_spell_length_statistics`` that extends ``spell_length_statistics`` to two variables. (:pull:`1885`).
* Indicator parameters can now be assigned a new name, different from the argument name in the compute function. (:pull:`1885`).
* ``xclim.indices.run_length.windowed_max_run_sum`` accumulates positive values across runs and yields the the maximum valued run. (:pull:`1926`).

Bug fixes
^^^^^^^^^
Expand Down
12 changes: 12 additions & 0 deletions tests/test_indices.py
Original file line number Diff line number Diff line change
Expand Up @@ -1303,6 +1303,18 @@ def test_simple(self, tasmax_series):
np.testing.assert_array_equal(out, [10, 0, 12, 8, 0, 0, 0, 0, 0, 0, 0, 0])


class TestHeatWaveMagnitude:
def test_simple(self, tasmax_series):
a = np.zeros(365)
a[15:20] += 30 # 5 days
a[40:42] += 50 # too short -> 0
a[86:96] += 30 # at the end and beginning
da = tasmax_series(a + K2C)

out = xci.heat_wave_magnitude(da, thresh="25 C", freq="ME")
np.testing.assert_array_equal(out, [25, 0, 30, 20, 0, 0, 0, 0, 0, 0, 0, 0])


class TestHeatWaveFrequency:
@pytest.mark.parametrize(
"thresh_tasmin,thresh_tasmax,window,expected",
Expand Down
39 changes: 39 additions & 0 deletions tests/test_run_length.py
Original file line number Diff line number Diff line change
Expand Up @@ -124,6 +124,35 @@ def test_rle(ufunc, use_dask, index):
np.testing.assert_array_equal(out, expected)


@pytest.mark.parametrize("use_dask", [True, False])
@pytest.mark.parametrize("index", ["first", "last"])
def test_rse(ufunc, use_dask, index):
if use_dask and ufunc:
pytest.xfail("rse_1d is not implemented for dask arrays.")

values = np.zeros((10, 365, 4, 4))
time = pd.date_range("2000-01-01", periods=365, freq="D")
values[:, 1:11, ...] = 30
da = xr.DataArray(values, coords={"time": time}, dims=("a", "time", "b", "c"))

if ufunc:
pytest.xfail("rse_1d is not implemented.")
else:
if use_dask:
da = da.chunk({"a": 1, "b": 2})

out = rl.rse(da, index=index).mean(["a", "b", "c"])
if index == "last":
expected = np.zeros(365)
expected[1:10] = np.nan
expected[10] = 300
else:
expected = np.zeros(365)
expected[1] = 300
expected[2:11] = np.nan
np.testing.assert_array_equal(out, expected)


@pytest.mark.parametrize("use_dask", [True, False])
@pytest.mark.parametrize("index", ["first", "last"])
def test_extract_events_identity(use_dask, index):
Expand Down Expand Up @@ -370,6 +399,16 @@ def test_simple(self, index):
) + len(a[34:45])


class TestWindowedMaxRunSum:
@pytest.mark.parametrize("index", ["first", "last"])
def test_simple(self, index):
a = xr.DataArray(np.zeros(50, float), dims=("time",))
a[4:6] = 5 # too short
a[25:30] = 5 # long enough, but not max
a[35:45] = 5 # max sum => yields 10*5
assert rl.windowed_max_run_sum(a, 3, dim="time", index=index) == 50


class TestLastRun:
@pytest.mark.parametrize(
"coord,expected",
Expand Down
40 changes: 40 additions & 0 deletions tests/test_temperature.py
Original file line number Diff line number Diff line change
Expand Up @@ -889,6 +889,46 @@ def test_nan_presence(self, tasmax_series):
np.testing.assert_array_equal(hwi, [np.nan])


class TestHeatWaveMagnitude:
def test_simple(self, tasmax_series):
tx = np.zeros(366)
tx[:5] = np.array([30, 30, 30, 30, 30])
tx = tasmax_series(tx + K2C, start="1/1/2000")
hwm = atmos.heat_wave_magnitude(tx, freq="YS")
np.testing.assert_array_equal(hwm, [25])

def test_small_window_single_day(self, tasmax_series):
tx = np.zeros(366)
tx[5:8] = np.array([30, 0, 30])
tx = tasmax_series(tx + K2C, start="1/1/2000")
hwm = atmos.heat_wave_magnitude(tx, window=1, freq="YS")
np.testing.assert_array_equal(hwm, [5])

def test_small_window_double_day(self, tasmax_series):
tx = np.zeros(366)
tx[5:7] = np.array([30, 30])
tx = tasmax_series(tx + K2C, start="1/1/2000")
hwm = atmos.heat_wave_magnitude(tx, window=1, freq="YS")
np.testing.assert_array_equal(hwm, [10])

def test_convert_units(self, tasmax_series):
tx = np.zeros(366)
tx[:5] = np.array([30, 30, 30, 30, 30])
tx = tasmax_series(tx, start="1/1/2000")
tx.attrs["units"] = "C"
hwm = atmos.heat_wave_magnitude(tx, freq="YS")
np.testing.assert_array_equal(hwm, [25])

def test_nan_presence(self, tasmax_series):
tx = np.zeros(366)
tx[:5] = np.array([30, 30, 30, 30, 30])
tx[-1] = np.nan
tx = tasmax_series(tx + K2C, start="1/1/2000")

hwm = atmos.heat_wave_magnitude(tx, freq="YS")
np.testing.assert_array_equal(hwm, [np.nan])
seblehner marked this conversation as resolved.
Show resolved Hide resolved


class TestDailyFreezeThaw:
nc_tasmax = os.path.join("NRCANdaily", "nrcan_canada_daily_tasmax_1990.nc")
nc_tasmin = os.path.join("NRCANdaily", "nrcan_canada_daily_tasmin_1990.nc")
Expand Down
12 changes: 12 additions & 0 deletions xclim/data/fr.json
Original file line number Diff line number Diff line change
Expand Up @@ -377,6 +377,18 @@
"title": "Indice de vague de chaleur",
"abstract": "Nombre de jours de vagues de chaleur. Une vague de chaleur se produit lorsque la température maximale quotidienne excède un seuil donné durant un certain nombre de jours."
},
"HEAT_WAVE_MAGNITUDE": {
"long_name": "Différence cumulative maximale entre la température maximale quotidienne et {thresh} "
"pour les jours faisant partie d'une vague de chaleur. Une vague de chaleur est définie comme une série d'au moins "
"{window} jours consécutifs où la température maximale quotidien est au-dessus de {thresh}",
"description": "Cumul {freq:m} maximal de la différence entre la température et {thresh} pour les jours faisant partie "
"d'une vague de chaleur. Une vague de chaleur est définie comme une série d'au moins {window} jours consécutifs où la "
"température maximale quotidien est au-dessus de {thresh}",
"title": "Magnitude de la vague de chaleur {freq:m} la plus intense. Une vague de chaleur est définie comme une série d'au moins "
"{window} jours consécutifs où la température maximale quotidien est au-dessus de {thresh}.",
"abstract": "Magnitude de la vague de chaleur {freq:m} la plus intense. Une vague de chaleur est définie comme une série d'au moins "
"{window} jours consécutifs où la température maximale quotidien est au-dessus de {thresh}.",
},
"TG": {
"long_name": "Moyenne des températures maximale et minimale quotidennes",
"description": "Température moyenne quotidienne moyenne estimée par la moyenne des températures maximale et minimale quotidiennnes.",
Expand Down
15 changes: 15 additions & 0 deletions xclim/indicators/atmos/_temperature.py
Original file line number Diff line number Diff line change
Expand Up @@ -50,6 +50,7 @@
"heat_spell_total_length",
"heat_wave_frequency",
"heat_wave_index",
"heat_wave_magnitude",
"heat_wave_max_length",
"heat_wave_total_length",
"heating_degree_days",
Expand Down Expand Up @@ -215,6 +216,20 @@ class TempWithIndexing(ResamplingIndicatorWithIndexing):
compute=indices.heat_wave_frequency,
)

heat_wave_magnitude = Temp(
title="Heat wave magnitude",
identifier="heat_wave_magnitude",
units="K d",
long_name="Maximum cumulative difference between daily maximum temperature and {thresh} for days within a heat wave. "
"A heat wave is defined as a series of at least {window} consecutive days with daily maximum temperature above "
"{thresh}.",
description="Magnitude of the most intensive heat wave per {freq}. The magnitude is the cumulative exceedance of daily maximum temperature over {thresh}. A heat wave is defined as a series of at least {window} consecutive days with daily maximum temperature above"
abstract="Magnitude of the most intensive heat wave per {freq}. A heat wave occurs when daily maximum "
"temperatures exceed given thresholds for a number of days.",
Zeitsperre marked this conversation as resolved.
Show resolved Hide resolved
cell_methods="",
compute=indices.heat_wave_magnitude,
)

heat_wave_max_length = Temp(
title="Heat wave maximum length",
identifier="heat_wave_max_length",
Expand Down
50 changes: 50 additions & 0 deletions xclim/indices/_threshold.py
Original file line number Diff line number Diff line change
Expand Up @@ -64,6 +64,7 @@
"growing_season_length",
"growing_season_start",
"heat_wave_index",
"heat_wave_magnitude",
"heating_degree_days",
"hot_spell_frequency",
"hot_spell_max_length",
Expand Down Expand Up @@ -1928,6 +1929,55 @@ def heat_wave_index(
return to_agg_units(out, tasmax, "count")


@declare_units(tasmax="[temperature]", thresh="[temperature]")
def heat_wave_magnitude(
coxipi marked this conversation as resolved.
Show resolved Hide resolved
tasmax: xarray.DataArray,
thresh: Quantified = "25.0 degC",
window: int = 3,
freq: str = "YS",
op: str = ">",
resample_before_rl: bool = True,
) -> xarray.DataArray:
"""Heat wave magnitude index.

Magnitude of the most intensive heat wave event as sum of differences between tasmax
and the given threshold for Heat Wave days, defined as three or more consecutive days
over the threshold.

Parameters
----------
tasmax : xarray.DataArray
Maximum daily temperature.
thresh : xarray.DataArray
Threshold temperature on which to designate a heatwave.
window : int
Minimum number of days with temperature above threshold to qualify as a heatwave.
freq : str
Resampling frequency.
op : {">", ">=", "gt", "ge"}
Comparison operation. Default: ">".
resample_before_rl : bool
Determines if the resampling should take place before or after the run
length encoding (or a similar algorithm) is applied to runs.

Returns
-------
DataArray, [time]
Heat wave magnitude.
"""
thresh = convert_units_to(thresh, tasmax)
over_values = (tasmax - thresh).clip(0)

out = rl.resample_and_rl(
over_values,
resample_before_rl,
rl.windowed_max_run_sum,
window=window,
freq=freq,
)
return to_agg_units(out, tasmax, op="integral")


@declare_units(tas="[temperature]", thresh="[temperature]")
def heating_degree_days(
tas: xarray.DataArray,
Expand Down
55 changes: 53 additions & 2 deletions xclim/indices/run_length.py
Original file line number Diff line number Diff line change
Expand Up @@ -180,7 +180,8 @@ def rle(
xr.DataArray
Values are 0 where da is False (out of runs).
"""
da = da.astype(int)
if da.dtype == bool:
da = da.astype(int)

# "first" case: Algorithm is applied on inverted array and output is inverted back
if index == "first":
Expand All @@ -192,7 +193,7 @@ def rle(
# Keep total length of each series (and also keep 0's), e.g. 100120123 -> 100N20NN3
# Keep numbers with a 0 to the right and also the last number
cs_s = cs_s.where(da.shift({dim: -1}, fill_value=0) == 0)
out = cs_s.where(da == 1, 0) # Reinsert 0's at their original place
out = cs_s.where(da > 0, 0) # Reinsert 0's at their original place

# Inverting back if needed e.g. 100N20NN3 -> 3NN02N001. This is the output of
# `rle` for 111011001 with index == "first"
Expand Down Expand Up @@ -409,6 +410,56 @@ def windowed_run_count(
return out


def windowed_max_run_sum(
da: xr.DataArray,
window: int,
dim: str = "time",
freq: str | None = None,
ufunc_1dim: str | bool = "from_context",
index: str = "first",
) -> xr.DataArray:
"""Return the maximum sum of consecutive float values for runs at least as long as the given window length.

Parameters
----------
da : xr.DataArray
Input N-dimensional DataArray (boolean).
window : int
Minimum run length.
When equal to 1, an optimized version of the algorithm is used.
dim : str
Dimension along which to calculate consecutive run (default: 'time').
freq : str
Resampling frequency.
ufunc_1dim : Union[str, bool]
Use the 1d 'ufunc' version of this function : default (auto) will attempt to select optimal
usage based on number of data points. Using 1D_ufunc=True is typically more efficient
for DataArray with a small number of grid points.
Ignored when `window=1`. It can be modified globally through the "run_length_ufunc" global option.
index : {'first', 'last'}
If 'first', the run length is indexed with the first element in the run.
If 'last', with the last element in the run.

Returns
-------
xr.DataArray, [int]
Total number of `True` values part of a consecutive runs of at least `window` long.
"""
if window == 1 and freq is None:
out = rle(da, dim=dim, index=index).max(dim=dim)

else:
d_rse = rle(da, dim=dim, index=index)
d_rle = rle((da > 0).astype(bool), dim=dim, index=index)

d = d_rse.where(d_rle >= window, 0)
if freq is not None:
d = d.resample({dim: freq})
out = d.max(dim=dim)

return out


def _boundary_run(
da: xr.DataArray,
window: int,
Expand Down
Loading