Xhistogram Tutorial

Histograms are the foundation of many forms of data analysis. The goal of xhistogram is to make it easy to calculate weighted histograms in multiple dimensions over n-dimensional arrays, with control over which axes the histogram is computed over. Xhistogram builds on top of xarray, for automatic coordinates and labels, and dask, for parallel scalability.

2D Histogram

Now let’s say we have multiple input arrays. We can calculate their joint distribution:

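# da, bins, nt and nx are reused from the earlier cells of this tutorial
db = xr.DataArray(np.random.randn(nt, nx), dims=['time', 'x'],
                  name='bar') - 2

# joint histogram of da and db, using the same bin edges for both variables
histogram(da, db, bins=[bins, bins]).plot()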

_images/tutorial_14_1.png
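
To make "joint distribution" concrete, here is a minimal NumPy-only sketch of the counting that a joint histogram performs when every dimension is collapsed: each pair of values is assigned to one cell of a 2D grid of bins. The array sizes, bin edges, and random seed below are illustrative assumptions, and the sketch uses `np.histogram2d` as a reference point rather than xhistogram itself.

```python
import numpy as np

# Illustrative sizes and bin edges (assumed for this sketch)
nt, nx = 100, 30
bins = np.linspace(-4, 4, 20)  # 20 edges give 19 bins per variable

rng = np.random.default_rng(0)
a = rng.standard_normal((nt, nx))      # stands in for the first input array
b = rng.standard_normal((nt, nx)) - 2  # second input, shifted by -2

# Flatten both arrays and count how many (a, b) pairs land in each 2D bin
counts, a_edges, b_edges = np.histogram2d(a.ravel(), b.ravel(),
                                          bins=[bins, bins])

# Pairs with either value outside [-4, 4] are not counted
inside = (a >= -4) & (a <= 4) & (b >= -4) & (b <= 4)
print(counts.shape, int(counts.sum()), int(inside.sum()))
```

Because the second array is shifted by -2, most of the counts should sit around -2 on its axis, which is the offset visible in the plot above.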

Dask Integration

Dask-backed inputs should just work, but this section still needs examples.