Dask is a free, open-source Python library for parallelizing computations on very large data sets. Some applications include satellite imagery, medical data, genomics, etc. These data sets are usually larger than the available memory.
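To make the idea of chunked, parallel computation concrete, here is a minimal sketch using `dask.array`. The array shape and chunk size are arbitrary values chosen for illustration, and the array here is small enough to fit in memory; the same pattern is what lets Dask handle data that does not.

```python
import dask.array as da

# A 4,000 x 4,000 array split into 1,000 x 1,000 chunks (16 chunks in total).
# Sizes are illustrative assumptions, not recommendations.
x = da.ones((4_000, 4_000), chunks=(1_000, 1_000))

# Building the expression is lazy: this only records a task graph.
total = (x + 1).sum()

# compute() executes the graph, working through the chunks in parallel.
result = total.compute()
print(result)
```

Only the chunks being processed need to be in memory at any one time, which is why this approach scales to larger-than-memory data.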
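Xarray is a Python library for labeled, multidimensional arrays. It is similar in spirit to pandas and built on top of NumPy. It can optionally use Dask arrays as its backend, wrapping Dask transparently so that the same Xarray code works on chunked, lazily evaluated data.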
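As a rough sketch of what "wrapping Dask transparently" can look like in practice, the example below builds a small labeled array, converts it to a Dask-backed one with `.chunk()`, and runs an ordinary Xarray reduction on it. The dimension names `time` and `x`, the variable name `signal`, and the data are made up for illustration; Dask must be installed for `.chunk()` to work.

```python
import numpy as np
import xarray as xr

# A small labeled array; names and values are hypothetical.
data = xr.DataArray(
    np.arange(24.0).reshape(4, 6),
    dims=("time", "x"),
    name="signal",
)

# Convert to a Dask-backed array, split into chunks of 2 along "time".
chunked = data.chunk({"time": 2})

# The usual Xarray API applies; the result stays lazy until computed.
mean_over_time = chunked.mean(dim="time")

# compute() triggers the Dask computation and returns a NumPy-backed array.
values = mean_over_time.compute()
print(values.values)
```

The reduction is written exactly as it would be for an in-memory array; only the `.chunk()` and `.compute()` calls reveal that Dask is involved.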