Dask is a Python library for parallel computation. It provides scalable, pandas-like DataFrames and lets you run distributed Python code for fast data processing.
You can install Dask on your platform cluster by using, for example, pip or Conda.
Dask is pre-deployed in the platform's Jupyter Notebook service.
You can find examples of using Dask in the platform's tutorial Jupyter notebooks; see, in particular, the dask-cluster tutorial.
For general information about Dask and how to use it, see the Dask documentation.