Patrick Hoefler
Patrick Hoefler is a member of the pandas core team and a Dask maintainer. He currently works at Coiled, where he focuses on Dask development and the integration of a logical query planning layer into Dask. He holds an MSc in Mathematics and is working towards an MSc in Software Engineering at the University of Oxford.
Sessions
06-15
12:00
40min
Dask DataFrame 2.0 - Comparison to Spark, DuckDB and Polars
Patrick Hoefler
Dask is a Python library for distributed computing that integrates with pandas. Historically, Dask was the easiest option to adopt (it’s just pandas) but struggled to deliver robust performance. A re-implementation of Dask DataFrames will bring it up to speed with Spark, DuckDB and Polars.
Warwick