Spark

Setting Sail. A clean setup using Docker, Jupyter & RustRover

20 July 2025·8 mins

A practical guide to setting up a clean and reproducible environment with Docker, Jupyter, and RustRover to work with Sail, whether as a user or contributor. From launching services with docker-compose to debugging locally without installing any dependencies on your machine.

Sail. Sailing Through Giants and Sparks

16 July 2025·7 mins

In this article, I share my critical view on the current state of data engineering, dominated by heavyweight platforms like Spark and Databricks, and introduce Sail, an open-source engine built on top of Apache Arrow and DataFusion, written in Rust, that offers a new path: lightweight, efficient, and powerful.

Spark

Setting Sail. A clean setup using Docker, Jupyter & RustRover

Sail. Sailing Through Giants and Sparks

From Outside to Core

From Core to Spark

Big Data with Zero Code