Skip to main content

DuckLake: NYC Yellow Taxi

This example demonstrates how to use DuckLake as a data source in hugr, featuring the NYC Yellow Taxi trip dataset with ~36M real taxi trip records stored in MinIO S3. It showcases self-describing schemas, time-travel queries with the @at directive, relationships, DDL operations, and DuckLake management functions.

You can find the complete example in the Hugr examples repository.

To read more about the DuckLake data source, see the DuckLake data source documentation.

Prerequisites

You need the example environment running (sh scripts/start.sh), DuckDB CLI installed locally, and MinIO Client (mc) for bucket management.

Example

Loading documentation...