DuckLake: NYC Yellow Taxi
This example demonstrates how to use DuckLake as a data source in hugr, featuring the NYC Yellow Taxi trip dataset with ~36M real taxi trip records stored in MinIO S3. It showcases self-describing schemas, time-travel queries with the @at directive, relationships, DDL operations, and DuckLake management functions.
You can find the complete example in the Hugr examples repository.
To read more about the DuckLake data source, see the DuckLake data source documentation.
Prerequisites
You need the example environment running (sh scripts/start.sh), DuckDB CLI installed locally, and MinIO Client (mc) for bucket management.
Example
- Getting Started
- Extension Schema
- Setup Script
Loading documentation...
Loading schema...
Loading script...