DuckDB

An embeddable SQL database like SQLite, but supports Postgres features - DuckDB / github / HN

The creator of duckdb argues that people using pandas are missing out of the 50 years of progress in database research - HN - DuckDB Internals (CMU Advanced Databases / Spring 2023) - duck DB use a vectorized push-base model (implemented in C++) - support larger than memory execution - DuckDB – The SQLite for Analytics (Mark Raasveldt, CWI)

DuckLake

  • DuckLake - advanced data lake features without traditional lakehouse complexity by using Parquet files and your SQL database. It’s an open, standalone format from the DuckDB team
  • Understanding DuckLake: A Table Format with a Modern Architecture - a new table format for storing data on blobstore
    • data are stored on a blob storage (parquet on S3)
    • metadata are stored in RDB (Postgres)

see also

Written on September 20, 2020, Last update on September 14, 2025
sql db sqlite parquet csv spark