Posts tagged with duckdb

14 posts found

Custom DuckDB Wasm builds for Cloudflare Workers

Custom DuckDB Wasm builds for Cloudflare Workers

What if you could run full SQL queries, including JOINs, aggregations, and even remote Parquet file reads, directly inside a Cloudflare Worker? No database server, no connection...

January 27, 2026
5 min read
async wasm duckdb
Using Iceberg Catalogs in the Browser with DuckDB-Wasm

Using Iceberg Catalogs in the Browser with DuckDB-Wasm

With recent updates of DuckDB itself, DuckDB Wasm and the Iceberg extension it is now possible to query Iceberg catalog directly from the browser, with no backends. involved. Ex...

December 16, 2025
4 min read
wasm duckdb iceberg
TypeScript scripts as DuckDB Table Functions

TypeScript scripts as DuckDB Table Functions

What if you could query any REST API, GraphQL endpoint, or web page directly from DuckDB using SQL? No ETL pipelines, no intermediate files, no complex setup just write a TypeSc...

December 10, 2025
3 min read
typescript arrow duckdb
Using Amazon SageMaker Lakehouse with DuckDB

Using Amazon SageMaker Lakehouse with DuckDB

Preconditions To use the Amazon SageMaker Lakehouse with DuckDB, you first have to create a S3 Table bucket, a namespace and an actual S3 Table. All those steps are described in...

June 8, 2025
5 min read
glue sagemaker duckdb
Welcome to the age of $10/month Lakehouses

Welcome to the age of $10/month Lakehouses

Recap: Data Warehouses, Data Lakes, Lakehouses? As a short recap, what do these mean, and how are they differentiated? Modern Data Warehouses , like Amazon Redshift , Google Big...

May 30, 2025
19 min read
data-lake lakehouse duckdb
Using DuckDB databases as lightweight Data Lake access layer

Using DuckDB databases as lightweight Data Lake access layer

Data Lakes come in a broad variety and lots of different flavors. AWS, Azure, Google Cloud, Snowflake, DataBricks, etc. they all have their specialties, strong and weak sides. C...

May 17, 2025
7 min read
analytics data-lake duckdb
Handling GTFS data with DuckDB

Handling GTFS data with DuckDB

The General Transit Feed Specification GTFS is a standardized, open data format for public transportation schedules and geographic information. In practice, a GTFS feed is simpl...

May 16, 2025
7 min read
data-engineering duckdb gtfs
Querying IP addresses and CIDR ranges with DuckDB

Querying IP addresses and CIDR ranges with DuckDB

I had a use case that eventually required performing IP address lookups in a given list of CIDR ranges, as I maintain an open source project that gathers IP address range data f...

September 20, 2024
2 min read
data-analysis cidr ip-address
Chat with a Duck

Chat with a Duck

A while ago I published sql workbench.com and the accompanying blog post called "Using DuckDB WASM for in browser Data Engineering". The SQL Workbench enables its users to analy...

April 16, 2024
4 min read
ai sql llm
Using DuckDB-WASM for in-browser Data Engineering

Using DuckDB-WASM for in-browser Data Engineering

Introduction DuckDB, the in process DBMS specialized in OLAP workloads, had a very rapid growth during the last year, both in functionality, but also popularity amongst its user...

January 27, 2024
15 min read
sql wasm data-engineering
Gathering and analyzing public cloud provider IP address data with DuckDB & Observerable

Gathering and analyzing public cloud provider IP address data with DuckDB & Observerable

This articles explains how the gathering and analyzing of public cloud provider IP address data is possible with DuckDB and Observerable

April 26, 2023
5 min read
free observable dataengineering
Casual data engineering, or: A poor man's Data Lake in the cloud - Part I

Casual data engineering, or: A poor man's Data Lake in the cloud - Part I

Using AWS Serverless services and DuckDB as near-realtime Data Lake backend infrastructure

April 24, 2023
19 min read
aws sql serverless
Using DuckDB to repartition parquet data in S3

Using DuckDB to repartition parquet data in S3

A common task in S3-based Data Lakes is to repartition data, to optimize query patterns and speed. This article describes a serverless solution using DuckDB

February 26, 2023
5 min read
analytics amazon-s3 data-lake
Using DuckDB in AWS Lambda

Using DuckDB in AWS Lambda

How to run DuckDB in a serverless way on AWS Lambda, with a custom layer.

February 12, 2023
6 min read
lambda aws serverless