Blog Posts

Insights and articles about cloud computing, efficient analytics, and open source.

Using Iceberg Catalogs in the Browser with DuckDB-Wasm

Using Iceberg Catalogs in the Browser with DuckDB-Wasm

With recent updates of DuckDB itself, DuckDB-Wasm and the Iceberg extension it is now possible to query Iceberg catalog directly from the browser, with no backends. involved. Example clients that work: SQL Workbench SQL Workbench Embedded DuckDB T...

December 16, 2025
5 min read
duckDB iceberg wasm
TypeScript scripts as DuckDB Table Functions

TypeScript scripts as DuckDB Table Functions

What if you could query any REST API, GraphQL endpoint, or web page directly from DuckDB using SQL? No ETL pipelines, no intermediate files, no complex setup - just write a TypeScript script and use it as a table function. In this post, I'll show you...

December 10, 2025
7 min read
duckDB TypeScript arrow
Use your favorite AI tool to read the lastest AWS News

Use your favorite AI tool to read the lastest AWS News

You can use the unofficial AWS News MCP Server, which combines the news articles, blog posts and updates of more than 40 different AWS Newsfeeds to a single source. Connecting to the AWS News MCP Server You can use different protocols to access it: ...

October 14, 2025
4 min read
AWS mcp AI
Using Amazon SageMaker Lakehouse with DuckDB

Using Amazon SageMaker Lakehouse with DuckDB

Preconditions To use the Amazon SageMaker Lakehouse with DuckDB, you first have to create a S3 Table bucket, a namespace and an actual S3 Table. All those steps are described in my other blog post “Query S3 Tables with DuckDB”, so please make sure yo...

June 8, 2025
5 min read
sagemaker duckDB glue
Welcome to the age of $10/month Lakehouses

Welcome to the age of $10/month Lakehouses

Recap: Data Warehouses, Data Lakes, Lakehouses? As a short recap, what do these mean, and how are they differentiated? Modern Data Warehouses, like Amazon Redshift, Google BigQuery, and Snowflake, offer fast, SQL-optimized performance for structured ...

May 30, 2025
18 min read
lakehouse Data-lake duckDB
Using DuckDB databases as lightweight Data Lake access layer

Using DuckDB databases as lightweight Data Lake access layer

Data Lakes come in a broad variety and lots of different flavors. AWS, Azure, Google Cloud, Snowflake, DataBricks, etc. they all have their specialties, strong and weak sides. Common among them is that the most, if not all, of them use Object Storage...

May 17, 2025
15 min read
duckDB Data-lake analytics
Handling GTFS data with DuckDB

Handling GTFS data with DuckDB

The General Transit Feed Specification (GTFS) is a standardized, open data format for public transportation schedules and geographic information. In practice, a GTFS feed is simply a ZIP archive of text (CSV) tables - such as stops.txt, routes.txt, a...

May 16, 2025
8 min read
gtfs duckDB data-engineering
Cost-efficient event ingestion into Iceberg S3 Tables on AWS

Cost-efficient event ingestion into Iceberg S3 Tables on AWS

Amazon S3 Tables was launched on December 3rd 2024, and provides you “storage that is optimized for tabular data such as daily purchase transactions, streaming sensor data, and ad impressions in Apache Iceberg format”. While S3 Tables can be queried ...

April 10, 2025
13 min read
apache iceberg AWS cloudflare
Query S3 Tables with DuckDB

Query S3 Tables with DuckDB

DuckDB has gained a new feature in preview, that allows querying of Iceberg data in AWS S3 Tables. Setting up a S3 Table There are multiple steps which need to be performed to set up a S3 Table that can be then queried with tools like DuckDB. As the ...

March 14, 2025
5 min read
s3-tables AWS apache iceberg
Querying IP addresses and CIDR ranges with DuckDB

Querying IP addresses and CIDR ranges with DuckDB

I had a use case that eventually required performing IP address lookups in a given list of CIDR ranges, as I maintain an open source project that gathers IP address range data from public cloud providers, and also wrote an article in my blog about an...

September 20, 2024
2 min read
duckDB ip address CIDR