Blog Posts

Insights and articles about cloud computing, containers, and open source.

Query S3 Tables with DuckDB

DuckDB has gained a new preview feature that allows querying Iceberg data in AWS S3 Tables. Setting up an S3 Table: There are multiple steps which need to be performed to s...

March 14, 2025
4 min read
aws s3 iceberg
Querying IP addresses and CIDR ranges with DuckDB

I had a use case that eventually required performing IP address lookups in a given list of CIDR ranges, as I maintain an open source project that gathers IP address range data f...

September 20, 2024
2 min read
data-analysis cidr ip-address
Chat with a Duck

A while ago I published sql-workbench.com and the accompanying blog post called "Using DuckDB-WASM for in-browser Data Engineering". The SQL Workbench enables its users to analy...

April 16, 2024
4 min read
ai sql llm
Using DuckDB-WASM for in-browser Data Engineering

Introduction: DuckDB, the in-process DBMS specialized in OLAP workloads, has grown very rapidly over the last year, both in functionality and in popularity amongst its user...

January 27, 2024
15 min read
sql wasm data-engineering
Retrieving Lambda@Edge CloudWatch Logs

What is Lambda@Edge? AWS Lambda@Edge is an extension of the traditional AWS Lambda service, but with a crucial twist – it brings serverless computing capabilities closer to the e...

January 26, 2024
3 min read
aws serverless lambdaedge
List of free AWS Knowledge Badges

As the AWS Skill Builder website is sometimes a bit hard to navigate, here's the full list of free badges you can earn on AWS Skill Builder: AWS Knowledge: Cloud Essentials, AWS Knowledg...

September 1, 2023
1 min read
aws free learning
Serverless Maps for fun and profit

Introduction: In today's data-driven world, interactive and visually appealing web-based maps have become an integral part of countless applications and services. Whether it's fo...

August 7, 2023
9 min read
cdn maps serverless
Gathering and analyzing public cloud provider IP address data with DuckDB & Observable

This article explains how to gather and analyze public cloud provider IP address data with DuckDB and Observable.

April 26, 2023
5 min read
free observable dataengineering
Casual data engineering, or: A poor man's Data Lake in the cloud - Part I

Using AWS Serverless services and DuckDB as near-realtime Data Lake backend infrastructure

April 24, 2023
19 min read
aws sql serverless
Using DuckDB to repartition parquet data in S3

A common task in S3-based Data Lakes is repartitioning data to optimize query patterns and speed. This article describes a serverless solution using DuckDB.

February 26, 2023
5 min read
analytics amazon-s3 data-lake