Blog Posts

Insights and articles about cloud computing, containers, and open source.

Query S3 Tables with DuckDB

Query S3 Tables with DuckDB

DuckDB has gained a new feature in preview, that allows querying of Iceberg data in AWS S3 Tables. Setting up a S3 Table There are multiple steps which need to be performed to set up a S3 Table that can be then queried with tools like DuckDB. As the ...

March 14, 2025
5 min read
s3-tables AWS apache iceberg
Querying IP addresses and CIDR ranges with DuckDB

Querying IP addresses and CIDR ranges with DuckDB

I had a use case that eventually required performing IP address lookups in a given list of CIDR ranges, as I maintain an open source project that gathers IP address range data from public cloud providers, and also wrote an article in my blog about an...

September 20, 2024
2 min read
duckDB ip address CIDR
Chat with a Duck

Chat with a Duck

A while ago I published sql-workbench.com and the accompanying blog post called "Using DuckDB-WASM for in-browser Data Engineering". The SQL Workbench enables its users to analyze local or remote data directly in the browser. This lowers the bar rega...

April 16, 2024
5 min read
duckDB llm AI
Using DuckDB-WASM for in-browser Data Engineering

Using DuckDB-WASM for in-browser Data Engineering

Introduction DuckDB, the in-process DBMS specialized in OLAP workloads, had a very rapid growth during the last year, both in functionality, but also popularity amongst its users, but also with developers that contribute many projects to the Open Sou...

January 27, 2024
18 min read
data-engineering wasm SQL
Retrieving Lambda@Edge CloudWatch Logs

Retrieving Lambda@Edge CloudWatch Logs

What is Lambda@Edge AWS Lambda@Edge is an extension of the traditional AWS Lambda service, but with a crucial twist – it brings serverless computing capabilities closer to the end-users. In essence, Lambda@Edge empowers developers to run custom code ...

January 26, 2024
3 min read
serverless lambda@edge AWS
List of free AWS Knowledge Badges

List of free AWS Knowledge Badges

As the Skillbuilder website is sometimes a bit hard to navigate, here's the full list of free badges you can do on AWS Skillbuilder: AWS Knowledge: Cloud Essentials AWS Knowledge: Architecting AWS Knowledge: Serverless AWS Knowledge: Object Stora...

September 1, 2023
1 min read
AWS skills learning
Serverless Maps for fun and profit

Serverless Maps for fun and profit

Introduction In today's data-driven world, interactive and visually appealing web-based maps have become an integral part of countless applications and services. Whether it's for navigation, location-based services, or data visualization, delivering ...

August 7, 2023
9 min read
serverless maps CDN
Gathering and analyzing public cloud provider IP address data with DuckDB & Observerable

Gathering and analyzing public cloud provider IP address data with DuckDB & Observerable

As organizations increasingly adopt the public cloud, managing the networking and security aspects of cloud computing becomes more complex. One of the challenges that cloud administrators face is, especially in a hybrid cloud environment, keeping tra...

April 26, 2023
8 min read
duckDB dataengineering free
Casual data engineering, or: A poor man's Data Lake in the cloud - Part I

Casual data engineering, or: A poor man's Data Lake in the cloud - Part I

In the age of big data, organizations of all sizes are collecting vast amounts of information about their operations, customers, and markets. To make sense of this data, many are turning to data lakes - centralized repositories that store and manage ...

April 24, 2023
20 min read
AWS serverless datalake
Using DuckDB to repartition parquet data in S3

Using DuckDB to repartition parquet data in S3

Since release v0.7.1, DuckDB has the ability to repartition data stored in S3 as parquet files by a simple SQL query, which enables some interesting use cases. Why not use existing AWS services? If your data lake lives in AWS, a natural choice for ET...

February 26, 2023
5 min read
duckDB Amazon S3 Data-lake