Posts tagged with analytics
2 posts found
Filter by tag
All Posts
AWS mcp AI #agent sagemaker duckDB glue lakehouse Data-lake analytics gtfs data-engineering apache iceberg cloudflare serverless s3-tables iceberg S3 ip address CIDR data analysis llm SQL wasm data analytics lambda@edge skills learning free badge maps CDN dataengineering dataanalytics observable datalake Amazon S3 Parquet lambda Reverse Proxy Caddy serverless framework cloudformation
Using DuckDB databases as lightweight Data Lake access layer
Data Lakes come in a broad variety of flavors. AWS, Azure, Google Cloud, Snowflake, Databricks, etc. all have their specialties, strengths, and weaknesses. What most, if not all, of them have in common is that they use Object Storage...
May 17, 2025
15 min read
duckDB Data-lake analytics
Using DuckDB to repartition parquet data in S3
Since release v0.7.1, DuckDB has been able to repartition data stored in S3 as Parquet files with a simple SQL query, which enables some interesting use cases. Why not use existing AWS services? If your data lake lives in AWS, a natural choice for ET...
February 26, 2023
5 min read
duckDB Amazon S3 Data-lake