#DuckDB

Mitch Chapmanmchapman87501@c.im
2025-05-30

This week I've learned that uv, marimo, polars, and duckdb play very well together.

Many of my happy surprises probably wouldn't be surprises, if I'd just read the docs more carefully 😁 But they'd still be happy.

Examples:

1. A marimo cell that defines a pytest-discoverable test function automatically runs the test when evaluated. No need to explicitly import pytest. (Though I assume pytest needs to be in the current venv.)

2. marimo's "sql" function returns its result as a Polars DataFrame, and by default it displays the result as the cell result.

#uv #marimo_io #polars #duckdb

docs.astral.sh/uv/
marimo.io
pola.rs
duckdb.org

A screenshot of a marimo notebook cell that defines a pytest-discoverable test function.Screenshot of the output from the test cell, showing typical pytest output.
2025-05-30

I want to make my partner a simple database (~3 tables, i think), because I think that's the "right" thing to use (as opposed to excel / google sheets). But, I've never had to create a database thingy before.

Seeing this as an opportunity to learn some #DuckDB or #SQLite.

Seeking #recommendations for a simple (GUI) interface for interacting with the tables.

#LibreOffice Base or something else?

#suggestions welcome
#database #db #help

Michael Simonsrotnroll666
2025-05-29

@AtanasE I do like that you can just query it with or and the latter has a neat spatial extension utilising postgis. @defuneste

2025-05-29

AI-Native Our World in Data Demography & Fertility Explorer with #Python and #DuckDB

🎥 youtube.com/watch?v=fA06oOxKh3

🧑‍💻 github.com/gloodata/extension-

【登壇資料】 「DuckDBのデータエンジニア観点での使い道 (2025年5月版)」というタイトルで社内で登壇しました
dev.classmethod.jp/articles/du

#dev_classmethod #DuckDB

2025-05-28

PondPilot: как мы сделали локальный SQL-редактор в браузере на DuckDB и WASM

Любой, кто хоть раз пытался «по-быстрому» проанализировать CSV-файл или прототип БД, сталкивался с выбором из неудобств: открывать в Excel, запускать Jupyter, возиться с pandas, или поднимать Postgres/ClickHouse ради пары запросов. Мне показалось странным, что в 2025 году до сих пор нет удобной zero-setup SQL-песочницы для локальных данных. Так родился PondPilot - open-source инструмент для анализа данных, работающий прямо в браузере, без серверов и настройки.

habr.com/ru/articles/913682/

#sql #duckdb #duckdb_wasm

Alejandro Baezzeab@fosstodon.org
2025-05-28

And #duckdb continues to be my everything analytics thing. 😅 #ducklake simply cements this. Basically use #sqlite, #duckdb, or #postgres. 😏

If I need more, I have to think REAL hard what I can't accomplish with sharding. 😁

duckdb.org/2025/05/27/ducklake

sriram kotasriramskota
2025-05-28

Is anyone using duckdb with elixir/Phoenix? what are you using it for?

boB Rudis 🇺🇦 🇬🇱 🇨🇦hrbrmstr
2025-05-28

Tyler Hillery took DuckLake for a spin on top of Supabase + Supabase Storage: youtube.com/watch?v=diL00ZZ-q50

A dual-pane IDE screenshot shows a DuckDB + PostgreSQL Python script on the left and a resulting SQL query output on the right. The script configures an S3-compatible DuckDB external storage secret and attaches a Postgres database via environment variables. It creates a table from a CSV hosted at blobs.duckdb.org containing Dutch train station data. The right pane displays a preview of the loaded data: rows of station records including fields like id, code, name_short, country, and type. There are 578 rows in total, with locations spanning the Netherlands, Germany, France, Switzerland, Belgium, and more.

メタデータ管理をデータベースで担う新しいLakehouseフォーマット「DuckLake」が発表されました
dev.classmethod.jp/articles/du

#dev_classmethod #DuckDB

hrbrmstr 🇺🇦 🇬🇱 🇨🇦hrbrmstr.dev@bsky.brid.gy
2025-05-27

#DuckDB DuckLake — a new lakehouse format built on pure SQL — is out! duckdb.org/2025/05/27/d... duckdb.org/docs/stable/...

DuckLake: SQL as a Lakehouse F...

boB Rudis 🇺🇦 🇬🇱 🇨🇦hrbrmstr
2025-05-27

【AWS CDK】EC2にGrafanaをインストールして、DuckDBでS3データを可視化してみた
dev.classmethod.jp/articles/aw

#dev_classmethod #DuckDB #Grafana #AWS_CDK #Amazon_EC2 #AWS

Dirk Eddelbuetteleddelbuettel
2025-05-26

PSA for a potential audience of ... maybe a handful of people: something I posted earlier today in the discord under 'show-and-tell': if you wished you could get new `duckdb` binaries automagically via `apt`, a simple repo shows one way -- and got me versions 1.3.0 and one or two of the 1.2.* ones. Text of post in alt-text.

Screenshot of a post made at the duckdb discord under show-and-tell. Plain text follows.

(Rather ad-hoc and possibly limited to small audience but hey it is "show-and-tell" here)  If you are on, say, Ubuntu and get annoyed that all-but-one (or two) software packages autoMAGICally update with an apt call (as in my case the list of auto-updating packages includes brave, broot, chrome, code, docker, dropbox, obs, rclone, slack, spotify, syncthing, ... as there are in fact a number of suitable apt repos either by vendors or contributors) it gets irritating to deal with the remaining few by hand. So folloing a first script to fill a local PPA with positron, quato, rstudio, shiny packages I added one to fetch duckdb , turn it into a .deb (and then place it into the local PPA).  In case that is of interest to the one or two of you here, the repo is at https://github.com/eddelbuettel/duck-deb but the cronjob runs locally.  Some variables hardcoded to what I run here (ie 24.10 amd64 binary, my email, ...) but should generalise.   File an issue at the repo if you questions / suggestions.  Worked fine for last few 1.2.* releases and now 1.3.0.
Alejandro Baezzeab@fosstodon.org
2025-05-23

#duckdb now has the ability to run commands from serialized formats line #json or #CSV. Already have use for this. 😏

Latest release is simply packed with improvements. 😎
duckdb.org/2025/05/21/announci

2025-05-23

Easily obtain OSM and OMF data: #Python and CLI tools #QuackOSM and #OvertureMaestro offer easier access to data from #OpenStreetMap (#OSM) and the Overture Maps Foundation (#OMF) through #PyArrow, #GeoParquet, or #DuckDB. These tools can simplify large-scale geospatial data...
spatialists.ch/posts/2025/05-2 #GIS #GISchat #geospatial #SwissGIS

GripNewsGripNews
2025-05-23

🌘 DuckDB 的機場:擴展 Arrow Flight 支援
➤ 開放 DuckDB 翱翔天際
airport.query.farm/
「機場」是 DuckDB 的一個擴展,它引入了對 Apache Arrow Flight 的支援,使 DuckDB 能夠透過 Arrow Flight 伺服器查詢、修改和儲存資料。此擴展允許存取 DuckDB 本身無法直接處理的資料格式或外部 API,並提供精細的存取控制和資料即服務 (Data-as-a-Service) 的能力。它基於 C++ 撰寫,並仰賴 Apache Arrow 函式庫。
+ 哇,這真是個令人興奮的發展!能讓 DuckDB 連接更多外部資料源,簡化資料流程。
+ 聽起來很複雜,但如果能讓我更容易地使用 DuckDB 查詢不同格式的資料,那絕對值得學習。
Arrow

Client Info

Server: https://mastodon.social
Version: 2025.04
Repository: https://github.com/cyevgeniy/lmst