#apachehudi

2025-01-20

#ApacheHudi 1.0 is now generally available!

The release introduces new features aimed at transforming data lakehouses into what the project community considers a fully-fledged "Data Lakehouse Management System" (DLMS).

Details on #InfoQ ๐Ÿ‘‰ bit.ly/3E5AXZi

#AI #DataLake #opensource #DataAnalytics

rmoff ๐Ÿƒ๐Ÿป ๐Ÿบ ๐Ÿฅ“rmoff@data-folks.masto.host
2024-02-23

More good stuff from Grab - this time writing about how they are building a realtime datalake with tools including #apacheFlink, #apacheHudi, #apacheSpark and #TrinoDB

engineering.grab.com/enabling-

#dataEngineering #dataArchitectures #openSource

Alex Merced - ๐Ÿฅ‘ @ Dremioalexmerced@data-folks.masto.host
2023-07-17

โ“โ“โ“HOW LAKEHOUSE TABLE FORMAT WORKSโ“โ“โ“

1. Engine reads table format metadata
2. Builds list of files with relevant data based on metadata
3. Scans those files and executes query

#DataEngineering #DataAnalytics #BigData #DataLakehouse #ApacheIceberg #ApacheHudi #DeltaLake

Nicolas Frรคnkel ๐Ÿ‡ช๐Ÿ‡บ๐Ÿ‡บ๐Ÿ‡ฆ๐Ÿ‡ฌ๐Ÿ‡ชfrankel@mastodon.top
2023-05-05

Get a detailed overview of #DeltaLake, #ApacheHudi, and #ApacheIceberg as we discuss their data storage, processing capabilities, and deployment options dzone.com/articles/delta-hudi-

#analytics #spark

rmoff ๐Ÿƒ๐Ÿป ๐Ÿบ ๐Ÿฅ“rmoff@data-folks.masto.host
2023-02-03

This blog from Onehouse about #ApacheHudi is interesting.

My eye was caught by the chart showing which organisations and companies contribute to the #opensource projects. We all know that DB dominates DL. I wonder if the balance on the other two will stay over time or if Onehouse and Tabular (circled) will start to grow.

onehouse.ai/blog/apache-hudi-v

Wojtek Walczakwojtekwalczak
2022-12-23

My Medium adventure enters a new phase: the first post for a Medium-held publication, Plumbers of Data Science, just got published :)

It's also more technical than my previous writings. The point is to introduce Apache Hudi in a softer way than the official documentation does at the moment. So, if you're interested in starting with Hudi, look no further :)

medium.com/plumbersofdatascien

heise online (inoffiziell)heiseonline@squeet.me
2020-06-05
Apache Hudi, ein Tool zum Verwalten groรŸer Datenstrรถme, hat die Bewรคhrungsphase im Apache Incubator abgeschlossen.
Apache Software Foundation erhebt Hudi zum Top-Level-Projekt
#ApacheHadoop #ApacheHudi #ApacheSoftwareFoundation #Top-Level-Projekt

Client Info

Server: https://mastodon.social
Version: 2025.07
Repository: https://github.com/cyevgeniy/lmst