Will I see you at the Subsurface Lakehouse Conference Nov 13th?
Register at Dremio.com/subsurface
Are you subscribed?
Subscribe to my blog on Medium or Substack to get regular updates on the data and AI world. Find all the links at AlexMerced.com/data.
Cloudflare has just launched the open beta of its Cloudflare Data Platform - a managed service for ingesting, storing & querying analytical data tables using open standards like Apache Iceberg.
Dive into the key insights on #InfoQ ✨ https://bit.ly/49y1tIa
#CloudComputing #DataLake #DataAnalytics #ApacheIceberg #Cloudflare
scrapy-contrib-bigexporter 0.6.1 released: https://codeberg.org/ZuInnoTe/scrapy-contrib-bigexporters
Added: You can customize Iceberg table location
#scrapy #webscraping #bigdata #iceberg #apacheiceberg #opensource #python
scrapy-contrib-bigexporter 0.6.0 released: https://codeberg.org/ZuInnoTe/scrapy-contrib-bigexporters
New: Export your webscraped items in Scrapy to Apache Iceberg tables with simple configuration
#scrapy #webscraping #bigdata #iceberg #apacheiceberg #opensource #python
#Netflix scaled Muse to handle trillion-row datasets!
➡️ Muse helps teams see which artwork & videos resonate with audiences.
➡️ To keep up with demand, Netflix redesigned the data layer, cutting query latencies by ~50% while keeping results accurate and responsive.
Learn more: https://bit.ly/4gG3HGU
Watching the re-indexing of an archival catalog backup of AtoM, I realized:
Indices populated with 18751 documents in 164.84 seconds.
19k Objects?
That's /nothing/ for a regular #bigDATA tech-tool. This is peanuts.
400.000 Objects?
Millions?! - According to documentation of #ApacheIceberg #ObjectStore #Redis #KeyDB, etc: **easy**
#DLTP & #GLAM: Storing and using those "objects" in key/value annotated filesystems with bigDATA tools:
**FUN!!**
Pick up "Architecting an Apache Iceberg Lakehouse"
My third book has officially hit MEAP and you can buy it 50% off for a limited time! ->
Amazon #S3 now supports sort and z-order compaction for #ApacheIceberg tables, promising reduced scan times & lower engine costs.
Available for both S3 Tables and traditional S3 buckets via AWS Glue Data Catalog optimization.
Dive into the details: https://bit.ly/3GyjxWQ
Behold, the earth-shattering breakthrough of Nimtable: a web UI to *click* on Apache Iceberg tables! Presumably because using command line tools is an insurmountable task for mere mortals. Or maybe it's just a clever way to make clicking around a web interface the new rocket science.
https://github.com/nimtable/nimtable #Nimtable #ApacheIceberg #WebUI #Innovation #TechNews #ClickAndGo #HackerNews #ngated
Nimtable: Open-source web UI to browse and manage Apache Iceberg tables
https://github.com/nimtable/nimtable
#HackerNews #Nimtable #OpenSource #ApacheIceberg #WebUI #DataManagement #DatabaseTools
Paris: Apache Iceberg Paris Community Meetup #1, Thursday, June 19, 2025, from 6:00 pm to 9:30 pm. https://www.agendadulibre.org/events/32653 #data #dataLakehouse #dataEngineer #dataScience #dataPlatform #dataWarehouse #apacheIceberg
"Centralize Your Data Lake: Apache Polaris Supports Apache Iceberg and Now Delta Lake"
BTW 'Polaris' used to be the name of the UK nuclear deterrent pre-1996.
https://snowflake.com/en/engineering-blog/apache-polaris-supports-iceberg-delta-lake/
What happens when you marry the #ClickHouse database with #ApacheIceberg? You could query huge datasets fast, with 10x cheaper storage. Sounds promising, right?
Join me tomorrow on the live stream to find out!
May 20th, 11am PT / 20:00 CET:
https://www.youtube.com/watch?v=VeyTL2JlWp0
#ApacheIceberg: What It Is and Why Everyone's Talking About It
Benefits of Apache Iceberg for geospatial data analysis
https://wherobots.com/blog/benefits-of-apache-iceberg-for-geospatial-data-analysis/
#HackerNews #ApacheIceberg #GeospatialData #DataAnalysis #BigData #Analytics
R2 Data Catalog: Managed Apache Iceberg tables with zero egress fees - Cloudflare
The Iceberg wars are hotting up. AWS has some competition.
Streamlining access to tabular datasets stored in Amazon S3 Tables with DuckDB | AWS Storage Blog