#DATAOPS

All Things Open @allthingsopen
2025-06-30

🚀 NEW on We ❤️ Open Source 🚀

Daniel Paes introduces Runink: a Go-native, open source platform that makes data pipelines fast, secure, and Kubernetes-free. Built with Linux primitives and Raft for strong governance.

allthingsopen.org/articles/run

Left side says We Love Open Source. #WeLoveOpenSource. ATO. A community education resource from All Things Open. Right side has two wrenches adjusting a pipe.

devopsdays Amsterdam @amsterdam@devopsdays.org
2025-06-18

📸 Handling data in Kubernetes - the cloud native way with Jonathan Gonzalez V.
From zero to a production-ready PostgreSQL cluster using CloudNativePG. Real DevOps for your data layer in action.
#DevOpsDaysAMS #Kubernetes #PostgreSQL #CloudNative #DevOps #DataOps

2025-06-16

🚀 OpenSearch 3.0 introduces powerful new PPL commands: lookup, join & subsearch! Supercharge your log analysis with enhanced correlation, efficient data exploration & real-time enrichment - all powered by Apache Calcite. Level up your observability game! 🔍 bit.ly/3ZZWwlS
#OpenSearch #DataOps #OpenSource #PPL

Recce - Trust, Verify, Ship @DataRecce
2025-06-03

🚨 Data diff isn’t enough.
You’re putting out harmless fires while real metric failures burn unnoticed.

You don’t need more diffing. You need better understanding.
Read 👉 datarecce.io/blog/more-than-da

Continuous Delivery Foundation @CDeliveryFdn@social.lfx.dev
2025-06-02

Join us on Wednesday, June 4 to talk about: Model Context Protocol
Vivek Mangipudi will lead the discussion in our #DataOps meeting.
Want to join? Reply and we'll add you.

PromptCloud @promptcloud
2025-05-28

Tired of babysitting DIY scraping scripts that crash the moment you scale?
You’re not alone.

PromptCloud takes the pain out of large-scale data extraction with fully managed, reliable solutions — so you can focus on what really matters: insights.

🔗 shorturl.at/EApIO

A meme showing a man confidently holding a pile of tangled wires labeled “My DIY scraping scripts,” contrasted with a chaotic, burning server room in the background labeled “The scripts when I try to scale.” The meme humorously highlights the fragility of DIY web scraping solutions under enterprise-scale demands.

Doug Ortiz @dougortiz
2025-05-23

The MLOps skillset is evolving! 🤯

Reddit shows a clear trend: DataOps is emerging as a distinct specialization, separate from traditional DevOps.

Is your organization prepared for this shift?

Full blog post: dougortiz.blogspot.com/2025/05

What training programs are you implementing to address this emerging DataOps talent gap?

Cher Fox (The Datanista) CDMP @TheDatanista
2025-05-23

Hardcoding is Hurting Your Team

Still hardcoding values in your data pipelines?
👎 It’s slowing down scaling.
👎 It creates more bugs.
👎 It’s not enterprise-ready.

Learn how to parameterize Azure Data Factory like a pro - with real-world strategies you can apply immediately after training.

💥 Save your team hours of rework.
💥 Build once, reuse forever.

📅 Live 6/11 - just $249.

🎁 1st 20 registrants get the bonuses!

🔗 Get your seat: entdna.com/product/advanced-az
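
The contrast between hardcoded and parameterized pipeline steps can be sketched in a few lines. This is a generic Python illustration, not Azure Data Factory's own parameter syntax; `CopyParams`, `build_copy_job`, and the container and table names are all hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class CopyParams:
    """Values that would otherwise be hardcoded in the pipeline step (hypothetical)."""
    source_container: str
    sink_table: str
    load_date: str  # ISO date string, e.g. "2025-06-11"


def build_copy_job(params: CopyParams) -> dict:
    """Build a copy-job description from parameters instead of literals."""
    return {
        "source": f"{params.source_container}/{params.load_date}/*.csv",
        "sink": params.sink_table,
    }


# The same definition serves several environments; only the parameters change.
dev_job = build_copy_job(CopyParams("raw-dev", "staging.orders", "2025-06-11"))
prod_job = build_copy_job(CopyParams("raw-prod", "sales.orders", "2025-06-11"))
```

Because every environment-specific value enters through `CopyParams`, moving a step from dev to prod means supplying different parameters, not editing the step itself.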

2025-05-22

Picked up "Python Polars: The Definitive Guide" by Jeroen Janssens and Thijs Nieuwdorp. The polar bear was already used on another O'Reilly book, but the Iberian lynx is cool.

Never sure how tech books will pan out, but Jeroen's book "Data Science at the Command Line" was a good one, so I am hopeful.

#python #polars #dataframes #datascience #DataEngineering #dataops #book #books #computerscience #analytics

Python Polars book cover, featuring an Iberian Lynx

Recce - Trust, Verify, Ship @DataRecce
2025-05-15

What breaks if I change this column?

Read our technical deep-dive into how Recce constructs column-level lineage from models

- How we track column origins and transformations using SQLGlot

- How we classify columns as pass-through, renamed, derived, or source

- How we handle tricky edge cases like SELECT *, name collisions, and macro expansion

Read more:
datarecce.io/blog/column-level
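
The four column categories named above can be illustrated with a toy classifier. This is a minimal sketch and not Recce's implementation: it assumes the SELECT list has already been parsed (for example by SQLGlot) into `(output_name, expression)` pairs, and it treats a column as "source" when its expression references no upstream column, which is an assumption about the term. The function name and the naive regex tokenizer are hypothetical.

```python
import re

# Naive identifier pattern; a real parser would also handle quoting,
# qualified names, and string literals that happen to contain column names.
IDENTIFIER = re.compile(r"[A-Za-z_][A-Za-z0-9_]*")


def classify_columns(projections, upstream_columns):
    """Label each output column as pass-through, renamed, derived, or source.

    projections: iterable of (output_name, expression) pairs.
    upstream_columns: set of column names available from the upstream model.
    """
    labels = {}
    for name, expr in projections:
        expr = expr.strip()
        referenced = {tok for tok in IDENTIFIER.findall(expr) if tok in upstream_columns}
        if not referenced:
            labels[name] = "source"        # no upstream dependency
        elif IDENTIFIER.fullmatch(expr):
            # A bare column reference: same name or a new alias.
            labels[name] = "pass-through" if expr == name else "renamed"
        else:
            labels[name] = "derived"       # computed from upstream columns
    return labels


labels = classify_columns(
    [("id", "id"), ("customer", "customer_name"), ("total", "price * qty"), ("version", "42")],
    {"id", "customer_name", "price", "qty"},
)
```

The edge cases the post mentions (SELECT *, name collisions, macro expansion) are exactly what this sketch ignores, since they require resolving the projection list against real schemas before classification.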

Recce - Trust, Verify, Ship @DataRecce
2025-05-15

Stop duplicating dashboards to preview the impact of dbt model changes!

With Recce, you can:
✅ Diff models
✅ Record data impact
✅ Share results instantly — no dashboards, no SQL, no screenshots

One click. Instant clarity:

medium.com/inthepipeline/stop-

2025-05-07

DBT: data transformation without the pain

Hi! My name is Kirill Lvov, and I am a full-stack developer at СберАналитика. In this article I want to talk about a powerful data transformation tool: DBT (Data Build Tool). Today any mid-sized or large business stores a lot of data in disparate sources (CRM, ERP, HRM, databases, file storage, and so on). Each of these systems is self-sufficient and addresses a specific business pain point, but once the data from these sources is collected and standardized, it becomes possible to analyze it, build machine learning models, and make management decisions based on it. ELT (or ETL) processes are built to implement this approach. ELT (Extract, Load, Transform) is a process consisting of three stages:

habr.com/ru/articles/907540/

#dbt #big_data #data_ingineering #аналитика_данных #трансформация_данных #elt #sql #dataops

2025-05-01

Big move: Fivetran + Census unite for seamless data flow. No more custom code headaches. Just pure data harmony.

#DataOps #ETL #DevTools fivetran.com/blog/why-fivetran

Marcos Lobo 💙💛 @marcosflobo@hachyderm.io
2025-04-30

❓Need clear “done” definitions when balancing code & data?

Align engineering & analytics acceptance criteria to avoid moving goalposts.

📋 Read how: newsletter.optimistengineer.co

#SoftwareEngineering #DataOps

PromptCloud @promptcloud
2025-04-18

We believe data should be clean, actionable — and sometimes a little fun.

🥚This table has a hidden message woven into the product titles. Can you find it?

Continuous Delivery Foundation @CDeliveryFdn@social.lfx.dev
2025-04-03

Always wanted to learn the Fundamentals of #DataOps?
🔴 Watch this workshop: youtu.be/hZ8NpMgPKcM

Datalumen @datalumen
2025-03-28

💶 The hidden cost of cutting corners? Technical debt, not just another corporate buzzword but the silent killer of efficiency, innovation, and organizational success.

👉 Understand what technical debt means in a data management context.
datalumen.eu/technicaldebt_dat

#DataArchitecture

THE HIDDEN COST OF CUTTING CORNERS: UNDERSTANDING TECHNICAL DEBT IN DATA MANAGEMENT

Continuous Delivery Foundation @CDeliveryFdn@social.lfx.dev
2025-03-25

✨ Latest #ContinuousSpotlight is on
Lisa N. Cao. You might know her as a CDF Ambassador or the person who launched our #DataOps Initiative, but she's so much more. Get to know her: cd.foundation/blog/2025/03/24/

Continuous Delivery Foundation @CDeliveryFdn@social.lfx.dev
2025-03-14

📣 Join our next #ContinuousDelivery Workshop on March 27
✨ The Fundamentals of #DataOps with speaker Lisa N. Cao
🎫 It's virtual and free: cd.foundation/blog/2025/03/13/

Continuous Delivery Foundation @CDeliveryFdn@social.lfx.dev
2025-01-23

📣 Help us develop an inclusive set of #DataOps and #DevOps best practices that let others leverage open source tools and frameworks for streamlined, secure, and scalable ML application deployment. cd.foundation/blog/2025/01/23/
