#AnalyticsEngineering

Recce - Trust, Verify, ShipDataRecce
2025-06-11

"The value didn't justify the effort."
A Sr. Data Engineer at Swedish MediaTech after their Datafold PoC.

❌ Heavy setup → Noisy results → Alert fatigue → No way to start small
Comparison: datarecce.io/blog/recce-vs-dat

setup noise
Recce - Trust, Verify, ShipDataRecce
2025-06-10

Don’t start with what changed. Start with what SHOULD change!

Because not every diff is a problem, and not every problem shows up as a diff.

👉 datarecce.io/blog/more-than-da

Data Reviews What Changed
Recce - Trust, Verify, ShipDataRecce
2025-05-22

Want to better understand how your data models work, and what might break when they change?

Here are 5 types of column transformations in models:
1. Pass-through
2. Renamed
3. Derived
4. Source
5. Unknown

Each one helps you assess impact and trace data through your pipeline more clearly

We use these types in Recce to power column-level lineage and breaking change analysis

Read the deep dive
datarecce.io/blog/column-level

Recce - Trust, Verify, ShipDataRecce
2025-05-15

Stop duplicating dashboards to preview the impact of dbt model changes!

With Recce, you can:
✅ Diff models
✅ Record data impact
✅ Share results instantly — no dashboards, no SQL, no screenshots

One click. Instant clarity:

medium.com/inthepipeline/stop-

datarootsdataroots
2025-03-25
dbt Copilot
Recce - Trust, Verify, ShipDataRecce
2025-03-14

Column-level lineage is now available in Recce 0.57

Add it to your dbt data validation workflow:

1. Lineage Diff - Focus on impacted models
2. Breaking Change Analysis - Eliminate irrelevant changes
3. Column-Level Lineage - Track column evolution

Column-level lineage in datarecce.io
Recce - Trust, Verify, ShipDataRecce
2025-03-03

❌ No more manually cross-referencing dbt docs from dev and prod

❌ No more manually checking schemas in your data warehouse

❌ No more manually comparing row-counts on models.

See a trend?

Read how an experienced data professional validates zero regression on a PR:

linkedin.com/posts/abdelm_recc

2025-02-28

🚀 Started Module 4 of Zoomcamp!
Just kicked off the Analytics Engineering module and I'm diving into transforming the Green Taxi, Yellow Taxi, and FHV NY Taxi datasets loaded in . Excited to see how dbt can help create analytical views for better decision-making!

Recce - Trust, Verify, ShipDataRecce
2025-02-12

It's not enough to rely solely on running dbt tests in CI.

dbt tests can't cover every situation because, by nature, they can only be written for issues you can foresee.

The only way to check for data impact is to compare current data with historical data

Read Tim Hiebenthal's post "The current gaps in (your) dbt-tests":

handsondata.substack.com/p/the

While dbt’s testing framework is robust and you can even advance into things like anomaly detection, there’s a critical gap in a lot of setups:
Benchmarking development environments against production. Even with a CI pipeline running a suite of tests, it’s challenging to detect subtle issues like revenue change caused by changes in filters or calculations, because you need to have built a test for it beforehand. For example, your tests might pass, but your revenue calculations could still be off by 5% due to an unnoticed or uncovered change in logic. So in other words:
You want to know if the 1.25 million in sales last month, which your stakeholders are working with, suddenly changed.

Building unit tests for every table is an unrealistic ask, since you need to mock all input- and output records you want to check.
Recce - Trust, Verify, ShipDataRecce
2025-01-15

You don’t want to backtrack through time looking for the point that bad data was merged into your dbt project!

Use Recce for QA *before* merging to prod and get full insight into data impact.

Get open-source Recce now:
github.com/datarecce/recce

Doc shows the point at which Biff merged bad data into the dbt project
Recce - Trust, Verify, ShipDataRecce
2025-01-10

Recce Sandbox is now live!

Think dbt Analyses but with query diff for iterating on a model without materializing or modifying your project

$ pip install -U recce
$ recce server

Start playing with data model code and previewing data impact in the Sandbox, risk-free

medium.com/inthepipeline/previ

Recce - Trust, Verify, ShipDataRecce
2024-12-30

✳️ dbt Best Practices in Action ✳️

Optimizing your dbt workflows is essential for scalable, reliable data pipelines.

See how California Integrated Travel Project (Cal-ITP) used dbt best practices to enhance their data infrastructure:

medium.com/inthepipeline/dbt-b

What’s the biggest challenge your team faces when implementing dbt?

Recce - Trust, Verify, ShipDataRecce
2024-12-20

Live preview dbt data model changes without needing to rebuild the model!

Do it right now in Recce, here’s how

loom.com/share/3f59853e11b6401

1. Edit model code
2. Diff with the original code
3. Instantly compare the data (no need to drop into dbt and rebuild)

Recce - Trust, Verify, ShipDataRecce
2024-12-19

Data impact analysis on multiple dbt nodes at the same time

Row count, schema, and col match percentage

- Select nodes w/ selection syntax
- Manually select nodes
- Save and re-run checks
- Automate checks in CI

Get Recce now:
github.com/datarecce/recce

Recce - Trust, Verify, ShipDataRecce
2024-11-27

Analytics engineers don't work alone - data validation involves stakeholders and PR reviewers

With Recce, Share your complete data validation environment with your team and review data impact together.

- Validate your work
- Create your checklist
- Share it with your team

medium.com/inthepipeline/enhan

Recce - Trust, Verify, ShipDataRecce
2024-10-18

If you’re new to Data Recce here’s a tip:

Copy a screenshot of any diff to the clipboard with one click

Perfect for pasting into your GitHub PR comment!

Give it a try on your next PR when you want to:

- show the reviewer your work is correct
- share results for discussion
giphy.com/gifs/github-clipboar

2024-10-09

Coalesce 2024 keynote announcements included among others:
* support for Apache Iceberg
* a DBT Copilot
* integrations with Power BI
* improved data quality and data freshness tools
* More visual editing support
* in general the vibe of making it easier to reference data across teams

#Coalesce #DBT #AnalyticsEngineering #Coalesce2024 #Data

coalesce.getdbt.com/

2024-09-26

Just been doing a LinkedIn Learning course on dbt - a tool used in Analytics Engineering. Delighted that they were using DuckDB as the backend.
This kind of use is so huge for learners. Instead of faffing around to get credentials/access to a remote DB server just run locally for free and quickly get on with the learning of the app.

#DuckDB #DBT #AnalyticsEngineering #DB
linkedin.com/learning/data-eng

Recce - Trust, Verify, ShipDataRecce
2024-08-29

What comes after the chaos of self-service analytics?

Embrace the chance to elevate the importance of data and your team

medium.com/inthepipeline/what-


Deirdre O'Learydeoleary@masto.ai
2024-08-26

I'm seeking a new challenge in a Data Engineer or Analytics Engineer role.

Yes, I know that I might not have the exact skillset you're looking for & that my background might not be what you expect. However, I can & will learn anything that I'm missing, which is something I'd love to do in a established Data Team.

1/3

#DataEngineering #AnalyticsEngineering #DataEngineer #AnalyticsEngineer #SQL #Data #Database #DataWarehouse #JobSeeking #JobHunt #AvailableForWork #GetFediHired

Client Info

Server: https://mastodon.social
Version: 2025.04
Repository: https://github.com/cyevgeniy/lmst