#tabulardata

Valeriy M., PhD, MBA, CQFpredict_addict@sigmoid.social
2025-06-14

TabPFNv2, the supposed few-shot wunderwaffe for tabular data, got pulled apart by Yandex researchers — and what they found was underperformance, inferiority to baselines, and shaky calibration.
Sometimes hype needs a gradient descent check. 📉

#tabulardata #machinelearning

Valeriy M., PhD, MBA, CQFpredict_addict@sigmoid.social
2025-05-22

A new model ViaSHAP powered by Kolmogorov Arnold Networks outperforms XGBoost on tabular data.

Paper from Henrik Bostroem (author of Crepes conformal prediction package) KTH research group.

#tabulardata

Valeriy M., PhD, MBA, CQFpredict_addict@sigmoid.social
2025-05-20

Make sure your models are calibrated to avoid costly mistakes.

#calibration #xgboost #datascience #tabulardata

Valeriy M., PhD, MBA, CQFpredict_addict@sigmoid.social
2025-05-20

Make sure your models are calibrated to avoid costly mistakes.

#calibration #xgboost #datascience #tabulardata

N-gated Hacker Newsngate
2025-05-17

🚨 Breaking news: Java developers discover Pandas rip-off with a name that sounds like a cheap gym 💪. promises to make tabular data “fahm” easy, but let's be real—it's just another way to make Java developers wish they had chosen Python instead 🐍.
github.com/moustafa-nasr/fahma

Open Knowledge Foundationokfn@fosstodon.org
2025-05-06

⭐️ What you’ll learn ⭐️

✅ Detecting and fixing errors in tables
• Learn to work with #tabulardata
• Don’t get lost when validating your spreadsheet
• Clean up your spreadsheets to gain valuable insights

👉🏾 More: buff.ly/9XVJxq4

🧵

Valeriy M., PhD, MBA, CQFpredict_addict@sigmoid.social
2025-04-17

If you're still using XGBoost in 2025, you're basically sending faxes in the age of fiber.

Boost smarter. Ditch the fossil.

#tabulardata

Rafael Perezrperezrosario
2025-04-07

Design patterns for presenting and manipulating tabular data.

"Datatable Design Patterns"
bootcamp.uxdesign.cc/data-tabl

Valeriy M., PhD, MBA, CQFpredict_addict@sigmoid.social
2025-04-01

Once you apply SMOTE, the Grim Reaper visits your dataset.
He doesn’t take lives — just precision, recall, and your dignity.
Your model performance? Undead, but not in a good way.

#tabulardata

Valeriy M., PhD, MBA, CQFpredict_addict@sigmoid.social
2025-03-27

A new bold paper from winged hussars.

#tabulardata

Open Knowledge Foundationokfn@fosstodon.org
2025-03-26

⭐️ What you’ll learn ⭐️

✅ Detecting and fixing errors in tables
• Learn to work with #tabulardata
• Don’t get lost when validating your spreadsheet
• Clean up your spreadsheets to gain valuable insights

👉🏾 More: blog.okfn.org/2025/03/25/annou

🧵

Valeriy M., PhD, MBA, CQFpredict_addict@sigmoid.social
2025-03-25

Let's make sure our models are properly calibrated.

Data Scientist: Absolutely. It’s crucial for our success.

#calibration #xgboost #datascience #tabulardata

Valeriy M., PhD, MBA, CQFpredict_addict@sigmoid.social
2025-03-23

When it comes to #tabulardata #catboost rules supreme, in probabilistic forecasting competition most of top winning submissiones used CatBoost.

Valeriy M., PhD, MBA, CQFpredict_addict@sigmoid.social
2025-03-22

In fact, a recent paper once again confirms CatBoost's dominance with tabular data, while XGBoost came in at just … number 10.

“AComprehensive Benchmark of Machine and Deep Learning Across Diverse Tabular Datasets”

#tabulardata #catboost

Valeriy M., PhD, MBA, CQFpredict_addict@sigmoid.social
2025-03-19

Just because thousands review these papers doesn't mean they know what actually works. Real life practitioners and experts often don’t have neither the time nor the incentives to serve as unpaid reviewers.

#tabulardata

Client Info

Server: https://mastodon.social
Version: 2025.04
Repository: https://github.com/cyevgeniy/lmst