#tabularData

Valeriy M., PhD, MBA, CQF predict_addict@sigmoid.social
2025-06-23
Valeriy M., PhD, MBA, CQF predict_addict@sigmoid.social
2025-06-20

Save time and avoid unnecessary preprocessing.
✅ Final takeaway:
Scaling isn't a universal rule — it's a tool.
Understand when and why it matters, and you’ll build faster, more efficient, and more robust ML systems.

#tabulardata #machinelearning

Valeriy M., PhD, MBA, CQF predict_addict@sigmoid.social
2025-06-19

In fact, a recent paper once again confirms CatBoost's dominance with tabular data, while XGBoost came in at just … number 10.

“A Comprehensive Benchmark of Machine and Deep Learning Across Diverse Tabular Datasets”

#tabulardata #catboost

Valeriy M., PhD, MBA, CQF predict_addict@sigmoid.social
2025-06-19

in critical applications like human health, finance and self-driving cars? No reasonable person will.

arxiv.org/pdf/2502.08978

#tabulardata

Valeriy M., PhD, MBA, CQF predict_addict@sigmoid.social
2025-06-14

TabPFNv2, the supposed few-shot wunderwaffe for tabular data, got pulled apart by Yandex researchers — and what they found was underperformance, inferiority to baselines, and shaky calibration.
Sometimes hype needs a gradient descent check. 📉

#tabulardata #machinelearning

Valeriy M., PhD, MBA, CQF predict_addict@sigmoid.social
2025-05-22

A new model, ViaSHAP, powered by Kolmogorov-Arnold Networks, outperforms XGBoost on tabular data.

Paper from the KTH research group of Henrik Bostroem (author of the Crepes conformal prediction package).

#tabulardata

Valeriy M., PhD, MBA, CQF predict_addict@sigmoid.social
2025-05-20

Make sure your models are calibrated to avoid costly mistakes.

#calibration #xgboost #datascience #tabulardata

N-gated Hacker News ngate
2025-05-17

🚨 Breaking news: Java developers discover Pandas rip-off with a name that sounds like a cheap gym 💪. promises to make tabular data “fahm” easy, but let's be real—it's just another way to make Java developers wish they had chosen Python instead 🐍.
github.com/moustafa-nasr/fahma

Open Knowledge Foundation okfn@fosstodon.org
2025-05-06

⭐️ What you’ll learn ⭐️

✅ Detecting and fixing errors in tables
• Learn to work with #tabulardata
• Don’t get lost when validating your spreadsheet
• Clean up your spreadsheets to gain valuable insights

👉🏾 More: buff.ly/9XVJxq4

🧵

Valeriy M., PhD, MBA, CQF predict_addict@sigmoid.social
2025-04-17

If you're still using XGBoost in 2025, you're basically sending faxes in the age of fiber.

Boost smarter. Ditch the fossil.

#tabulardata

Rafael Perez rperezrosario
2025-04-07

Design patterns for presenting and manipulating tabular data.

"Datatable Design Patterns"
bootcamp.uxdesign.cc/data-tabl

Valeriy M., PhD, MBA, CQF predict_addict@sigmoid.social
2025-04-01

Once you apply SMOTE, the Grim Reaper visits your dataset.
He doesn’t take lives — just precision, recall, and your dignity.
Your model performance? Undead, but not in a good way.

#tabulardata

Valeriy M., PhD, MBA, CQF predict_addict@sigmoid.social
2025-03-27

A new bold paper from winged hussars.

#tabulardata

Open Knowledge Foundation okfn@fosstodon.org
2025-03-26

⭐️ What you’ll learn ⭐️

✅ Detecting and fixing errors in tables
• Learn to work with #tabulardata
• Don’t get lost when validating your spreadsheet
• Clean up your spreadsheets to gain valuable insights

👉🏾 More: blog.okfn.org/2025/03/25/annou

🧵

Valeriy M., PhD, MBA, CQF predict_addict@sigmoid.social
2025-03-25

Let's make sure our models are properly calibrated.

Data Scientist: Absolutely. It’s crucial for our success.

#calibration #xgboost #datascience #tabulardata

Valeriy M., PhD, MBA, CQF predict_addict@sigmoid.social
2025-03-23

When it comes to #tabulardata, #catboost reigns supreme: in a probabilistic forecasting competition, most of the top winning submissions used CatBoost.
