#dataDon

2025-06-10

Only nine months late: my SQL adaptation of @drdrang’s Electoral College script, which has been sitting as a basically finished draft since the beginning of September. #datadon seanlunsford.com/2024/bringing

Elio Campitelli eliocamp
2025-06-05

Hey, folks. How would you estimate the regression of a normally distributed variable on a variable that is bounded between 0 and 1 and has many 0s and 1s (specifically: sea ice concentration)? I realised that normal linear regression really doesn't make sense. Is beta regression the better choice? betareg is not fitting (I get an "initial value in 'vmmin' is not finite" error).
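
A possible workaround, sketched under assumptions: beta regression is defined on the open interval (0, 1), so exact 0s and 1s are often compressed slightly inward with y' = (y (n - 1) + 0.5) / n, where n is the sample size (Smithson and Verkuilen, 2006), before fitting. Whether boundary values are what trigger the 'vmmin' error here is an assumption, and with many 0s and 1s a zero-one-inflated model may suit the data better than squeezing. The sketch is in Python rather than R, and the function name `squeeze_unit_interval` and the sample values are hypothetical.

```python
def squeeze_unit_interval(values):
    """Compress values in [0, 1] into the open interval (0, 1).

    Applies y' = (y * (n - 1) + 0.5) / n, where n is the sample size.
    """
    n = len(values)
    if n == 0:
        return []
    if any(v < 0.0 or v > 1.0 for v in values):
        raise ValueError("all values must lie in [0, 1]")
    return [(v * (n - 1) + 0.5) / n for v in values]


# Hypothetical sea ice concentrations with exact 0s and 1s.
concentration = [0.0, 0.0, 0.25, 0.8, 1.0, 1.0]
squeezed = squeeze_unit_interval(concentration)
print(min(squeezed), max(squeezed))
```

The amount of squeezing shrinks as n grows, so large samples stay close to their original values.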

Estelle Platini estelle@techhub.social
2025-04-18

"The reason you begin tracking your data is that you have
some uncertainty about yourself that you believe the data
can illuminate. It’s about introspection, reflection, seeing
patterns, and arriving at realizations about who you are
and how you might change."
—Eric Boyd, self-tracker

an article by Natasha D. Schüll, 2019, "The Data-Based Self:
Self-Quantification and the Data-Driven (Good) Life" natashadowschull.org/wp-conten

#concentration #credit #scores #scoring #reward #rewards #psychology #socioPsych #socioPsychology #selfWorth #universalism #digitalization #recognition #data #AIRisks #AIEthics #gaming #socialization #Tracking #surveillance #selfRegulation #attention #reintermediation #intermediation #enshittification #risk #derisking #vulnerability #morality #selfConfidence #dataDon #Schüll #quotes

2025-04-04

@data @datadon 🧵

In its judgment, the General Court of the European Union strengthened the conciliation and regulatory powers of the European Data Protection Board.

"The DPC will have to continue its investigations into Meta unless it chooses to refer the matter to the European Court of Justice. Given the timeline of the original investigation, which began in 2018, it may still take several years before a final determination is made regarding the GDPR compliance of Meta’s business model, particularly in relation to behavioural advertising."

dpoblog.eu/cjeu-and-the-powers by Christina Etteldorf, Institute of European #Media #Law, Saarbrücken (Germany)

#advertising #GDPR #Meta #WhatsApp #instagram #CJEU #DPC #EDPB #Etteldorf #networkPolitics #targetedAdvertising #targetedAdvertisement #dataProtection #dataMining #dataDon #dataCapture #dataGovernance #retaliation

Elio Campitelli eliocamp
2025-03-28

What's easier to interpret as a measure of linear relationship between two variables?

2025-02-11

Edit: Found one, see reply. Feel free to make any other suggestions though :)

Anyone know of a function dependency graph visualiser for Python that does the same thing(ish) as foodwebr for R? github.com/lewinfox/foodwebr

Everything I've found so far does module dependency graphs (e.g. github.com/gauge-sh/tach), which are cool, but I'd really like to get a graph of **function** dependencies, to see more finely "who calls who".
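
To make the request concrete, here is a minimal sketch of a function-level call graph built with Python's standard `ast` module. It is not how foodwebr or tach work; it is a hand-rolled illustration that only tracks plain calls between functions defined at the top level of one source string. The helper name `function_call_graph` and the example source are hypothetical.

```python
import ast


def function_call_graph(source):
    """Map each top-level function name to the top-level functions it calls."""
    tree = ast.parse(source)
    # Only functions defined at module level in this source are tracked.
    defined = {node.name for node in tree.body if isinstance(node, ast.FunctionDef)}
    graph = {}
    for func in tree.body:
        if not isinstance(func, ast.FunctionDef):
            continue
        callees = graph.setdefault(func.name, set())
        for node in ast.walk(func):
            # Plain calls such as helper(x); method calls like obj.f() are ignored.
            if isinstance(node, ast.Call) and isinstance(node.func, ast.Name):
                if node.func.id in defined:
                    callees.add(node.func.id)
    return {name: sorted(called) for name, called in graph.items()}


example = '''
def load():
    return [1, 2, 3]

def clean(data):
    return [x for x in data if x > 1]

def main():
    return clean(load())
'''

for caller, callees in function_call_graph(example).items():
    print(caller, "->", callees)
```

The resulting dictionary is an adjacency list, so it could be handed to a graph-drawing tool of your choice.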

#rStats #Python #Datadon

devSJR :python: :rstats: devSJR@fosstodon.org
2025-02-09

@eliocamp

#python #Datadon

This link luzmo.com/blog/python-chart-li gives a nice overview.

In my case I am well served with seaborn and matplotlib in most cases when I use Python.

Regarding the tutorial I don't know the level you start from and I don't know if you prefer reading or watching but here are some recommendations.

matplotlib.org/stable/tutorial
w3schools.com/python/matplotli
dataquest.io/tutorial/data-vis
geeksforgeeks.org/graph-plotti

Elio Campitelli eliocamp
2025-02-09

Hey, folks. What is the best current plotting library for static plots (publication to PDF and HTML)? And what is the best tutorial to learn it?

2024-12-17

Enteric Fermentation in 2022

Livestock digestion emits too much methane:
* Too many bovines in India, Pakistan, Brazil, United States, China;
* Too many sheep and pigs in China.

(The bubble sizes depend on the amount of methane emitted in 2022.)

#GreenhouseForcing #methane #emissions #climateChange #climateBreakdown #climateCollapse #dataViz #bubbleChart #dataMining #plotly #featureEngineering #featureSelection #dataDon

2024-12-17

@data @datadon

#DataViz on two requirements:
* zooming, panning and rescaling
* shareable dashboards

"Plotly vs. Bokeh: Interactive Python Visualisation Pros and Cons", by Dr Paul Iacomi: pauliacomi.com/2020/06/07/plot

#dataDev #retrieval #dataMining #plotly #Dash #Bokeh #python #dataInteraction #data #dataDon #widgets #ipython #jupyter #dashboards #businessIntelligence

rmoff 🏃🏻 🍺 🥓 rmoff@data-folks.masto.host
2024-11-22

It's trite to start a blog post share with "A common question I hear is…", but in this case it's *true*! One of the most common things I see people asking how to do is get data from #Postgres to #Snowflake, so I wrote a blog post about how to do it with @decodable. You can use a web UI or YAML: your choice.

✍️ dcbl.link/postgres-to-snowflak

#dataEngineering #ELT #ETL #datadon

A whimsical illustration split into two panels. The left panel features a cute, blue elephant with closed eyes, standing on a snowy ground. The elephant is blowing snowflakes and sparkles from its trunk into the right panel. The background is dark with hanging stars and geometric shapes. The right panel shows a large, intricate snowflake surrounded by swirling snow and sparkles against a blue background, with abstract tree shapes below.

community, brush up on ML, work on some personal projects, finally learn some front-end, and figure out what's next. My plan is to blog as I go and meet some awesome people along the way. Excited to be here! #introduction #datadon (2/2)

2024-10-26

If you’re not there yet, check it out #datadon

2024-10-06

@data @datadon
"Obtaining the data is a hard human problem. That is, 'people don’t want to give it to you'."

You can "use your allies on the bottom (front-line workers) and the top (executives) to squeeze out your opponents in the middle (managers, often in IT or data science departments)".

Sarah Constantin: sarahconstantin.substack.com/p

#industry #power #factory #processes #dataPrivacy #confidentiality #dataDon #dataGovernance #security #dataAccess #analytics #powerTalk #data #dataAnalysis #losers

Elio Campitelli eliocamp
2024-09-10

Stats question... x and y are random variables with equal measurement error. I can compute the leading PC as orthogonal regression. The variance explained by this PC is 0.91. However, the R^2 between the two variables is 0.64. What's the interpretation of these two numbers in this context?

Scatterplot of x and y showing a good linear positive relationship and a black line of best fit.
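
One way to relate the two numbers, as a sketch under an assumption the post does not state: if x and y are standardized (or have similar variances), their covariance matrix has eigenvalues 1 + |r| and 1 - |r|, so the leading PC carries a share (1 + |r|) / 2 of the total variance. The helper name `leading_pc_share` is hypothetical; the only input taken from the post is R^2 = 0.64.

```python
import math


def leading_pc_share(var_x, var_y, cov_xy):
    """Fraction of total variance on the leading principal component (2 variables)."""
    mean_var = (var_x + var_y) / 2
    # Closed-form largest eigenvalue of the 2x2 covariance matrix.
    largest = mean_var + math.sqrt(((var_x - var_y) / 2) ** 2 + cov_xy ** 2)
    return largest / (var_x + var_y)


r_squared = 0.64            # value quoted in the post
r = math.sqrt(r_squared)    # positive relationship, so r is about 0.8

# Assumption: standardized variables, so unit variances and covariance equal to r.
share = leading_pc_share(1.0, 1.0, r)
print(round(share, 2))
```

Under that assumption the share comes out near 0.90, close to the 0.91 quoted. The PC share starts at 0.5 for uncorrelated standardized variables, whereas R^2 starts at 0, so the two numbers sit on different scales.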
2024-08-27

In machine learning, the data from an observation is typically represented as a vector.

A neural network can be seen as a large collection of linear models. We may represent the inputs and outputs of each layer as vectors, matrices, and tensors (which are like higher dimensional matrices).
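
A minimal sketch of that idea in plain Python: one layer computes W x + b, and each row of W together with its bias entry is a single linear model of the input vector. The function name `linear_layer` and all numbers are hypothetical, and the nonlinear activation that real networks apply between layers is left out to keep the linear part visible.

```python
def linear_layer(weights, bias, inputs):
    """Compute W x + b for one layer, with W given as a list of rows."""
    return [
        sum(w * x for w, x in zip(row, inputs)) + b
        for row, b in zip(weights, bias)
    ]


x = [1.0, 2.0]                              # input vector (one observation)
W = [[0.5, -1.0], [2.0, 0.0], [1.0, 1.0]]   # 3 x 2 weight matrix
b = [0.0, 1.0, -0.5]                        # one bias per output unit

outputs = linear_layer(W, b, x)
print(outputs)  # [-1.5, 3.0, 2.5]
```

Stacking a batch of observations turns x into a matrix, and stacking further (for example image channels) gives the higher-dimensional tensors mentioned above.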

#algebra #linearAlgebra #vectors #matrices #determinants #singularity #ML #DataScience #math #maths #mathematics #mathStodon #ML #data #dataDon #dataScience #machineLearning #DeepLearning #neuralNetworks

A neural network can be seen as a large collection of linear models.

Creative Commons License, by Luis Serrano for DeepLearning.AI
2024-05-17
2024-05-14

This is 🔥🔥🔥 and the tooling I wish I had when writing custom REST extractors

Automated pagination is the kind of feature that will make many DEs very happy

dlthub.com/docs/blog/rest-api-
#datadon
