#PublicVsPrivate

Seth Goldstein seth@sethgoldstein.me
2024-12-08

For The Love Of The Web. Posting Publicly Is Going To Get Used In Some Way

Sam Cole over at 404 Media wrote an article about a Hugging Face Machine Learning Librarian making a public data set of 1 million Bluesky posts available to everyone for Machine Learning.

People were, of course, outraged. After all, it’s the Internet. People thrive on being outraged, pissed off, and otherwise salty.

What people seem to miss is that what they’re posting on Bluesky is public and scrapable.

The way this guy made the data set was a bit sloppy and, in my opinion, irresponsible. He didn’t anonymize the data and left personally identifiable information in the data set. He also didn’t get consent from people first.

Yeah, I agree it feels a bit icky that this was done, mostly because there was no consent and no anonymizing of the data. But for the love of the Web, what you put online publicly is PUBLIC. People will see it and possibly use it for whatever they want. How hard is this to grasp?

This kind of data collection, according to Sam’s article, is also in a legal gray area right now and is being worked through in courts around the world.

To give some credit to the librarian, he took down the data set after getting quite a bit of “feedback.” 😵‍💫😜

But that didn’t stop the trolls from making even bigger data sets and putting them online.

I really do in fact understand why people are upset, but those posts are public. Don’t post stuff and expect it to be private when it’s PUBLIC!

Honestly, I’m fine with the content that I post publicly being used to train LLMs and AI, because it will improve the technology that I benefit from.

I agree with Rand Fishkin, the founder of Moz and Sparktoro.

He posted on Bluesky:

I know others are probably upset about this, but LLM training is, for me, a benefit of participating in spaces like this. I *want* my word usage, brands, and content to be part of how AI answers questions in the future. Just like I wanted Google to index my websites.

— Rand Fishkin (@randfish.bsky.social) December 8, 2024 at 4:06 PM

I don’t think that’s a crazy desire. Right? Am I completely off-base? What do you think?

#AI #Bluesky #Data #Datasets #LLMs #MachineLearning #PublicVsPrivate

white cloud sky - Photo by Kumiko SHIMIZU on UnSplash.com
Mojo ♻️ mojo@aus.social
2024-08-15

@peachfront

In Australia, we've unfortunately copied much of the American blueprint when it comes to education. Our private schools are now better funded by the federal government by a factor of three to one compared to public schools. This has created a widening gap in access to quality education, where private schools thrive while public schools struggle with underfunding. If all students, regardless of their background, attended the same schools, we'd see a fairer distribution of resources and opportunities, preventing the deepening divide that exists today.

#AustraliaEducation #PublicVsPrivate #EducationFunding #EqualityInEducation #CloseTheGap

2022-11-27

A fundamental problem with capitalist ideas of property and ownership is that the owner has the "right" (with some caveats) to do with it as they want. This includes destroying it, intentionally or through neglect (or ineptitude). This raises few issues for small, personal items, where the impact of the destruction is personalised. However, what happens when the property is privately owned but used, or relied on, by others? 1/-

#Discussion #PublicVsPrivate #Ownership #Property #Capitalism
