#datamining

2026-02-09

In order to watch the Super Bowl, as well as get access to the Jurassic World movies to satisfy my kiddo's current obsession with all things dinosaur, I did a month's subscription to Peacock. In installing the Peacock app onto my Android phone running a "degoogled" crDroid rom, TrackerControl detected a hilarious level of 10 embedded known trackers. Without the protection of a blocker like TrackerControl, this supposed streaming service is also a data mining extravaganza, sending tons of info back to all kinds of 3rd parties.


#SuperBowl #privacy #android #apps #trackercontrol #crdroid #Peacock #datamining #streaming #movies #enshittification #degoogled
Screenshot of TrackerControl app running on an Android phone, showing blocked fingerprinting and other tracking requests being made by the Peacock appScreenshot of TrackerControl app running on an Android phone, showing a list of 10 embedded tracker libraries found in the Peacock appScreenshot of TrackerControl app running on an Android phone, showing blocked advertising and analytics requests being made by the Peacock app
Jesse Spielmanheavyimage
2026-02-02

I had the idea recently: could we leverage the power of some of the tools used by the very powerful against them?
1/?

2026-01-30

Thách thức khi khai thác dữ liệu từ Twitter/X

Việc lấy dữ liệu sạch và cấu trúc từ Twitter/X ngày càng khó khăn do API chính thức đắt đỏ và các biện pháp chặn bot. Một giải pháp API mới vừa được phát triển nhằm giải quyết vấn đề hạ tầng (proxy, trình duyệt) để thu thập:

- Tweet, timeline, kết quả tìm kiếm.
- Hồ sơ người dùng, chỉ số tương tác.
- Theo dõi xu hướng và luồng thảo luận.

Bạn thấy nền tảng nào khó lấy dữ liệu nhất hiện nay?

#TwitterX #DataMining #WebScraping #API #KhaiThacDuLieu #

N-gated Hacker Newsngate
2026-01-22

🚨 Breaking News: Silicon Valley's elite 👔 now moonlight as military brass 🎖️, because who better to defend the nation than the geniuses behind data mining and targeted ads? 😂 Meanwhile, X.com reminds us that browsing without is akin to driving without wheels—just plain silly! 🙄
twitter.com/SecArmy/status/193

N-gated Hacker Newsngate
2026-01-20

🚨BREAKING: Popular app launcher decides to spice things up by becoming a data miner's paradise! Now you can enjoy personalized ads from your favorite corporate overlords right on your home screen. 👏 Because who needs privacy when you can have targeted ads, right? 😂
lemdro.id/post/lemdro.id/35049

2026-01-15

Cornell University: Digital humanities scholars chart lost art of maps in novels. “Digital humanities scholars from the Cornell Ann S. Bowers of Computing and Information Science have developed a computational system to mine maps from nearly 100,000 digitized books from the 19th and early 20th centuries, discovering that just 1.7% of novels include maps, mostly at the beginning or end, among […]

https://rbfirehose.com/2026/01/15/cornell-university-digital-humanities-scholars-chart-lost-art-of-maps-in-novels/
2026-01-06

When the veil gets lifted.

161 partners! Really? And this is not a unique case.
If a website pretends to be about information with this many partners, to me it's motivation is sales and your business model is flawed.

If I see a cookie notice with this much partners and/or don't see a 'disagree' option right away, I'm gone. Websites like these are leaches. If this website ended today I couldn't care.

#data #datamining #privacy #cookies #transparancy

2026-01-05

This is a great talk about the importance of data privacy and the risks of big tech. While it's likely not news to anyone on here, share it with people you know for whom it is news. #bigtech #privacy #ted #datamining

TED Talks Daily: This is what a digital coup looks like | Carole Cadwalladr

Episode webpage: go.ted.com/carolecadwalladr25

Media file: sphinx.acast.com/p/open/s/6758

Karl Dezeti ₀ ∫¹³⁷ ♀☺⸎ⁱ☻♂ ∂ℹdezeti@norden.social
2025-12-26

Strittig ist beim #Urheberschutz nicht, ob #Datamining für #AI #Menschenrechte unterläuft, sondern ..
.. ob die wörtliche Widergabe von Texten durch #LLM s Rechte von Konzernen verletzen:
scinexx.de/news/technik/copyri
.. ob das Entfernen von #Wasserzeichen für #KI die Rechte von Bilddiensten einschränkt:
ai-rockstars.de/gemini-2-0-fla
.. ob ein #Promt schützenswert ist:
meedia.de/news/beitrag/19639-g
Die Verwurstung von Inhalten soll offenbar umso legaler sein, je schlechter sie sich zum Original zuordnen lässt.

Cartoon:
Ein humanoider Roboter sitzt an einem Tisch und blickt auf eins von 6 Bildern, auf denen eine Katze mit Hut und Sonnenbrille in verschiedenen Posen zu sehen ist. Das Bild wird von einem Mann, der mit einem anderen Mann spricht, der hinter ihm in der Tür des Raumes steht. Er sagt: "Datenschutz und künstliche INtelligenz sind kein Widerspruch. Wir verwenden bei unserem KI-Training stets anonymisierte Daten.
Bildquelle: https://www.cloud-science.de/
Hartmut Seichterretrakker
2025-12-23

Gibt es Statistiken darüber wieviele Ingenieursstunden in die Rückgewinnung von maschinenlesbaren Daten mittels Tools wie z.B. geflossen sind weil in Deutschland die Digitalisierung bei PDFs von Exceltabellen aufhört?

Regina Mühlich ✅, DatenschutzReginaMuehlich
2025-12-22

Ein Fotograf kann die Nutzung einer seiner Fotografien durch einen Verein bei der Erstellung eines Datensatzes, der für das Training Künstlicher Intelligenz () genutzt werden kann, zu dulden haben. Der Verein kann sich hinsichtlich der Nutzung der heruntergeladenen Fotografie auf die Schrankenregelungen für das sog. Text und aus § 44b berufen.
otto-schmidt.de/news/wirtschaf

Markus Kastelitzlegalmemory@legal.social
2025-12-20

Österreich: RTR, Präsentation Text und #DataMining - Welche Botschaften senden österreichische Webseiten an #KI-Anbieter?
Anmeldung Präsentation TDM
info.rtr.at/anmeldung-praesent

Dr. Verónica Espinozaverukita1
2025-11-26

✨Do you already know Scimago Graphica?

I consistently incorporate this tool into my workshops, and I can tell you it’s a powerful platform that allows you to create all kinds of visualizations. It’s very easy to use, freely accessible, and quite intuitive.

I invite you to read my article on Medium about this tool
🔗 medium.com/@vespinozag/meet-sc

👇Here I’m sharing a set of visualizations I created with this tool

2025-11-19

Why “public AI”, built on open source software, is the way forward for the EU

A quarter of a century ago, I wrote a book called “Rebel Code”. It was the first – and is still the only – detailed history of the origins and rise of free software and open source, based on interviews with the gifted and generous hackers who took part. Back then, it was clear that open source represented a powerful alternative to the traditional proprietary approach to software […]

#ai #aiAct #cdsm #china #cloudComputing #copyrightDirective #dataMining #eu #freeSoftware #openSource #openSourceInitiative #paulKeller #publicAi #rebelCode #research #startups #supercomputers #tdm #textMining #us #ventureCapital

walledculture.org/why-public-a

IndieAuthors.Social Newsindieauthornews@indieauthors.social
2025-11-14

News Podcast: Amazon Launches Kindle Translate, Australia Blocks Data Mining Exception

On this episode of the Self-Publishing with ALLi podcast, Dan Holloway reports on Amazon’s launch of Kindle Translate, an AI-powered tool that allows translations between English, Spanish, and German. He discusses how it could open new…
selfpublishingadvice.org/news-

#Podcast #AITranslation #Anthropicsettlement #copyright #DataMining

Nick EspinosaNickAEsp
2025-11-01
Nick EspinosaNickAEsp
2025-11-01

Client Info

Server: https://mastodon.social
Version: 2025.07
Repository: https://github.com/cyevgeniy/lmst