#SciOp

jonny (good kind) jonny@neuromatch.social
2025-11-04

Here's another #FEP for representing torrents on activitypub :)

short, sweet, and with a reference implementation and tests!

towards a federated bittorrent tracker with #sciop !

PR: codeberg.org/fediverse/fep/pul

Discussion: socialhub.activitypub.rocks/t/ (or this thread)

#FEP_d8c8 #BitTorrentOverActivityPub #FederatedP2P #BitTorrent

ℒӱḏɩę 💾☮∞🎶♲☀🔋 Lydie@tech.lgbt
2025-10-28

This is extremely sad y'all. ZERO seeding going on from the SciOp / public data / antifa torrent server. Have a few spare gigs? Even that's something, please help preserve data the fascists are destroying!

#resist #fascism #datahoarder #digitalpreservation #archive #sciop #torrent #publicdata

All torrents in the list are 100% complete and are in a "Seeding" state, meaning the user is sharing the files with others. The "Down Speed" and "Up Speed" columns all read "0 B/s", indicating no active transfers at the moment the screenshot was taken.

The torrents listed are very large datasets, many appearing to be scientific or academic in nature. Here are the first few items in the list:

    Name: ExtremeWeather_a large-scale climate dataset
        Size: 1.503 TiB
        Status: Seeding
        Uploaded: 1.229 TiB

    Name: SDF
        Size: 1.107 TiB
        Status: [F] Seeding
        Uploaded: 9.91 GiB

    Name: ASN
        Size: 1.107 TiB
        Status: [F] Seeding
        Uploaded: 49.43 GiB

    Name: trump protest jan 06 2021
        Size: 1.013 TiB
        Status: [F] Seeding
        Uploaded: 18.78 GiB

    Name: land-normalized-difference-vegetation-index
        Size: 1,001.50...
        Status: Seeding
        Uploaded: 1.03 GiB

Other notable names in the list include biorxiv_2023, biorxiv_2021, biodiversity-library-org, National-Archives, and hydrological-properties, all with sizes in the hundreds of gibibytes (GiB) or over a tebibyte (TiB).

Boorboor
2025-09-28

Finally had some spare time to learn more about the project recently. I created a seed box to preserve several datasets at risk of censorship, including over 1.2TB and 94,000 images. I then configured it to serve the photos and podcast for my own enjoyment. I even made a little preroll video for my server just for fun. :catjam:

sciop.net/

Two side-by-side browser windows showing the Sciop Smithsonian datasets on the right and torrents of the same datasets being seeded on the left.

A single frame image used as the basis for a Plex preroll, including the title “Smithsonian uncensored” with their logo and a QR linking to the Sciop project.
2025-09-08

It would be nice if #sciop linked to the actual takedown request for datasets tagged with "takedown_issued"

2025-09-08

Would #FreeBSD be better for seeding massive amounts of data over BitTorrent than #Linux because it's better at disk I/O?
#sciop #AnnasArchive #archive #AskFedi

@nyxmir If the CDC used to have it, it might still be on SciOp.

If you don't know SciOp, it's a community where people seed (through Torrent) data threatened or deleted by Trump and other fascists. The datasets are hefty, but the point is to share and keep them available for everyone:

sciop.net/datasets/?query=cdc&

Good luck!

#SciOp #Science #CDC #P2P

jonny (good kind) jonny@neuromatch.social
2025-09-04

the haters all scoffed when i "embedded a whole set of bitmap fonts for a single use" but i knew i was following the light and the way. #sciop is in its "intelligibility" era, where we will do things like "tell people what is going on" and "comment on stuff" and "chat about moderation decisions" and whatnot.

A "whats new" box embedded on the sciop.net homepage, it's styled as a windows 98 alert box, blue gradient header bar, pixelated bitmap fonts, and beveled edges and all. it contains two posts:


The Modern Webseed
25-08-29 - Jonny - sciop-blog

We have added the ability to add and validate webseeds to torrents from the website, so even if you can't run a torrent client, you can join the swarm and keep endangered information alive.
Welcome to SciOp The Blog!
25-08-26 - Jonny - sciop-blog

This is the start of the sciop blog!


and the alert messages at the bottom read
Swarm Churn: Optimal
NAT Holepunching: Occasional

jonny (good kind) jonny@neuromatch.social
2025-09-01

#Sciop hit a Petabyte (actually a Pebibyte but nobody knows that word) of total proven capacity a week or two ago. That's all the seeders * the size of the things they are seeding. All volunteers, zero dollars in funding, piggybacking off existing resources wherever we can, run on a donated VPS. This is before we even get into federating archives and are still nailing down the basics of the site.

Peer to peer archives are real and they work, period. 216TiB of threatened cultural, climate, queer, and historical information held in common. That's a people powered archive, and you're welcome in it - to take from, to add to, and help sustain if you can.

Edit: if this is the first you're hearing of sciop, it's at sciop.net

Indexing 233 datasets
with 927 uploads.

13416 peers, 8590 seeders sharing
216.8 TiB in 8514732 files.

Swarm capacity 1.0 PiB
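
The "all the seeders * the size of the things they are seeding" figure can be sketched in a few lines. This is only an illustration of that arithmetic, not SciOp's actual code: the `Torrent` class, the `swarm_capacity` function, and the example numbers are all invented here.

```python
from dataclasses import dataclass

TIB = 2**40  # bytes in a tebibyte
PIB = 2**50  # bytes in a pebibyte


@dataclass
class Torrent:
    """Hypothetical record: one torrent and how many peers seed it."""
    name: str
    size_bytes: int
    seeders: int


def swarm_capacity(torrents):
    """Sum of (seeders * size) over all torrents, in bytes."""
    return sum(t.seeders * t.size_bytes for t in torrents)


# Invented example figures, not real SciOp data.
example = [
    Torrent("dataset-a", 2 * TIB, 3),    # 3 full copies of 2 TiB
    Torrent("dataset-b", 1 * TIB, 10),   # 10 full copies of 1 TiB
    Torrent("dataset-c", TIB // 2, 0),   # unseeded, contributes nothing
]

capacity = swarm_capacity(example)
print(f"{capacity / TIB:.1f} TiB = {capacity / PIB:.4f} PiB")
```

Because every seeder holds a full copy, capacity grows with redundancy as well as with the amount of unique data, which is why it can be several times larger than the total indexed size.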
jonny (good kind) jonny@neuromatch.social
2025-08-30

First post on the #sciop blog - on adding webseeds from the web interface and why this is cool for bridging archives and bringing bigger systems into the swarm. If you can't run a bittorrent client but still want to seed, this post is for you!

with art from @aud and @lina

blog.sciop.net/2025-08-29/webs
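
For anyone curious what "adding a webseed" means at the file level: under BEP 19, a .torrent file lists HTTP sources in a top-level `url-list` key, outside the `info` dictionary, so the infohash stays the same. Below is a rough sketch of that idea with a tiny hand-rolled bencode codec. It is not how the SciOp site does it; `add_webseed`, the toy torrent, and the example.* URLs are made up for illustration.

```python
def bencode(value) -> bytes:
    """Encode ints, str/bytes, lists and dicts as bencode."""
    if isinstance(value, int):
        return b"i%de" % value
    if isinstance(value, str):
        value = value.encode("utf-8")
    if isinstance(value, bytes):
        return b"%d:%s" % (len(value), value)
    if isinstance(value, list):
        return b"l" + b"".join(bencode(v) for v in value) + b"e"
    if isinstance(value, dict):
        items = [(k.encode("utf-8") if isinstance(k, str) else k, v)
                 for k, v in value.items()]
        items.sort(key=lambda kv: kv[0])  # bencode dict keys are sorted
        return b"d" + b"".join(bencode(k) + bencode(v) for k, v in items) + b"e"
    raise TypeError(f"cannot bencode {type(value).__name__}")


def bdecode(data: bytes, i: int = 0):
    """Decode one bencoded value starting at index i; return (value, next_index)."""
    c = data[i:i + 1]
    if c == b"i":
        end = data.index(b"e", i)
        return int(data[i + 1:end]), end + 1
    if c == b"l":
        i, out = i + 1, []
        while data[i:i + 1] != b"e":
            v, i = bdecode(data, i)
            out.append(v)
        return out, i + 1
    if c == b"d":
        i, out = i + 1, {}
        while data[i:i + 1] != b"e":
            k, i = bdecode(data, i)
            out[k], i = bdecode(data, i)
        return out, i + 1
    colon = data.index(b":", i)          # byte string: <length>:<bytes>
    start = colon + 1
    n = int(data[i:colon])
    return data[start:start + n], start + n


def add_webseed(torrent_bytes: bytes, url: str) -> bytes:
    """Return torrent bytes with url appended to the BEP 19 'url-list'."""
    meta, _ = bdecode(torrent_bytes)
    urls = meta.get(b"url-list", [])
    if isinstance(urls, bytes):          # BEP 19 also allows a single string
        urls = [urls]
    if url.encode("utf-8") not in urls:
        urls.append(url.encode("utf-8"))
    meta[b"url-list"] = urls
    # Assumes the input was canonically encoded (sorted keys), so that
    # re-encoding leaves the 'info' dict, and thus the infohash, unchanged.
    return bencode(meta)


# Toy v1-style metainfo with a fake piece hash, for illustration only.
toy = bencode({
    "announce": "https://tracker.example/announce",
    "info": {"length": 5, "name": "a.txt",
             "piece length": 16384, "pieces": b"\x00" * 20},
})
patched = add_webseed(toy, "https://mirror.example/files/")
meta, _ = bdecode(patched)
print(meta[b"url-list"])
```

The sketch only edits the metadata; actually validating that the HTTP source serves the right bytes is a separate step this example does not attempt.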

jonny (good kind) jonny@neuromatch.social
2025-08-27

I'm making a blog for #sciop because we keep doing dope shit and not writing it down. Putting a call out for guest artists who want to contribute fake GeoCities era banner ads

Data Rescue Project #DataRescue datarescueproject.org@bsky.brid.gy
2025-08-22

Worried about #Smithsonian data and collections? We are too. Our friends over at #SafeguardingResearchAndCulture have been hard at work helping with #DataRescue and adding Smithsonian information to #SciOp. Check out their available datasets and please spread the word: sciop.net/datasets/


jonny (good kind) jonny@neuromatch.social
2025-08-20

if you are seeding anything on #sciop (or anywhere else too) using qbittorrent (and probably other clients too), you should increase your max torrent size to something like 2GB - that's what's causing the recurring problem that many people have flagged to us where their torrents seem to disappear from their client after restarting:
github.com/arvidn/libtorrent/i

tools > options > advanced, set both torrent file size limit and bdecode token limit very high

v2 torrents are very very good for archives, but they are more rarely used in piracy, so there is comparatively less optimization pressure for them. so this explains why our seed stats are so spiky, because we encourage hybrid/v2, and by default any v2 torrent larger than a few dozen GB will just go poof on restart.

edit: this was actually fixed in qbt 5.1.2, so you can also just update
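
If you want to check whether any of your .torrent files are big enough to run into a client-side size limit, a quick scan like the one below may help. It is a sketch only: `oversized_torrents` is a made-up helper, and the right `limit_bytes` depends on your client and version, so read the actual value from the advanced options rather than trusting any number here.

```python
import tempfile
from pathlib import Path


def oversized_torrents(directory, limit_bytes):
    """Return (path, size) pairs for .torrent files larger than limit_bytes."""
    hits = []
    for path in sorted(Path(directory).rglob("*.torrent")):
        size = path.stat().st_size
        if size > limit_bytes:
            hits.append((path, size))
    return hits


# Demo with throwaway files and a tiny limit so the example runs anywhere.
with tempfile.TemporaryDirectory() as tmp:
    (Path(tmp) / "small.torrent").write_bytes(b"x" * 10)
    (Path(tmp) / "big.torrent").write_bytes(b"x" * 100)
    for path, size in oversized_torrents(tmp, limit_bytes=50):
        print(path.name, size)
```

In real use you would point it at the folder where your client keeps its .torrent files and pass the limit currently configured in the client.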

jonny (good kind) jonny@neuromatch.social
2025-08-19

Last week trump announced plans to "review" 8 Smithsonian museums. Today he doubled down, very explicit about the intent to revise history to reflect the ethno-nationalist fantasy of US history.

You can do something about that! We are backing up the digital archives of those museums on sciop: sciop.net/tags/smithsonian

You can take direct action to preserve the historical artifacts the right wants to destroy:

1) you can download a copy and seed it, every seeder counts. Subscribe to the smithsonian RSS feed to auto-download torrents as they are scraped.

2) we have also written a crawler connected to sciop that distributes the scraping work, and automatically creates and uploads a validated torrent that piggybacks off the s3 bucket as a webseed source while it lasts (instructions in reply).
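
Most torrent clients can subscribe to an RSS feed directly, but option 1 can also be scripted. The sketch below assumes the feed is ordinary RSS 2.0 with one `<enclosure>` per item pointing at a .torrent file; the real feed layout may differ, and `SAMPLE_FEED`, `torrent_links`, and the example.org URLs are invented for illustration.

```python
import xml.etree.ElementTree as ET

# Made-up feed in the assumed shape: RSS 2.0 items with torrent enclosures.
SAMPLE_FEED = """<?xml version="1.0"?>
<rss version="2.0"><channel>
  <title>example feed</title>
  <item>
    <title>museum-a-jpg</title>
    <enclosure url="https://example.org/torrents/museum-a-jpg.torrent"
               type="application/x-bittorrent" length="1234"/>
  </item>
  <item>
    <title>museum-b-tif</title>
    <enclosure url="https://example.org/torrents/museum-b-tif.torrent"
               type="application/x-bittorrent" length="5678"/>
  </item>
</channel></rss>"""


def torrent_links(feed_xml, seen=()):
    """Return (title, url) for each item enclosure not already in seen."""
    root = ET.fromstring(feed_xml)
    links = []
    for item in root.iter("item"):
        enclosure = item.find("enclosure")
        if enclosure is None:
            continue  # item without a downloadable file
        url = enclosure.get("url")
        if url and url not in seen:
            links.append((item.findtext("title", default=""), url))
    return links


for title, url in torrent_links(SAMPLE_FEED):
    print(title, url)
```

Passing the URLs you have already fetched as `seen` lets a periodic job pick up only newly scraped torrents.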

The data from the 8 threatened museums is on the order of ~10 TB, and we have split it up by jpg/tif so people without much spare storage can join in on the jpg's at least. The full contents of the public smithsonian bucket is ~700TB, so if we want to have a full independent copy we'll need lots more seeders.

All this code is being written flat out, on the run, as it's needed by volunteers with exactly zero resources, so it's not polished or well documented, and if you're interested in helping damp the flames of the book burning by contributing to any of the code or docs, we'd love to have you.

#Smithsonian #Sciop

Henrik Schönemann lavaeolus@fedihum.org
2025-08-09

The slides of my talk at #WHY2025 "Safeguarding Research & Culture: Save public data from the digital bookburnings!" are now online:
hu.berlin/SRC-WHY2025

Recording here (27min):
media.ccc.de/v/why2025-238-saf
(Wow, awesome work by @c3voc 💜)

More context: program.why2025.org/why2025/ta

#SafeguardingResearch #SciOp @SafeguardingResearch

Safeguarding Research/Culture SafeguardingResearch@fedihum.org
2025-08-08

Some of you may have seen the news re National #Climate Assessment Reports
cnn.com/2025/08/07/climate/wri

A friendly reminder:
They are all accessible here in this archive from November globalchange.govarchive.us/

As well as on #SciOp (from April)
sciop.net/datasets/globalchang

Teun 🌏 ❤️ 🏳️‍🌈 🇺🇦 🇵🇸 teun@kolektiva.social
2025-07-29

Digital archival projects are crucial in the fight against fascism. I wrote about the why and the how.

And if you're reading this, that means you have a computer, so you too can contribute!

carefullmusings.bearblog.dev/t

#ArchiveTeam #SciOp #fascism #archive #resistance #DigitalPreservation

2025-07-29

On SciOp, looking for data to protect from digital book burning, I find that these USA gov Coronavirus sites now point to a single site, promoting the lab leak conspiracy theory, attacking Fauci, China, science, common sense and being one of the most cringeworthy pieces of disinformation I've seen on the net.

I'll link to SciOp, not to the bullshit site:
sciop.net/datasets/coronavirus

#politics #science #SciOp #trump #coronavirus #COVID19 #torrent #p2p #fascism #usa

ℒӱḏɩę 💾☮∞🎶♲☀🔋 Lydie@tech.lgbt
2025-07-27

I just downloaded every single #torrent from #SciOp and am ensuring that all of them are now in the #antifa torrent server.

btw does anyone want the complete 1.2 gigs of torrents from SciOp in one ZIP package?

#resist #fascism #datahoarder #digitalpreservation #archive

jonny (good kind) jonny@neuromatch.social
2025-07-23

man i just had a series of extremely good ideas* that are very simple and very implementable for #sciop that i think will cause an absolutely disgusting amount of (good, intrinsically deduplicating, actually decrease server load by creating a supporting swarm of peers) public data scraping to happen and basically lower the barrier to scouting endangered datasets to zero

*if you received the message flood of me having them you are not allowed to tell people if they are actually bad
