#dataLad

Matthias Riße matrss
2026-03-07

@dbat the same issue exists in the research data management world with / . One thing that I am doing for our storage servers is regularly run on it. It requires filesystem support (xfs/btrfs), but deduplicates on an extent basis, so below the file level. If the difference between two versions only affects a small part of a file it should be able to help. I wonder if it could be run as a post-commit hook, or something like that.
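
The idea of deduplicating below the file level can be illustrated with a small Python sketch. It is only an illustration under stated assumptions: it uses fixed 4096-byte blocks and SHA-256 hashes on in-memory byte strings, whereas real extent-based deduplication on xfs/btrfs operates on filesystem extents and is not what this code does. The names `block_hashes` and `shared_block_count` and the two example versions are hypothetical.

```python
import hashlib

# Assumed block size for illustration only; real filesystems choose their own extent sizes.
BLOCK_SIZE = 4096


def block_hashes(data: bytes, block_size: int = BLOCK_SIZE) -> list[str]:
    """Split data into fixed-size blocks and return one SHA-256 digest per block."""
    return [
        hashlib.sha256(data[i:i + block_size]).hexdigest()
        for i in range(0, len(data), block_size)
    ]


def shared_block_count(old: bytes, new: bytes, block_size: int = BLOCK_SIZE) -> int:
    """Count blocks of `new` whose content also occurs somewhere in `old`."""
    known = set(block_hashes(old, block_size))
    return sum(1 for h in block_hashes(new, block_size) if h in known)


# Two hypothetical versions of a file: 100 distinct blocks, one of which is modified.
version_1 = b"".join(bytes([i]) * BLOCK_SIZE for i in range(100))
modified = bytearray(version_1)
modified[10 * BLOCK_SIZE] ^= 0xFF  # flip one byte inside block 10
version_2 = bytes(modified)

total = len(block_hashes(version_2))
shared = shared_block_count(version_1, version_2)
print(f"{shared} of {total} blocks could be shared between the two versions")
```

With one modified block out of 100, the remaining 99 blocks are byte-identical across both versions, which is the situation in which sub-file deduplication would be expected to help.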

Yann Büchau :nixos: nobodyinperson@fosstodon.org
2026-02-27

And here is my published dissertation @umphy, about quantifying the natural CO2 exhaust at the Starzach site in Southwest Germany (my result: ~10t/d):

hdl.handle.net/10900/176213

I used a lot of #FOSS software and hardware for all of it and it was amazing. Honorable mentions: #gitAnnex, #dataLad, #KiCAD, #OpenSCAD, #PlatformIO, #Arduino, #TexLaTeX. I just wish I'd used #nix / #nixOS sooner.

licensed #OpenAccess under #CreativeCommons CC-BY-4.0

#PhDLife

Figure 2: Examples of CO2 degassing at the Starzach site. Figure reproduced from Büchau et al. (2022, Appendix A, page 62, kindly provided by the publisher under a CC-BY-4.0 license). (a) diffuse degassing, small ascending gas bubbles (during spring 2020 flooding), (b) mofette with largest diameter, examined in 2015 by Lübben and Leven (2022), (c) picture by Martin Schon in 2019, groundwater monitor well, turned into the site’s most active mofette shortly after its deployment in 2014.

Table 2: Tabular comparison of four low-cost NDIR CO2 sensors evaluated for application at the Starzach site, reproduced from Büchau et al. (2022, Appendix A, page 65, kindly provided by the publisher under a CC-BY-4.0 license)

Figure 5: Gas flow funnel system mounted over the groundwater monitoring well (Figure 1, installed in 2014, which turned into a mofette shortly after deployment) at the Starzach site in 2022. Figure reproduced from (Büchau et al., 2024a, Appendix B, page 80, licensed under CC-BY-4.0).

Figure 9: Flux-gradient setup close to the ground, next to the Starzach site’s mofette with the largest diameter (30 cm, Figure 2b, examined in 2015 by Lübben and Leven, 2022). Four Sensirion SCD30 low-cost CO2 sensors each are mounted 40 cm above and below a Campbell Scientific IRGASON eddy covariance station at 60 cm height. Measurements of this setup are shown in Figure 10 and Figure 11.
Yann Büchau :nixos: nobodyinperson@fosstodon.org
2026-01-23

#forgejoAneksajo #gitAnnex #dataLad crowd:

Anyone else running into this experience-crippling #forgejo bug causing the activity page (de facto landing page for every user) to take extremely long to load (for me 10 seconds)?

codeberg.org/forgejo/forgejo/i

@datalad @forgejo

Yann Büchau :nixos: nobodyinperson@fosstodon.org
2026-01-13

Some more `git subtree push` quirks:

• `git subtree push` (obviously) does not push :gitannex: #gitAnnex files to the remote. Syncing annexed files there is unergonomic.
• `git subtree push` also strips commit signatures (e.g. GPG signatures, and with them #OpenTimeStamps timestamps). The truth lies in the monorepo only. Understandable, but very uncool.

git submodules have neither problem, but without tools like :datalad: #datalad you can't commit across all of them at once.

#git #gitSubTree

Matthias Riße matrss
2025-11-12

I've overheard someone at saying that the hash value in the distribits logo is from the first commit to and I've just checked and can confirm that this is true (except for the year in the middle, 360f isn't exactly a valid one). Relatedly, I've learned that DataLad apparently was called DataGit in the beginning. I don't actually know what to do with this information.

2025-11-12

Do you have big data to share in a forge? Try #forgejoaneksajo! It's an active soft fork of @forgejo, adding git-annex support.
@matrss gave a nice talk about it on @distribits, which you can watch there. Thank you!

distribits.live/talks/2025/ris

PS: I hope #GinGNode (gin.g-node.org) will update to this at some point in the future! It goes one step further, adding DOIs to datasets/code.

#distribits2025 #forgejo #distribits #gitannex #DataLad

WiNoDa Knowledge Lab WiNoDa@nfdi.social
2025-09-26

A new article has just been released in the

👩‍💻 𝒲𝒾𝒩𝑜𝒟𝒶 ℒ𝒶𝒷 𝒥𝑜𝓊𝓇𝓃𝒶𝓁 🗞️

Read about:
🔸 The tool DataLad
🔸 An event review to enhance DataLad skills

As always in English & German:
winoda.de/2025/09/25/drei-tage

credits to @fabr

You can still read our latest articles, if you missed them:
🔸 LOM (Learning Objective Matrix)
🔸 How acronyms influence our work (CARE, FAIR)

credits to @AvSchroeder

Upcoming on 29.09:
🔸 Certification of repositories

#Blog #article #winoda #LOM #Lernzielmatrix #DataLad #repositories

Asta v. Schröder AvSchroeder@fedihum.org
2025-09-25

Three days of DataLad-Workshop & Hackathon in Aachen winoda.de/en/2025/09/25/three- In everyday research, a lot of mostly heterogeneous data is generated, which is often processed and analyzed collaboratively. This involves complex workflows and ML pipelines consisting of numerous transformation and […] #WiNoDaKnowledgeLabJournalen #Data #DataCompetenceCenter #DataLad #Git #Report

2025-09-19

Hello Leipzig, Germany! We're looking forward to the #DataLad workshop on September 29th/30th at @ufz: events.hifis.net/event/2531

:datalad: :forgejo: :git: :gitannex:

Yann Büchau :nixos: nobodyinperson@fosstodon.org
2025-09-18

If you are attending this year's @distribits at the end of October¹ and you want to be dressed 'appropriately' 😉: last year I arranged for hellotux.com to offer :gitannex: #gitAnnex² and :datalad: #dataLad³ shirts and sweaters. Backpacks 🎒 and towels can now be ordered too.

I ordered several items and the quality is good. Properly stitched logos that never wash off. They are a small family business that uses and supports #FOSS.

²hellotux.com/git-annex
³hellotux.com/datalad
¹distribits.live/

Yann Büchau :nixos: nobodyinperson@fosstodon.org
2025-09-16

@olm_e @FreeCAD You could use :gitannex: #gitAnnex and/or :datalad: #dataLad to version control your designs. I have not tried it with #FreeCAD, but I do it all the time with #solveSpace and #OpenSCAD. Git Annex allows version-controlling arbitrarily large files and provides a simple `git annex assist` command to 'do the git stuff' and sync everyone up. I don't know how assemblies work in FreeCAD, but if it is one file that sources other, independent files, then it should work.

Yann Büchau :nixos: nobodyinperson@fosstodon.org
2025-08-28

@diegolakatos If you like :gitannex: #gitAnnex, there's a fork of #forgejo: #forgejoAneksajo:

codeberg.org/forgejo-aneksajo/

With this, you can also store arbitrarily big files on your forgejo, use it as a file syncing service like #syncthing, organise your research data or media database with #datalad, etc.

Michał Szczepanik doktorpanik@masto.ai
2025-08-19

Today, with @drballistic we had the pleasure of presenting a tutorial, Managing Scientific Data and Workflows with @datalad at EuroSciPy.

Our materials are at olebialas.github.io/DataLad-Eu

Thanks to the organizers, the recording is already available online: youtube.com/watch?v=vvYwPl9pau

A lot of cool people and projects around, looking forward to hearing the talks over the next days!

#euroscipy #euroscipy2025 #datalad #gitAnnex

Me with a microphone and a pointer, wearing a DataLad polo shirt and a conference badge, in front of a beamer screen. The slide on the screen is titled "A community project"; the slide text says: 10+ years of ongoing development and maintenance, 100+ contributors across core, extensions, and Handbook.
Michał Szczepanik doktorpanik@masto.ai
2025-08-17

I'm travelling to EuroSciPy 2025 in Kraków by train, with DB / PKP. I think it's so cool that you can do about a thousand kilometres with just one transfer (I'm counting hopping between stations in Berlin as one).

I'll be co-presenting a tutorial on @datalad and I'm looking forward to the whole conference!

#euroscipy #euroscipy2025 #datalad

Yann Büchau :nixos: nobodyinperson@fosstodon.org
2025-07-11

@umphy :gitannex: #gitAnnex and :datalad: #dataLad organize the data and sync it to our own :forgejo: #forgejoAneksajo instance. You can instantly see if everything is there and worked. A great workflow!

#forgejo

Asta v. Schröder AvSchroeder@fedihum.org
2025-07-03

After three days of workshop on #DataLad in Aachen - thanks again to @lukascbossert and the @dkz2r team for the great organisation - today a training on #Moodle... Unfortunately I now have to take care of the catering myself again... Luckily it has cooled down a bit (at least here where I am it is a summery "normal" 19°C with clouds. Lovely!)

2025-07-02

@AvSchroeder @lukascbossert @mih @dkz2r @adswa @doktorpanik @jsheunis @abcdj @nfdi4objects @nfdi4ing @WiNoDa @NFDI

Sounds like it was a fantastic workshop – thanks for sharing your impressions! Kudos to the #DataLad team! 👏

Asta v. Schröder AvSchroeder@fedihum.org
2025-07-02

@lukascbossert @mih @dkz2r @adswa @doktorpanik @jsheunis @abcdj @nfdi4objects @nfdi4ing @WiNoDa @NFDI it was a great workshop and if our heads smoked it was only due to the heat - the #DataLad Team did an excellent job of explaining, repairing botched attempts, and even bugfixing while remaining calm, patient and friendly the whole time -- I enjoyed it a lot.
Lots to process now...

Lukas C. Bossert lukascbossert
2025-07-02

@mih @dkz2r @adswa @doktorpanik @jsheunis @abcdj @nfdi4objects @nfdi4ing @WiNoDa Last day of our 3-day workshop at the IT Center of the Aachen. Today everyone is diving deeper into the realm of and applying it to the individual use cases. BIG THANK YOU to the whole team for making this possible and supporting us. Voting for as @NFDI - Base service: ❤️ .
