#metadata

Helmholtz Metadata Collab.helmholtz_hmc@helmholtz.social
2025-10-10

The HMC team is heading to International Data Week 2025 in Brisbane (13–16 Oct)! 🌏✨

With contributions from colleagues, we’re looking forward to connecting with the global data community, exchanging ideas, and discussing how #FAIRdata, #metadata and #OpenScience can advance science and society.

#IDW #IDW2025 #FAIRdata #OpenScience

@codata @resdatall
@RDA_Association
@ardc_au
@WDS_IPO
@helmholtz

The HMC team is heading to International Data Week 2025 in Brisbane (13–16 Oct)!
2025-10-09

Att delta på två konferenser är för mycket - men det går när en avskild plats bjuder på lugnare miljö för att hålla föreläsning!😅

Tack till SFIS som bjöd in mig till att tala på SFIS Mellansverige Teknikdag 2025.

Jag pratade om öppna standarder & interoperabilitet med #metadata & #informationshantering. Detta för kunskap, förvaltning & informationsutbyte - inte minst kundförtroende och skattemedel! Ramade in med berättelser om @okfse, #NationellDataverkstad och @entryscape av @metasolutions.

Mattias Axell taking a selfie at a balcony at the OGP Summit conference in Spain while holding a laptop and having his backpack behind him.
2025-10-09

Boosting #openresearch: DataCite metadata is now integrated into OpenAlex. Over 92 million DOIs, including datasets, preprints, and many more research outputs & resources, are available in OpenAlex, enhancing research connectivity & discovery. Read the announcement: doi.org/10.5438/we1x-2k68

@OpenAlex
#OpenInfrastructure #OpenScience #OpenResearch #metadata #persistentidentifier #PID #DOI

Image features the DataCite Blog title, announcing the integration of DataCite metadata in OpenAlex. It includes photographs of two individuals, Kyle Demes and Maria Gould, and the DataCite logo.
2025-10-09

🚀 Đang gặp vấn đề mất metadata khi chuyển PDF/Excel sang markdown trong pipeline RAG? Đội Pipeshub đề xuất mô hình “blocks” giữ nguyên vị trí, trang, bảng, dòng… giúp tăng độ chính xác, giảm hallucination và cho phép tùy chỉnh sâu hơn. Họ muốn biến nó thành chuẩn mở và phát triển gói Python. Bạn có quan tâm? #RAG #DocumentParsing #AI #OpenStandard #Metadata #trí_tuệ #xử_lý_tài_liệu

reddit.com/r/LocalLLaMA/commen

2025-10-09

Realized that in my current workflow, every field in the #FADGI WebVTT #metadata except two can be auto-generated or hard-coded, so I made a script that runs #Whisper, generates FADGI metadata and optionally pulls the last two fields from a csv, and adds it to the output. It's written for use at my institution but could be easily adapted:
github.com/ninarao/whispervtt

(I also learned that Whisper via Python uses greedy search by default instead of beam, so I adjusted that too.)

#AVpres #captions

Miguel Afonso Caetanoremixtures@tldr.nettime.org
2025-10-07

"APIs to Capabilities

Enterprises have invested 10-15+ years into exposing enterprise capabilities (internal and external) with APIs. That is not going away. MCP, as exciting as it is, is really just a simple protocol shim for AI models to call tools. But to expose the tools correctly to the model, we need to describe capabilities not just API contract structure:

- tool names should be unique, action oriented (e.g., “listAllTodoTasks” vs just “list”)
- include detailed purpose explanations
- give examples of when to call with example requests/responses
preconditions for using the tool

Using OpenAPI Spec

The OpenAPI Specification contains a number of fields and structures to support adding rich semantic meaning to our APIs:

-Using the info section
- A number of sections offer the ability to link out to externalDocs
- Most sections provide a title, summary, and description field
- You can link out to industry accepted (or enterprise specific) data fields using JSON-LD for very deep semantic meaning
- If none of these are adequate, you can extend the spec with “x-properties”

Let’s take a quick look at an example."

blog.christianposta.com/semant

#APIs #APIDesign #APIDevelopment #OpenAPI #MCP #LLMs #Metadata #AI #GenerativeAI #AIAgents

James Truitt (he/him)linguistory@code4lib.social
2025-10-07

Anyway, it keeps bugging me that the lists of metadata standards I keep encountering often include EAD but never DACS. Like, you put AACR2 on here but not MARC, so what gives?

#GradSchoolGripes #Metadata

James Truitt (he/him)linguistory@code4lib.social
2025-10-07

It's especially interesting given that this is when the MPLP movement is taking off in archives world, too.

And I also find it bemusing that folks thinking about digital facsimiles and metadata in this period don't seem to have drawn much from thought or practice around microformat facsimiles, at least, not that I've seen yet

#Metadata

Senator Josh Hawley ignited controversy claiming the FBI wiretapped him post-Jan 6, but experts like Matthew Gertz and Jennifer Bendery clarify only metadata, not calls, were accessed in DOJ’s probe. This fuels bigger debates on political accountability and misinformation. Dive deeper at alternet.org/the-right-wing/ha #JoshHawley #January6 #FBI #metadata #DOJ #CongressionalHearing

2025-10-07

Interoperable, trustworthy, and machine-readable copyright data in the AI era : Report of the CITF First Project 👇©️ #copyright #DigitalMedia #identifiers #metadata #AI

julkaisut.valtioneuvosto.fi/handle/10024/1...

2025-10-07

📢 New preprint w/ @MsPhelps!

We explore how manuscript submission systems may affect completeness of open metadata in @crossref. Some systems seem to support richer metadata – but tech isn't the only factor.

👉doi.org/10.31222/osf.io/ndx3f_

#OpenScience #ScholarlyPublishing #Metadata

Public Knowledge ProjectPublicKnowledgeProject
2025-10-07

👋 Let's connect at @craftoa Conference - Crafting the Future of : Technical, Social & Political Perspectives of

Urooj Nizami & @markhusk are there with many friends! Urooj will be on a panel re sustainable & will co-present on Canadian OA policy.

Lots going on! , Diamond plugins, , , sustainability & much more.

Program: craft-oa.eu/craft-oa-conferenc

Streaming: craft-oa.eu/craft-oa-conferenc

Mark is Strategic Business Development Advisor and Urooj is Community Engagement and Outreach Associate Director

@mattblaze

The spec for EXIF 3, at least, includes the ImageDescription tag as a standardized (but optional) thing. I haven't checked if it's in anything prior to 3.0.

Trying to get software to support it, of course...

#EXIF #TIFF #tags #metadata

2025-10-06

I recently returned from visiting Stefanie Haustein, associate professor at the School of Information Studies at the University of Ottawa and co-director of the ScholCommLab. Thanks to a @BerlinUAlliance grant, I am able to investigate metadata flows from local to global contexts.

Read more about what I worked on during my stay:
👉 doi.org/10.59350/fzz0a-wv919

Or about the project in general:
👉 doi.org/10.59350/wwwj7-4cm07

#openscience
#researchdata
#metadata
#fdm
@IBI_HU

The Munin ConferenceMuninConf@mastodon.world
2025-10-06

🚀 How do journal systems shape open metadata in Crossref? 📊 Kramer, de Jonge & Korzec analyze 150 publishers – tech vs. commercial choices? 🔍 Read more: septentrio.uit.no/index.php/SC
🌍❄️ #Munin2025 #OpenScience #Metadata #Crossref

2025-10-04

ICYMI: "Putting in this effort in making our papers more machine-readable and comprehensible pays off… making the discoverability and visibility of our journals greater."

More on GigaScience Press's metadata journey: doi.org/10.64000/z2qhj-7nd90?m

#metadata #discoverability #openscience #Crossref2025

Vivekanandan KS :nixos:vivekanandanks@mstdn.social
2025-10-04

That's the kind of #platform I really would love to see. All #artists and their work openly shared #metadata and #people have control over the #money spent either blindly on the #album or specifically to their liking.

2025-10-03

New #metadata dataset: Fiocruz/CMM - Coleção de Malacologia Médica gbif.org/dataset/3798d8e8-34a8

2025-10-03

New #metadata dataset: Fiocruz/CHIOC - Coleção Helmintológica do Instituto Oswaldo Cruz gbif.org/dataset/e17a91e1-9157

Client Info

Server: https://mastodon.social
Version: 2025.07
Repository: https://github.com/cyevgeniy/lmst