#veraPDF

2025-05-16

I don't care if #Acrobat thinks something with an alt-text shouldn't be embedded in something else that also has an alt-text. Both #PAC and #VeraPDF are happy with the PDF/UA-1, and so am I... publishup.uni-potsdam.de/opus4 #TeXLaTeX #accessibility

Johan van der Knijff bitsgalore@digipres.club
2025-05-07

Sort of a #PDF #doubleTrouble release day at the Open Preservation Foundation today, with new releases of both #veraPDF (which checks conformance to PDF/A and PDF/UA) and the #Arlington PDF Checker (which checks conformance to the Arlington PDF model):

openpreservation.org/news/vera

Good stuff!

2025-03-04

Hello @Georgia!

I wanted to use the online public demo #veraPDF #Arlington model at arlington.verapdf.org/ but it returns a 502 "Bad gateway" error...

Georgia Moppett Georgia@digipres.club
2024-09-18

🚀 Join @carl today at #iPRES2024 where he demonstrates #veraPDF's #Arlington, an invaluable resource for anyone developing or testing #PDF #DigitalPreservation tools.

📍 De Bijloke - Kraakhuis

Check the release notes here + our thanks go to all contributors, particularly the Dual Lab development team, for their work on this: openpreservation.org/news/arli

Johan van der Knijff bitsgalore@digipres.club
2024-03-28

Just found out there's now a development prototype of veraPDF-rest, which exposes #VeraPDF's functionality through a REST API:

github.com/veraPDF/veraPDF-res

Will need to try this out, but this definitely looks really useful!

This could also be good for developing performant VeraPDF wrappers in other programming languages, like Python (similar to how Tika-python currently wraps around #Apache #Tika's REST API).
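A rough idea of what such a Python wrapper might look like, using only the standard library. This is a minimal sketch and not the project's documented client: the `/api/validate/<profile>` path, the `file` form field, and the `localhost:8080` address are assumptions that should be checked against the veraPDF-rest README, and the function names are made up for illustration.

```python
import os
import urllib.request
import uuid


def build_validate_request(base_url, profile_id, filename, pdf_bytes):
    """Build (but do not send) a multipart POST carrying one PDF.

    ASSUMPTION: the '/api/validate/<profile_id>' path and the 'file'
    form field are illustrative guesses, not a confirmed API.
    """
    boundary = uuid.uuid4().hex
    head = (
        f"--{boundary}\r\n"
        f'Content-Disposition: form-data; name="file"; filename="{filename}"\r\n'
        "Content-Type: application/pdf\r\n\r\n"
    ).encode("utf-8")
    tail = f"\r\n--{boundary}--\r\n".encode("utf-8")
    url = f"{base_url.rstrip('/')}/api/validate/{profile_id}"
    return urllib.request.Request(
        url,
        data=head + pdf_bytes + tail,
        method="POST",
        headers={
            "Content-Type": f"multipart/form-data; boundary={boundary}",
            "Accept": "application/json",
        },
    )


def validate(base_url, profile_id, path):
    """Send a local PDF to the (assumed) endpoint and return the raw response text."""
    with open(path, "rb") as handle:
        request = build_validate_request(
            base_url, profile_id, os.path.basename(path), handle.read()
        )
    with urllib.request.urlopen(request) as response:
        return response.read().decode("utf-8")
```

With a service running locally, a call might look like `validate("http://localhost:8080", "1b", "example.pdf")`, where `1b` is a hypothetical profile identifier. Keeping request building separate from sending makes the wrapper easy to test without a running server.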

2024-03-14

New blog entry: PDF debugging. I show my favourite tools for examining PDF files in case I messed up and get a warning in Adobe Acrobat, for example. No more error messages!

news.speedata.de/2024/03/14/pd

#pdf #debugging #qpdf #pdfcpu #adobeacrobatpro #verapdf

Robert Nasarek rnsrk@fedihum.org
2023-10-19

Hi Kids, wanna validate your PDF files over REST?
github.com/veraPDF/veraPDF-res
Demo here: demo.verapdf.org/

#pdfvalidate #veraPDF

Johan van der Knijff bitsgalore@digipres.club
2023-06-30

ICYMI, I ran some experiments to see if #VeraPDF’s parse status can be used to predict #PDF rendering problems, using an existing dataset of synthetic PDFs as ground truth. I also looked at how this compares against the occurrence of #JHOVE validation errors.

Details in this blog post:

bitsgalore.org/2023/06/29/vera

Vintage lithograph circus poster that shows a circus ring. In the front is a woman in a red dress, standing on horseback. Behind her there are more horses, with a variety of circus artists, including acrobats and jugglers, performing on horseback as well. In the background acrobats are walking on a tightrope.
Johan van der Knijff bitsgalore@digipres.club
2023-06-29

New blog post - #VeraPDF parse status as a proxy for #PDF rendering: experiments with the Synthetic PDF Testset:

bitsgalore.org/2023/06/29/vera

@general

Vintage lithograph circus poster that shows a circus ring. In the front is a woman in a red dress, standing on horseback. Behind her there are more horses, with a variety of circus artists, including acrobats and jugglers, performing on horseback as well. In the background acrobats are walking on a tightrope.
Johan van der Knijff bitsgalore@digipres.club
2023-06-21

Oh, this looks good - #OpenPreservationFoundation #OPF and the #PDFAssociation have released a first development preview of a #VeraPDF-powered #PDF checker. The software is based on the #Arlington PDF model, and analyses PDF files against the full PDF 2.0 specification:

openpreservation.org/news/deve

Haven't tried it yet, but based on what I'm reading this looks like the future of #PDF validation to me!

Johan van der Knijff bitsgalore@digipres.club
2023-06-06

Out of curiosity I ran both #JHOVE and #VeraPDF on the "Synthetic #PDF Testset for File Format Validation" by @mickylindlar et al. (link: radar-service.eu/radar/en/data).

Then did a quick comparison between validation errors as reported by JHOVE, and parse errors and logged warnings by VeraPDF.

Main result so far is that the majority of PDFs for which JHOVE reports validation errors also result in either a parser error or a warning in VeraPDF. Sneak peek here:

github.com/KBNLresearch/pdf-ch
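The kind of comparison described above boils down to a set overlap. A minimal sketch, assuming each tool's results have already been reduced to a set of file names; the file names and numbers below are made up for illustration and are not results from the actual test set.

```python
def share_also_flagged(jhove_invalid, verapdf_flagged):
    """Fraction of JHOVE-invalid files that VeraPDF also flags.

    'Flagged' here means a parse error or a logged warning.
    """
    jhove_invalid = set(jhove_invalid)
    if not jhove_invalid:
        return 0.0
    return len(jhove_invalid & set(verapdf_flagged)) / len(jhove_invalid)


# Hypothetical inputs, NOT taken from the Synthetic PDF Testset.
jhove_invalid = {"a.pdf", "b.pdf", "c.pdf", "d.pdf"}
verapdf_flagged = {"a.pdf", "b.pdf", "c.pdf", "e.pdf"}

share = share_also_flagged(jhove_invalid, verapdf_flagged)
print(f"{share:.0%} of JHOVE-invalid files are also flagged by VeraPDF")
```

Note that the measure is one-directional: files flagged only by VeraPDF (like `e.pdf` here) do not affect it.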

Johan van der Knijff bitsgalore@digipres.club
2023-05-25

I explored to what extent #VeraPDF and #JHOVE can be used to identify #PDF features that are potential preservation risks. Check out this (massive!) blog post for the full lowdown #wtfPDF:

bitsgalore.org/2023/05/25/iden

Photo of a red toy robot and a similar looking blue toy robot in a boxing ring. Both robots face each other in a threatening stance.
Tim Allison tallison
2023-01-25

Many thanks to Carl Wilson, Boris Doubrov and for a fantastic update on the latest in this morning/afternoon!

openpreservation.org/events/ve

2022-12-07

#OPFOAG Thomas Ledoux advocates for a standard #schematron edition tool to enforce institutional policies on #JHOVE, #veraPDF & #jpylyzer outputs.

2022-12-07

@mickylindlar
Carl: "PDF is a huge tree of objects linked one to another." Which makes interpreting errors far from intuitive!

But #veraPDF, and soon #JHOVE, should be able to associate an error with the problematic zone in the PDF.
