#PDFs

2026-01-24

Tip of the day: The Content inspectors in #DEVONthink 4 are especially useful for #PDFs, as they provide an overview of the document. You get a table of contents and thumbnails of the document’s pages. With #DEVONthink 4, you can also customize a #PDF’s outline. #pkm #productivity #tipoftheday devontechnologies.com/blog/202

2026-01-21

Tip of the day: When doing research and taking notes, it is often helpful to link to specific parts of text in documents, especially #PDFs. #DEVONthink has two special copy and paste commands that make such linking very fast and effective. #notetaking #pkm #productivity #tipoftheday #workflow devontechnologies.com/blog/202

N-gated Hacker Newsngate
2026-01-19

Ah, the brave new world where your can finally live in witness protection without being "cloud-napped." 🚫☁️ Because clearly, what the internet truly lacked was yet another tool to wrangle those elusive PDF files…locally! 🤦‍♂️ Enjoy your newfound while you think about how often you actually need this. Spoiler: it's not that often. 📄🔒
pdfwithlove.netlify.app

2026-01-18
Hacker Newsh4ckernews
2026-01-10
N-gated Hacker Newsngate
2025-12-06

🎉🥳 In today's thrilling episode of "Tech Genius," someone on created a tool to make look like they were scanned, because apparently, the world was desperately lacking in fake nostalgia for fax machines 📠. Who knew that the pinnacle of would be making digital documents as annoyingly unreadable as possible? 🙄🚀
github.com/Francium-Tech/scani

Hacker Newsh4ckernews
2025-12-06

I made a tool to make PDFs look scanned because bureaucracy

github.com/Francium-Tech/scani

2025-10-31

@tallison ran Tika on my pile of PDFs and now I see the “pdfa:PDFVersion” field and just for the first result file I’m seeing PDF/A results for 193 files out of 10000. I have 17 more sets of results to go through, Im guessing the others will be similar. Fascinating! #pdfs #digipres

2025-09-30

Ran pdfcpu -relaxed on my pile of 175K+ #pdfs and there were 1762 files with validation errors here’s a sample. Thanks to @mickylindlar for suggesting pdfcpu, now I just need to make sense of the results. #digipres #digitalpreservation

validation error (obj#:968): postScriptCalculatorFunctionStreamDict: unsupported in version 1.2 validation error (obj#:1): pdfcpu: validateIndRefArrayEntry: invalid type at index 0 validation error (obj#:90): pdfcpu: validateOutlineTree: empty outline item dict "Count" must be 0 validation error (obj#:9): dict=extGStateDict entry=HT (obj#9): unsupported in version 1.1 validation error (obj#:58): pdfcpu: validateIndRefArrayEntry: invalid type at index 0 validation error (obj#:21): dict=pagesDict entry=Tabs: unsupported in version 1.2 validation error (obj#:746): dict=fileSpecDict entry=Thumb: unsupported in version 1.6 validation error (obj#:452): dict=outlineItemDict required entry=Parent missing
2025-09-19

Just finished the run and nearly 9K #pdfs were ‘Well formed, but not valid’ when using text output and with json there were only 3. @dpc_chat #JHOVE #digipres #digitalpreservation #OpenPreservationFoundation

⚯ Michel de Cryptadamus ⚯cryptadamist@universeodon.com
2025-09-18

Released v1.17.0 of The Pdfalyzer, the surprisingly popular tool for analyzing (possibly malicious) PDFs I created after my own unpleasant experience. Now ships with two command line tools for extracting stuff from PDF files:

1. extract_text_from_pdfs() - brute force extract all text from a PDF, including doing an #OCR extraction of any embedded images

2. extract_pdf_pages() - rip a page range from a #PDF and write them to a new one

* Github: github.com/michelcrypt4d4mus/p
* Pypi: pypi.org/project/pdfalyzer/
* Homebrew: formulae.brew.sh/formula/pdfal
* Fun thread someone made last week using Pdfalyzer to explain some of how byzantine the PDF format is: x.com/VikParuchuri/status/1965

#pypi #python #pdf #pdfs #malware #Threatassessment #maldoc #malwareanalysis #homebrew #infosec #cybersecurity #yararule #PdfFies

Github repo screenshot
Soft & Appssoft_apps
2025-09-17

✅ ¿Cansado de aplicaciones pesadas para editar ?

CanaryPDF: un kit de herramientas GRATUITO y seguro que funciona en tu .

Edita PDFs, extrae imágenes y tablas. SIN instalar nada y SIN registro.

Tus archivos NUNCA se suben a internet.

➡️ softandapps.info/2025/09/17/ca

N-gated Hacker Newsngate
2025-09-12

Oh boy, another package! 🤣 Get ready for a thrilling ride through "The Companion," where you'll find edge-of-your-seat excitement like... used with spotcolor! 🎨📄 Strap in for the ultimate in monotony! 😂
ctan.org/pkg/tlc3-examples

2025-09-09

Tip of the day: When doing research and taking notes, it is often helpful to link to specific parts of text in documents, especially #PDFs. #DEVONthink has two special copy and paste commands that make such linking very fast and effective. #notetaking #pkm #productivity #tipoftheday #workflow devontechnologies.com/blog/202

R.L. Dane :Debian: :OpenBSD: :FreeBSD: 🍵 :MiraLovesYou:rl_dane@polymaths.social
2025-09-08

I really wish there was a keyboard-driven #PDF viewer like #Zathura, #MuPDF, or #SioYek that let you fill out forms and annotate #PDFs.

That would be da bomb.

2025-09-08

@BertrandCaron you post is a good reminder, I need to run #JHOVE on my pile of 175K+ #PDFs, I’ve run veraPDF it would be nice to compare the results.

Verfassungklage@troet.cafeVerfassungklage@troet.cafe
2025-09-07

#LinuxGuides:

#Büro-Software für #Linux #Mint - E-Mails, #Office, #PDFs & Software - Bye Windows 10! Teil 3/5

m.youtube.com/watch?v=Wi_Tw1p-

Client Info

Server: https://mastodon.social
Version: 2025.07
Repository: https://github.com/cyevgeniy/lmst