#pyPDF

2025-04-29

PDF parsen

Manchmal muss man PDF-Dateien auslesen. Dieser Artikel zeigt, wie man das mit einem Python-Skript macht.

#PDF #Parser #parsen #Auslesen #pypdf #Linux

gnulinux.ch/pdf-parsen

RivermonsterRomanOnARiver
2025-04-18

Working on a piece of (internal) software tentatively titled "t" it does some manipulation relating to PDFs. If I'm successful I can save our team about 50%. And what I've come across is that the module is really robust, powerful, lot of features. It's so strange that software like "Acrobat" has no free software equivalent - I'm pretty sure I can do everything that app does with this module, I could be wrong.

:rss: Qiita - 人気の記事qiita@rss-mstdn.studiofreesia.com
2024-05-26

PDFをLLMで解析する前処理のパーサーは何が良いのか?(pdfminer, PyMuPDF, pypdf, Unstructured)
qiita.com/cyberBOSE/items/142c

#qiita #Python #pdfminer #PyMuPDF #pyPDF #Unstructured

Client Info

Server: https://mastodon.social
Version: 2025.04
Repository: https://github.com/cyevgeniy/lmst