#DocumentEngineering

Michael Piotrowski mxp@mastodon.acm.org
2025-04-11

There’s still time to submit a long paper to ACM #DocEng25: share your document-related research and submit a paper by April 29!

Short papers and demos are due June 17.

(For all categories, abstracts are due a week before.)

#DocEng2025 #DocEng #DocumentEngineering
mastodon.acm.org/@mxp/11418235

Michael Piotrowski mxp@mastodon.acm.org
2024-05-27

The irony is that they conclude that 200 dpi is enough, and 300 dpi gives you a good safety margin. They stress that higher resolutions waste storage space, communication network bandwidth for file transmission, and processing power.

But this didn't stop them from including a 3272 × 5500 pixel color PNG as what amounts to a thumbnail.

#DocumentEngineering

Michael Piotrowski mxp@mastodon.acm.org
2024-05-27

I’m looking at a paper that discusses the most suitable resolution for scanning documents. The 10-page PDF is almost 90 MB in size. Why? Because the authors did experiments with different resolutions… and the table that lists the characteristics at the various resolutions includes the *actual* scanned page at that resolution (shown at about 2 cm × 3 cm). ⇢

#DocumentEngineering

Michael Piotrowski mxp@mastodon.acm.org
2024-02-16

What’s especially interesting is that Don Chamberlin isn’t just some random IBMer (en.wikipedia.org/wiki/Donald_D). And he wasn’t only involved with the Quill SGML editor, but also with the Quilt XML query language, which heavily influenced XQuery (he was one of the editors of the W3C XQuery 1.0 recommendation).

#XMLPrague #REXX #SGML #HistoryOfComputing #DocumentEngineering

Michael Piotrowski mxp@mastodon.acm.org
2024-02-16

In my talk for the 2019 #XMLPrague paper “History and the Future of Markup” (archive.xmlprague.cz/2019/file), I speculated about the use of #REXX as a language for #SGML tree manipulation.

Last night I “discovered” that REXX had indeed been used for this at IBM, and Goldfarb knew about it: namely in the experimental Quill editor running on an experimental windowing system on an experimental operating system on the IBM RT PC… (see, e.g., doi.org/10.1145/62506.62524)

#HistoryOfComputing #DocumentEngineering

Presentation slide titled “SGML is a cultural artifact”:

– To the best of my knowledge, there are no “good” (i.e., technical) reasons for these properties of SGML.
   
“The historical origins of SGML as a technique for adding marks to texts has left a legacy of complexities and difficulties which hinder its wide acceptance.” (Price, 1998)

– (Counterfactual history: Charles Goldfarb meets Mike Cowlishaw at IBM and Rexx is integrated into SGML as a language for tree manipulation.)

– This was proven true by XML: it is fascinating how quickly it was adopted and how an extremely rich ecosystem developed once many of these complexities had been discarded.

2023-03-14

Without even blinking, Greg Brockman, during his GPT-4 demo, redefines the "document" as "arbitrarily interspersed images and text."

Byebye #DocumentEngineering

Michael Piotrowski true_mxp@qoto.org
2023-02-13

The call for papers for @ACM #DocEng23 (August 22–25 in Limerick, Ireland) is out:

doceng.org/doceng2023/cfp

❝The 23rd ACM Symposium on Document Engineering (DocEng’23) seeks original research papers that focus on the design, implementation, development, management, use and evaluation of advanced systems where document and document collections play a key role. DocEng emphasizes innovative approaches to document engineering technology, use of documents and document collections in real world applications, novel principles, tools and processes that improve our ability to create, manage, maintain, share, and productively use these. In particular, DocEng 2023 seeks contributions in the area of collaborative work with documents. Attendees at this international forum have interests that span all aspects of document engineering and applications.❞

#DocEng #DocumentEngineering

Michael Piotrowski true_mxp@qoto.org
2022-12-04

@mpe Awesome! Mandatory reading also for everyone in #documentEngineering!

Michael Piotrowski true_mxp@qoto.org
2022-11-19

It’s finally out now: @CerstinMahlow and my #DocEng2022 position paper on “Academic writing and publishing beyond documents.”

doi.org/10.1145/3558100.356384

#DocumentEngineering #scholarlyPublishing

2022-11-11

and of course there is an “r” missing in #DocumentEngineering
