#ShellTools

2025-05-23

Command-line Tools can be 235x Faster than your Hadoop Cluster
"This find | xargs mawk | mawk pipeline gets us down to a runtime of about 12 seconds, or about 270MB/sec, which is around 235 times faster than the Hadoop implementation."
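The general shape of a find | xargs awk | awk pipeline can be sketched as below. This is an illustration only, not the article's actual command: the file names, the "win/loss/draw" records, and the worker count are made up for the demo, and plain awk stands in for mawk. It also assumes a find and xargs that support -print0, -0 and -P (GNU and BSD versions do).

```shell
#!/bin/sh
# Sketch only: hypothetical data; awk is used as a stand-in for mawk.
tmp=$(mktemp -d)
printf 'win\nloss\nwin\n' > "$tmp/a.log"
printf 'draw\nwin\n' > "$tmp/b.log"

# Stage 1: find lists the files, xargs runs up to 2 awk workers in parallel,
#          and each worker prints partial counts as "wins losses draws".
# Stage 2: a final awk sums the partial counts into one line.
summary=$(find "$tmp" -type f -name '*.log' -print0 |
  xargs -0 -n1 -P2 awk '
    /win/  { w++ }
    /loss/ { l++ }
    /draw/ { d++ }
    END { print w+0, l+0, d+0 }' |
  awk '{ w += $1; l += $2; d += $3 }
       END { print w, l, d }')

echo "$summary"
rm -rf "$tmp"
```

Each worker only emits one short line, so the final stage has very little to read; the order in which workers finish does not matter because the last step is a plain sum.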

#complexity #ShellTools #RightToolForTheRightJob #Hadoop #computing

Doc Edward Morbius ⭕ dredmorbius@toot.cat
2025-04-28

Apropos an ongoing project, looking at sed, and realising:

  • It can execute external commands (e).
  • It can read in entire files at a given address within the input stream (r).
  • It can read in specified external files on a line-by-line basis at a specified address within the input stream (R).
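A minimal sketch of the three commands, assuming GNU sed (e and R are GNU extensions, and e is rejected when sed runs with --sandbox). The file names and contents are made up for the demo.

```shell
#!/bin/sh
# Sketch only: assumes GNU sed; all file names here are hypothetical.
tmp=$(mktemp -d)
printf 'alpha\nbeta\ngamma\n' > "$tmp/main.txt"
printf 'one\ntwo\n' > "$tmp/extra.txt"

# r: append the whole of extra.txt after line 2 of the input.
out_r=$(sed "2r $tmp/extra.txt" "$tmp/main.txt")

# R: append ONE line of extra.txt after each addressed line;
#    every execution of R consumes the next line of that file.
out_R=$(sed "1,2R $tmp/extra.txt" "$tmp/main.txt")

# e: run a shell command and emit its output before line 2.
out_e=$(sed '2e echo inserted' "$tmp/main.txt")

printf '%s\n---\n%s\n---\n%s\n' "$out_r" "$out_R" "$out_e"
rm -rf "$tmp"
```

With r the two extra lines land together after "beta"; with R they are interleaved, "one" after "alpha" and "two" after "beta"; with e the word "inserted" appears on its own line just before "beta".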

I've only been using sed for, oh, 40 years.

gnu.org/software/sed/manual/se

#sed #ShellTools #linux #unix #til #shellScripting
