#DataWrangling

2026-01-29

Say goodbye to the frustrations of copying and pasting data to and from R with Datapasta from @milesmcbain! Get the package now: milesmcbain.github.io/datapast #DataWrangling #Rstats

2025-12-06

Sometimes you get data in less than optimal format, e.g. as a png of a figure 😭... In that case cran.r-project.org/web/package might be the rescue. #rstats #ohno #datawrangling

2025-10-06

Say goodbye to the frustrations of copying and pasting data to and from R with Datapasta from @milesmcbain! Get the package now: milesmcbain.github.io/datapast #DataWrangling #Rstats

Data Rescue Project #DataRescuedatarescueproject.org@bsky.brid.gy
2025-09-04

It's not hard to bill-lief that our technically-inclined volunteers stay afloat using @duckdb.org for their #DataWrangling needs

2025-08-13

Sometimes you get data in less than optimal format, e.g. as a png of a figure 😭... In that case cran.r-project.org/web/package might be the rescue. #rstats #ohno #datawrangling

Turning NASA Wake-up Calls into data

by @beet_keeper

For a while back then I was into space flight again. Scientists, science communicators, and engineers were all excited for a new era of rocket launches and the potential unification of the human race as we look towards the future.

During that time I discovered Colin Fries’ work in the NASA History Division to document all NASA “Wake-up calls”. A wake-up call is simply a piece of music used to wake astronauts on missions, a different piece of music, daily, for the duration of the flight.

Take, for example, the last Space Shuttle mission (Space Transportation System) STS-135; it was in flight for 13 days, and the wake-up call on day one was Coldplay’s Viva la Vida, while on day 13 it was Kate Smith singing God Bless America.

As a huge music buff who has the radio or music television on 18 hours a day, I really wanted to delve into this further. While Colin’s work is great, it’s just a PDF file (@wtfpdf). A PDF is not an ideal file format for querying data and gleaning new insights. So, while I wanted to explore it, I first decided to turn it into a true dataset. The result was a set of resources, a website, a JSON, a CSV, and an SQLite database which are each more functional and more maintainable over time.

Lets take a look at the results and https://nasawakeupcalls.github.io below!

Continue reading “Turning NASA Wake-up Calls into data”


#ApacheTika #Code #Coding #DataWrangling #Datasette #DatasetteLite #DH #DigitalHumanities #GLAM #harkive #NASA #NASAWakeUpCall #NASAWakeUpCalls #OpenData #PersonalProjects #Science #Space #SpaceHistory #Twitter #WakeUpCall
NASA Wakeup Calls banner featuring a sunrise over the earth's horizon. glowing over the right hand side of the image, and the project logo in the left hand.
Misinformation-SuperhighwaymanDamienWise@aus.social
2025-07-05
2025-06-13

Say goodbye to the frustrations of copying and pasting data to and from R with Datapasta from @milesmcbain! Get the package now: milesmcbain.github.io/datapast #DataWrangling #Rstats

N-gated Hacker Newsngate
2025-05-30

Oh, the endless saga of trying to bridge the gap between and 😩. Apparently, the internet gods have decided that converting tables is a task only for the truly worthy—or those with a spare afternoon to wrestle with server errors 🤦‍♂️. Meanwhile, your data waits, trapped in spreadsheet purgatory.
thisdavej.com/copy-table-in-ex

PromptCloudpromptcloud
2025-05-02

Structured data drives AI. But messy inputs? They stall everything.
We’ve listed six parsing issues you should be watching for.
👉 Read the blog to know more: shorturl.at/vuJjw

Data Parsing in AI and Machine Learning: Preparing Clean Data for Better Models
2025-04-19

Sometimes you get data in less than optimal format, e.g. as a png of a figure 😭... In that case cran.r-project.org/web/package might be the rescue. #rstats #ohno #datawrangling

2025-02-17

Say goodbye to the frustrations of copying and pasting data to and from R with Datapasta from @milesmcbain! Get the package now: milesmcbain.github.io/datapast #DataWrangling #Rstats

Alexandre B A Villares 🐍villares@ciberlandia.pt
2025-02-09
screenshot from the start of the introduction at https://wesmckinney.com/book/preliminaries

1.2 Why Python for Data Analysis?

For many people, the Python programming language has strong appeal. Since its first appearance in 1991, Python has become one of the most popular interpreted programming languages, along with Perl, Ruby, and others. Python and Ruby have become especially popular since 2005 or so for building websites using their numerous web frameworks, like Rails (Ruby) and Django (Python). Such languages are often called scripting languages, as they can be used to quickly write small programs, or scripts to automate other tasks. I don’t like the term “scripting languages,” as it carries a connotation that they cannot be used for building serious software. Among interpreted languages, for various historical and cultural reasons, Python has developed a large and active scientific computing and data analysis community. In the last 20 years, Python has gone from a bleeding-edge or "at your own risk" scientific computing language to one of the most important languages for data science, machine learning, and general software development in academia and industry.

For data analysis and interactive computing and data visualization, Python will inevitably draw comparisons with other open source and commercial programming languages and tools in wide use, such as R, MATLAB, SAS, Stata, and others. In recent years...

[and the page continues a bit]
Alexandre B A Villares 🐍villares@ciberlandia.pt
2025-01-18
O'Reilly's "Python for Data Analysis" 3rd edition book cover.

From the colophon: "The animal on the cover of Python for Data Analysis is a golden-tailed, or pen-tailed, tree shrew (Ptilocercus lowii). The golden-tailed tree shrew is the only one of its species in the genus Ptilocercus and family Ptilocercidae; all the other tree shrews are of the family Tupaiidae. Tree shrews are identified by their long tails and soft red-brown fur. As nicknamed, the golden-tailed tree shrew has a tail that resembles the feather on a quill pen. Tree shrews are omnivores, feeding primarily on insects, fruit, seeds, and small vertebrates."
2024-12-23

Sometimes you get data in less than optimal format, e.g. as a png of a figure 😭... In that case cran.r-project.org/web/package might be the rescue. #rstats #ohno #datawrangling

2024-10-23

Say goodbye to the frustrations of copying and pasting data to and from R with Datapasta from @milesmcbain! Get the package now: milesmcbain.github.io/datapast #DataWrangling #Rstats

Eric Maugendre about datamaugendre@hachyderm.io
2024-10-01

@datadon

Unfilled cells influence models.
"Handling Missing Data in Machine Learning": ml-nn.eu/a1/51.html by Calin Sandu @mlnn

#missingData #bias #wealth #dataQuality #complexity #dataDev #machineLearning #dataPrep #EDA #dataWrangling

2024-09-20

📢 Join us for a hands-on tutorial on effective data wrangling with R!

🌟 Learn essential techniques using tidyverse packages.

🗓️When: September 30, 2024 at 7PM CET
💥RSVP: 🔗 meetup.com/rladies-rome/events

#DataWrangling #RStats #DataScience
@fgazzelloni @silacos @rafagrlucas

2024-08-27

Sometimes you get data in less than optimal format, e.g. as a png of a figure 😭... In that case cran.r-project.org/web/package might be the rescue. #rstats #ohno #datawrangling

Dan Stowelldanstowell
2024-08-06

a simple script to trim a set of audio files to a maximum length (e.g. 2 minutes each): gist.github.com/danstowell/84e

Client Info

Server: https://mastodon.social
Version: 2025.07
Repository: https://github.com/cyevgeniy/lmst