Data Is Plural

A weekly newsletter (and seasonal podcast) highlighting useful and curious datasets.

Data Is Pluraldataisplural
2023-06-14

Cody Winchester asks the question: "How many rat hairs in your macaroni before the FDA considers it adulterated?"

The agency's Food Defect Levels Handbook has the answer, in its table listing the “maximum levels of natural or unavoidable defects in foods for human use that present no health hazard”: fda.gov/food/ingredients-addit

Winchester has converted the table to JSON: github.com/cjwinchester/fda-fo

Featured in today's edition of Data Is Plural: data-is-plural.com/archive/202

Data Is Pluraldataisplural
2023-06-14

The Markup has obtained, analyzed, and published a spreadsheet of 650,000+ ad-targetable “audience segments” and their data suppliers: themarkup.org/privacy/2023/06/

The data, which until recently had been linked from an ad platform's website: github.com/the-markup/xandr-au

Featured in today's edition of Data Is Plural: data-is-plural.com/archive/202

Data Is Pluraldataisplural
2023-06-14

Spotlight PA and the Pittsburgh Institute for Nonprofit Journalism have shared data on 697 criminal cases that involved competency hearings, based on state court records: github.com/spotlightpa/compete

The newsrooms used the data for an investigation into Pennsylvania's competency system earlier this year: spotlightpa.org/news/2023/03/p

The data and story are featured in today's edition of Data Is Plural: data-is-plural.com/archive/202

Data Is Pluraldataisplural
2023-06-14

Reuters and Big Local News teamed up to extract data on 43,000+ climate finance contributions from wealthy to developing countries. Some funding went to "questionable" projects “including a coal plant, a hotel and chocolate shops," according their investigation: reuters.com/investigates/speci

How to get and use their data: biglocalnews.org/content/news/

Featured in today's edition of Data Is Plural: data-is-plural.com/archive/202

Data Is Pluraldataisplural
2023-06-14

Internationally, there's also the Global Wildfire Information System: gwis.jrc.ec.europa.eu/

(Previously featured in DIP 2022.07.27: data-is-plural.com/archive/202.)

Data Is Pluraldataisplural
2023-06-14

Another government organization, the Canadian Interagency Forest Fire Centre (ciffc.ca/), maintains a dashboard of active fires: ciffc.net/

Data Is Pluraldataisplural
2023-06-14

The Canadian Wildland Fire Information System monitors provides maps and datasets of fires and fire weather in the country: cwfis.cfs.nrcan.gc.ca/home

Read more in today's edition of Data Is Plural: data-is-plural.com/archive/202

Data Is Pluraldataisplural
2023-06-07

“Ransomware negotiations are usually not shared widely, limiting the understanding of the process,” writes Valéry Marchive, whose new repository of chat transcripts — github.com/Casualtek/Ransomcha — “aims at changing that, in a respectful manner for the victims of cyberattacks: chats are anonymized as long as the victim hasn’t been publicly disclosed, either by the attackers or in the media.”

Featured in today's edition of Data Is Plural: data-is-plural.com/archive/202 via @duncangeere

Data Is Pluraldataisplural
2023-06-07

Harvard University's library system provides several ways to access detailed metadata about its holdings: library.harvard.edu/services-t

... including its LibraryCloud API: wiki.harvard.edu/confluence/di

... and bulk downloads: dataverse.harvard.edu/dataset.

Featured in today's edition of Data Is Plural: data-is-plural.com/archive/202

Data Is Pluraldataisplural
2023-06-07

Recent work by Megan Kang and Elizabeth Rasich “extends an existing proxy for household gun ownership rates — the rate of firearm suicide divided by suicide (FSS) — from 1949 to 2020, including new coverage for the 1949 to 1972 period”: papers.ssrn.com/sol3/papers.cf

Dataset: dataverse.harvard.edu/dataset.

Featured in today's edition of Data Is Plural: data-is-plural.com/archive/202

Data Is Pluraldataisplural
2023-06-07

The FAA, which regulates the US commercial space transportation industry, publishes HTML tables of licensed launches (552 of them since 1989), permits for experimental operations, and more: faa.gov/data_research/commerci

Featured in today's edition of Data Is Plural: data-is-plural.com/archive/202

Data Is Pluraldataisplural
2023-06-07

A recent article in Science, by Virginia Gewin, describes the pushback against USGS’s decision “to reduce the number of chemicals it tracks and to release updates less frequently”: science.org/content/article/mo

Data Is Pluraldataisplural
2023-06-07

As part of its National Water-Quality Assessment Project, the US Geological Survey publishes maps and datasets that estimate local pesticide usage: water.usgs.gov/nawqa/pnsp/usag

More details in today's edition of Data Is Plural: data-is-plural.com/archive/202

Data Is Pluraldataisplural
2023-05-31

NYC’s Department of Buildings publishes a map and dataset of active permits for “sidewalk sheds,” the ubiquitous, temporary structures (often colloquially called scaffolding) meant to shield pedestrians from falling debris: nyc.gov/assets/buildings/html/

More details and visualizations, by @zhik@urbanists.social: observablehq.com/@betanyc/what

Both featured in today's edition of Data Is Plural: data-is-plural.com/archive/202

Data Is Pluraldataisplural
2023-05-31

In a recent paper, the Scientific Committee on Antarctic Research’s GeoMAP team describes building the “first detailed geological map dataset covering all of Antarctica,” assembled and refined from 589 sources: nature.com/articles/s41597-023

Featured in today's edition of Data Is Plural: data-is-plural.com/archive/202

Data Is Pluraldataisplural
2023-05-31

Claudio Feliciani et al. have compiled a dataset of 281 crowd accidents from 1900 to 2019: sciencedirect.com/science/arti

Featured in today's edition of Data Is Plural: data-is-plural.com/archive/202

Data Is Pluraldataisplural
2023-05-31

The Census Bureau’s County Business Patterns datasets indicate the number establishments and (noise-infused) employee counts and payroll figures, disaggregated by industry code: census.gov/programs-surveys/cb

And they're not just for counties, but for a range of other geographic units, including states, congressional districts, metro areas, and ZIP codes.

Featured in today's edition of Data Is Plural: data-is-plural.com/archive/202

Data Is Pluraldataisplural
2023-05-31

The Prison Policy Initiative has cross-referenced data from the Bureau of Justice Statistics and several other sources to count/estimate the number of people under eight forms of “correctional control” in each state and DC: prisonpolicy.org/reports/corre

- federal prisons
- state prisons
- local jails
- Indian Country jails
- youth confinement
- involuntary commitment
- parole
- probation

Featured in today's edition of Data Is Plural: data-is-plural.com/archive/202

Data Is Pluraldataisplural
2023-05-24

Gero Laurenz Höhn et al. (tandfonline.com/doi/full/10.10) have collected the prices of protected-origin and non-protected European ham from dozens of supermarket websites: dataverse.nl/dataset.xhtml?per

Featured in this week's edition of Data Is Plural: data-is-plural.com/archive/202

Data Is Pluraldataisplural
2023-05-24

The Parking Reform Network, a US-based nonprofit that aims “to discourage the building of too much parking supply,” has compiled a map and dataset of ~1,400 relevant local mandates: parkingreform.org/resources/ma

Featured in today's edition of Data Is Plural: data-is-plural.com/archive/202

Client Info

Server: https://mastodon.social
Version: 2025.04
Repository: https://github.com/cyevgeniy/lmst