←back to thread

CDC data are disappearing

(www.theatlantic.com)
749 points doener | 1 comments | | HN request time: 0.214s | source
Show context
foresto ◴[] No.42904468[source]
Maybe useful to someone:

https://archive.org/details/20250128-cdc-datasets

"""

An archive of all CDC datasets uploaded to https://data.cdc.gov/browse before January 28th, 2025. Excludes corrupt datasets and data not publicly accessible.

Most datasets are accompanied by an additional file ending in -meta that includes the metadata associated with the data. Attachments referenced in these files can be found in the attachments/ folder.

If you would like to seed this data to improve its redundancy please do not use the auto generated torrent, as it is incomplete. Instead use the torrent file labeled "full-20250128-cdc-datasets-USETHIS.torrent"

"""

replies(2): >>42904790 #>>42905826 #
grumple ◴[] No.42904790[source]
Thanks, that is useful. Are there any other efforts to archive all the data on government websites? I suppose we could crawl archive.org.
replies(1): >>42906229 #
abracadaniel ◴[] No.42906229[source]
It will only take an order making the data illegal to host to have it removed. More copies are critical.
replies(1): >>42907309 #
1. 59nadir ◴[] No.42907309[source]
This means nothing if you host it on non-US servers. No one would take it seriously internationally.