irma.nps.gov-datastore

irma.nps.gov-datastore (14 files)
zipfiles.txt.zst             10.08kB
steps.txt                    0.38kB
README                       1.14kB
profiles.tar.zst             45.95MB
pdfs.tar.zst                 473.98GB
pages.tar.zst                11.62MB
others.tar.zst               252.37GB
html.tar.zst                 48.05MB
get-profile.sh               0.25kB
holdings.tar.zst             7.28MB
extracted-zip.tar.zst        2.18TB
get-holdings.sh              0.80kB
download-page.sh             1.00kB
download-file-types.txt.zst  631.36kB
Type: Dataset
Tags: gov, united states, national park service, irma, nps

Bibtex:
@article{,
title= {irma.nps.gov-datastore},
journal= {},
author= {},
year= {},
url= {},
abstract= {# IRMA NPS DataStore

A mirror of https://irma.nps.gov/DataStore/Search/Quick, built by submitting a search for the empty string and crawling every result and file download.

Data captured on 2025-03-04

This archive contains 3317 pages of search results, totaling 165806 records
("references" in DataStore lingo).
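
The two counts above can be cross-checked with a little arithmetic. The sketch below assumes that every search page except the last holds the same number of records (an assumption, not something the README states) and looks for the page sizes consistent with both figures. The function name is made up for illustration.

```python
TOTAL_RECORDS = 165806  # record count quoted in the README
TOTAL_PAGES = 3317      # page count quoted in the README

def consistent_page_sizes(records, pages, max_size=1000):
    """Return every uniform page size p with ceil(records / p) == pages."""
    # -(-a // b) is integer ceiling division, which avoids float rounding.
    return [p for p in range(1, max_size + 1) if -(-records // p) == pages]

sizes = consistent_page_sizes(TOTAL_RECORDS, TOTAL_PAGES)
print(sizes)  # [50]

# Records left over for the final page under that page size.
last_page = TOTAL_RECORDS - (TOTAL_PAGES - 1) * sizes[0]
print(last_page)  # 6
```

Under that assumption the figures agree with 50 records per page and 6 records on the last page.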

pages.tar.zst contains all pages from search results, in JSON format.

profiles.tar.zst contains JSON metadata (description, date published, author)
for each reference.

holdings.tar.zst contains JSON file listings per reference, i.e. file
MIME-types and sizes.

html.tar.zst, pdfs.tar.zst, extracted-zip.tar.zst and others.tar.zst hold the
actual downloaded files, split by file type for better compressibility.
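
Because the larger tarballs run to hundreds of gigabytes, it can help to list their contents without unpacking them to disk. The following is a minimal sketch, not part of the archive's own scripts: it assumes the `zstd` command-line tool is on PATH, and the function names are hypothetical. It passes `--long=31` to raise the decoder's window limit as a precaution; archives written with plain `--long` should normally decompress without it.

```python
import subprocess
import tarfile

def list_members(tar_stream):
    """Yield (name, size) for each regular file in an uncompressed tar stream."""
    # Mode "r|" reads the tar sequentially, so the stream need not be seekable.
    with tarfile.open(fileobj=tar_stream, mode="r|") as archive:
        for member in archive:
            if member.isfile():
                yield member.name, member.size

def list_zst_tarball(path):
    """Decompress path on the fly with the zstd CLI and list its files."""
    proc = subprocess.Popen(
        ["zstd", "-dc", "--long=31", path],
        stdout=subprocess.PIPE,
    )
    try:
        yield from list_members(proc.stdout)
    finally:
        # The exit status is not checked here; a real tool probably should.
        proc.stdout.close()
        proc.wait()
```

For example, `for name, size in list_zst_tarball("pages.tar.zst"): print(name, size)` would print one line per archived file.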

extracted-zip.tar.zst does not contain the original zip files but rather their
extracted folders, so that the contents can be recompressed more effectively by
Zstandard. The original zip files totaled 2.2 TiB; fully extracted, the data
came to 3.1 TiB.

The exact steps used to scrape the data are recorded in steps.txt. The data is
packed in Zstandard-compressed tarballs with -9 --long to reduce torrent
metadata and disk usage.},
keywords= {national park service,irma,nps,gov,united states},
terms= {},
license= {},
superseded= {}
}

