irma.nps.gov-datastore

folder irma.nps.gov-datastore (14 files)
filezipfiles.txt.zst 10.08kB
filesteps.txt 0.38kB
fileREADME 1.14kB
fileprofiles.tar.zst 45.95MB
filepdfs.tar.zst 473.98GB
filepages.tar.zst 11.62MB
fileothers.tar.zst 252.37GB
filehtml.tar.zst 48.05MB
fileget-profile.sh 0.25kB
fileholdings.tar.zst 7.28MB
fileextracted-zip.tar.zst 2.18TB
fileget-holdings.sh 0.80kB
filedownload-page.sh 1.00kB
filedownload-file-types.txt.zst 631.36kB
Type: Dataset

Bibtex:
@article{,
title= {irma.nps.gov-datastore},
journal= {},
author= {},
year= {},
url= {},
abstract= {# IRMA NPS DataStore

A mirror of https://irma.nps.gov/DataStore/Search/Quick -- sent a search for the empty string, and crawled through all results and file downloads.

Data captured on 2025-03-04

This archive contains 3317 pages of search results, amounting to 165806 records
("references" in DataStore lingo)

pages.tar.zst contains all pages from search results, in JSON format.

profiles.tar.zst contains JSON metadata (description, date published, author)
for each reference.

holdings.tar.zst contains JSON file listings per reference, i.e. file
MIME-types and sizes.

html.tar.zst pdfs.tar.zst extracted-zip.tar.zst and others.tar.zst are the
actual downloaded files segmented by filetypes for compressability.

extracted-zip.tar.zst does not contain the original zipfiles but rather
extracted folders so that they can be more effectively recompressed by
ZStandard. The original total size of all zipfiles was 2.2 TiB, all data fully
extracted was 3.1 TiB.

Detailed code for how the data was scraped is available in steps.txt. Data is
packed in ZStandard-compressed tarballs with -9 --long to reduce torrent
metadata and disk usage.},
keywords= {national park service,irma,nps,gov,united states},
terms= {},
license= {},
superseded= {}
}


Send Feedback Start
   0.000010
DB Connect
   0.002017
Lookup hash in DB
   0.001198
Get torrent details
   0.000473
Get torrent details, finished
   0.000793
Get authors
   0.000002
Select authors
   0.000491
Parse bibtex
   0.000224
Write header
   0.000676
get stars
   0.000305
home tab
   0.011236
render right panel
   0.000015
render ads
   0.001415
fetch current hosters
   0.001315
Done