irma.nps.gov-datastore

folder irma.nps.gov-datastore (14 files)
filezipfiles.txt.zst 10.08kB
filesteps.txt 0.38kB
fileREADME 1.14kB
fileprofiles.tar.zst 45.95MB
filepdfs.tar.zst 473.98GB
filepages.tar.zst 11.62MB
fileothers.tar.zst 252.37GB
filehtml.tar.zst 48.05MB
fileget-profile.sh 0.25kB
fileholdings.tar.zst 7.28MB
fileextracted-zip.tar.zst 2.18TB
fileget-holdings.sh 0.80kB
filedownload-page.sh 1.00kB
filedownload-file-types.txt.zst 631.36kB
Type: Dataset

Bibtex:
@article{,
title= {irma.nps.gov-datastore},
journal= {},
author= {},
year= {},
url= {},
abstract= {# IRMA NPS DataStore

A mirror of https://irma.nps.gov/DataStore/Search/Quick -- sent a search for the empty string, and crawled through all results and file downloads.

Data captured on 2025-03-04

This archive contains 3317 pages of search results, amounting to 165806 records
("references" in DataStore lingo)

pages.tar.zst contains all pages from search results, in JSON format.

profiles.tar.zst contains JSON metadata (description, date published, author)
for each reference.

holdings.tar.zst contains JSON file listings per reference, i.e. file
MIME-types and sizes.

html.tar.zst pdfs.tar.zst extracted-zip.tar.zst and others.tar.zst are the
actual downloaded files segmented by filetypes for compressability.

extracted-zip.tar.zst does not contain the original zipfiles but rather
extracted folders so that they can be more effectively recompressed by
ZStandard. The original total size of all zipfiles was 2.2 TiB, all data fully
extracted was 3.1 TiB.

Detailed code for how the data was scraped is available in steps.txt. Data is
packed in ZStandard-compressed tarballs with -9 --long to reduce torrent
metadata and disk usage.},
keywords= {national park service,irma,nps,gov,united states},
terms= {},
license= {},
superseded= {}
}


Send Feedback Start
   0.000011
DB Connect
   0.001526
Lookup hash in DB
   0.001599
Get torrent details
   0.000590
Get torrent details, finished
   0.001340
Get authors
   0.000003
Select authors
   0.000707
Parse bibtex
   0.000363
Write header
   0.000757
get stars
   0.000348
home tab
   0.001204
render right panel
   0.000008
render ads
   0.001369
fetch current hosters
   0.000862
Done