usace.contentdm.oclc.org

usace.contentdm.oclc.org (15 files)
steps.txt 2.41kB
README 0.52kB
pdfs.tar.zst 482.59GB
pages.tar.zst 10.65MB
others.tar.zst 25.37GB
old/steps.txt 1.94kB
old/pages.tar.zst 386.23kB
old/README 0.36kB
old/items.tar.zst 1.93MB
old/item-links.txt.zst 17.63kB
jp2s.tar.zst 113.83GB
items.tar.zst 10.83MB
item-links.txt.zst 2.28MB
download-urls.txt.zst 79.09kB
file-types.txt.zst 202.76kB
Type: Dataset

Bibtex:
@article{,
title= {usace.contentdm.oclc.org},
journal= {},
author= {},
year= {},
url= {},
abstract= {**U.S. Army Corps of Engineers Digital Library**

An almost complete mirror of https://usace.contentdm.oclc.org/

Data captured from 2025-02-28 to 2025-03-02

Metadata was downloaded in JSON format and is available in pages.tar.zst and
items.tar.zst.

Downloaded files are split by file type across the other .tar.zst archives:
pdfs.tar.zst contains only PDF files, jp2s.tar.zst contains only JPEG 2000
files, and so on.

download-urls.txt.zst and item-links.txt.zst are intermediate artifacts
from scraping. steps.txt contains the shell scripts used to produce this dataset.},
keywords= {usace,usa,united states,gov},
terms= {},
license= {},
superseded= {}
}
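
The archives described in the abstract are zstd-compressed tarballs, and the largest ones are too big to unpack casually. The following is a minimal sketch, not part of the dataset's own tooling, of how one might list an archive's contents without extracting it. It assumes the `zstd` command-line tool is installed and on `PATH`; the function names `iter_tar_members` and `list_tar_zst` are hypothetical.

```python
import subprocess
import tarfile
from typing import BinaryIO, Iterator, Tuple


def iter_tar_members(stream: BinaryIO) -> Iterator[Tuple[str, int]]:
    """Yield (name, size) for each regular file in an uncompressed tar stream."""
    # Mode "r|" reads the tar sequentially, so the stream never needs to seek.
    with tarfile.open(fileobj=stream, mode="r|") as archive:
        for member in archive:
            if member.isfile():
                yield member.name, member.size


def list_tar_zst(path: str) -> Iterator[Tuple[str, int]]:
    """List a .tar.zst archive by piping it through the zstd CLI.

    Assumption: `zstd` is installed; `-d` decompresses and `-c` writes to stdout.
    """
    proc = subprocess.Popen(["zstd", "-dc", path], stdout=subprocess.PIPE)
    try:
        yield from iter_tar_members(proc.stdout)
    finally:
        # Close the pipe and reap the child even if the caller stops early.
        proc.stdout.close()
        proc.wait()
```

For example, `for name, size in list_tar_zst("items.tar.zst"): print(size, name)` would print one line per file in the metadata archive. Streaming keeps memory use flat, which matters for archives the size of pdfs.tar.zst.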

