OpenWebText-urls-26M-filtered.xz
eukaryote and jcpeterson

OpenWebText-urls-26M-filtered.xz 480.28MB
Type: Dataset

Bibtex:
@article{,
title= {OpenWebText-urls-26M-filtered.xz},
journal= {},
author= {eukaryote and jcpeterson},
year= {},
url= {https://github.com/eukaryote31/openwebtext},
abstract= {Every outbound Reddit link posted before 31 Dec 2018 with at least 3 karma. The list is filtered to remove image sites, non-scraper-friendly sites, and other media files.},
keywords= {WebText, Reddit, gpt2},
terms= {},
license= {},
superseded= {}
}
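
Reading the list: the following is a minimal Python sketch for streaming the URL list without decompressing it to disk first. It assumes the download is a single xz-compressed text stream with one URL per line; the listing does not document the internal layout, so check the archive first (if it turns out to be a tar archive, the standard tarfile module would be needed instead). The helper names iter_urls and top_domains are hypothetical and are not part of the openwebtext repository.

```python
import lzma
from collections import Counter
from urllib.parse import urlsplit


def iter_urls(path):
    """Yield non-empty, stripped lines from an xz-compressed text file.

    Assumption: the file holds one URL per line as UTF-8 text.
    """
    with lzma.open(path, mode="rt", encoding="utf-8", errors="replace") as handle:
        for line in handle:
            url = line.strip()
            if url:
                yield url


def top_domains(urls, n=10):
    """Return the n most frequent host names among the given URLs."""
    counts = Counter(urlsplit(url).netloc.lower() for url in urls)
    return counts.most_common(n)


if __name__ == "__main__":
    # The file name is taken from the listing; adjust the path to the local copy.
    for domain, count in top_domains(iter_urls("OpenWebText-urls-26M-filtered.xz")):
        print(domain, count)
```

Streaming line by line keeps memory use low while iterating, which matters for a list of this size; only the per-domain counter is held in memory.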


