OpenWebText-urls-26M-filtered.xz
eukaryote and jcpeterson

OpenWebText-urls-26M-filtered.xz 480.28MB
Type: Dataset

Bibtex:
@article{,
title= {OpenWebText-urls-26M-filtered.xz},
journal= {},
author= {eukaryote and jcpeterson},
year= {},
url= {https://github.com/eukaryote31/openwebtext},
abstract= {Every outbound reddit link from before 31. Dec 2018 with at least 3 karma. The list is filtered to remove image sites, non-scraper-friendly sites, and other media files. },
keywords= {WebText, Reddit, gpt2},
terms= {},
license= {},
superseded= {}
}



Send Feedback Start
   0.000012
DB Connect
   0.001198
Lookup hash in DB
   0.001099
Get torrent details
   0.000391
Get torrent details, finished
   0.000800
Get authors
   0.000031
Parse bibtex
   0.000266
Write header
   0.000753
get stars
   0.000349
home tab
   0.001557
render right panel
   0.000037
render ads
   0.001030
fetch current hosters
   0.001116
Done