OpenWebText-urls-26M-filtered.xz
eukaryote and jcpeterson

OpenWebText-urls-26M-filtered.xz 480.28MB
Type: Dataset

Bibtex:
@article{,
title= {OpenWebText-urls-26M-filtered.xz},
journal= {},
author= {eukaryote and jcpeterson},
year= {},
url= {https://github.com/eukaryote31/openwebtext},
abstract= {Every outbound reddit link from before 31. Dec 2018 with at least 3 karma. The list is filtered to remove image sites, non-scraper-friendly sites, and other media files. },
keywords= {WebText, Reddit, gpt2},
terms= {},
license= {},
superseded= {}
}


Send Feedback Start
   0.000012
DB Connect
   0.001036
Lookup hash in DB
   0.001359
Get torrent details
   0.000515
Get torrent details, finished
   0.000862
Get authors
   0.000031
Parse bibtex
   0.000154
Write header
   0.000616
get stars
   0.000323
home tab
   0.002816
render right panel
   0.000010
render ads
   0.000977
fetch current hosters
   0.000894
Done