The Pile: An 800GB Dataset of Diverse Text for Language Modeling
EleutherAI
