The Pile An 800GB Dataset of Diverse Text for Language Modeling
EleutherAI

Name DL Added Torrents Total Size

Send Feedback Start
   0.000007
DB Connect
   0.000484
Lookup hash in DB
   0.000467
Get torrent details
   0.000124
Get torrent details, finished
   0.000228
Get authors
   0.000026
Parse bibtex
   0.000144
Write header
   0.000246
get stars
   0.000142
collections tab
   0.000501
render right panel
   0.000005
render ads
   0.000486
fetch current hosters
   0.000335
related datasets
   0.001897
Done