The Pile: An 800GB Dataset of Diverse Text for Language Modeling
EleutherAI

