The Pile An 800GB Dataset of Diverse Text for Language Modeling
EleutherAI

Name DL Added Torrents Total Size

Send Feedback Start
   0.000004
DB Connect
   0.000416
Lookup hash in DB
   0.000386
Get torrent details
   0.000107
Get torrent details, finished
   0.000191
Get authors
   0.000023
Parse bibtex
   0.000064
Write header
   0.000192
get stars
   0.000109
collections tab
   0.000425
render right panel
   0.000007
render ads
   0.000328
fetch current hosters
   0.000201
related datasets
   0.001586
Done