The Pile An 800GB Dataset of Diverse Text for Language Modeling
EleutherAI

Name DL Torrents Total Size

Send Feedback Start
   0.000005
DB Connect
   0.000513
Lookup hash in DB
   0.002627
Get torrent details
   0.000663
Get torrent details, finished
   0.000606
Get authors
   0.000093
Parse bibtex
   0.000553
Write header
   0.000601
get stars
   0.000553
collections tab
   0.001387
home tab
   0.004581
render right panel
   0.000038
render ads
   0.000088
fetch current hosters
   0.042825
Done