Common Crawl corpus - training-parallel-commoncrawl.tgz (CS-EN, DE-EN, ES-EN, FR-EN, RU-EN)

Name DL Added Torrents Total Size
Text
32 233.75GB 260 0
