Reddit comments/submissions 2005-06 to 2021-06
stuck_in_the_matrix and Watchful1


This entry has been superseded by a newer one. Click here to view it!
folder reddit (380 files)
filecomments/RC_2005-12.zst 143.12kB
filecomments/RC_2006-01.zst 403.90kB
filecomments/RC_2006-02.zst 1.01MB
filecomments/RC_2006-03.zst 1.39MB
filecomments/RC_2006-04.zst 2.11MB
filecomments/RC_2006-05.zst 2.85MB
filecomments/RC_2006-06.zst 3.05MB
filecomments/RC_2006-07.zst 3.77MB
filecomments/RC_2006-08.zst 5.16MB
filecomments/RC_2006-09.zst 5.20MB
filecomments/RC_2006-10.zst 5.24MB
filecomments/RC_2006-11.zst 6.00MB
filecomments/RC_2006-12.zst 6.03MB
filecomments/RC_2007-01.zst 8.10MB
filecomments/RC_2007-02.zst 9.26MB
filecomments/RC_2007-03.zst 10.42MB
filecomments/RC_2007-04.zst 11.46MB
filecomments/RC_2007-05.zst 15.44MB
filecomments/RC_2007-06.zst 15.93MB
filecomments/RC_2007-07.zst 18.31MB
filecomments/RC_2007-08.zst 19.70MB
filecomments/RC_2007-09.zst 22.83MB
filecomments/RC_2007-10.zst 24.30MB
filecomments/RC_2007-11.zst 30.04MB
filecomments/RC_2007-12.zst 32.21MB
filecomments/RC_2008-01.zst 39.38MB
filecomments/RC_2008-02.zst 38.51MB
filecomments/RC_2008-03.zst 40.37MB
filecomments/RC_2008-04.zst 41.38MB
filecomments/RC_2008-05.zst 47.17MB
filecomments/RC_2008-06.zst 51.25MB
filecomments/RC_2008-07.zst 53.07MB
filecomments/RC_2008-08.zst 53.10MB
filecomments/RC_2008-09.zst 60.34MB
filecomments/RC_2008-10.zst 69.55MB
filecomments/RC_2008-11.zst 68.83MB
filecomments/RC_2008-12.zst 75.36MB
filecomments/RC_2009-01.zst 92.81MB
filecomments/RC_2009-02.zst 85.02MB
filecomments/RC_2009-03.zst 96.89MB
filecomments/RC_2009-04.zst 100.45MB
filecomments/RC_2009-05.zst 112.60MB
filecomments/RC_2009-06.zst 119.03MB
filecomments/RC_2009-07.zst 138.80MB
filecomments/RC_2009-08.zst 164.80MB
filecomments/RC_2009-09.zst 188.21MB
filecomments/RC_2009-10.zst 214.31MB
filecomments/RC_2009-11.zst 208.18MB
filecomments/RC_2009-12.zst 237.63MB
Too many files! Click here to view them all.
Type: Dataset
Tags: reddit
Abstract:

Reddit comments and submissions from 2005-06 to 2021-06 collected by pushshift which can be found here https://files.pushshift.io/reddit/

These are zstandard compressed ndjson files. Example python scripts for parsing the data can be found here https://github.com/Watchful1/PushshiftDumps


License: No license specified, the work may be protected by copyright.

Bibtex:
@article{,
title= {Reddit comments/submissions 2005-06 to 2021-06},
journal= {},
author= {stuck_in_the_matrix and Watchful1},
year= {},
url= {},
abstract= {Reddit comments and submissions from 2005-06 to 2021-06 collected by pushshift which can be found here https://files.pushshift.io/reddit/

These are zstandard compressed ndjson files. Example python scripts for parsing the data can be found here https://github.com/Watchful1/PushshiftDumps},
keywords= {reddit},
terms= {},
license= {},
superseded= {https://academictorrents.com/details/ba051999301b109eab37d16f027b3f49ade2de13}
}

Hosted by users:

Send Feedback Start
   0.000003
DB Connect
   0.003178
Lookup hash in DB
   0.003570
Get torrent details
   0.011163
Get torrent details, finished
   0.000759
Get authors
   0.000057
Parse bibtex
   0.000348
Write header
   0.000734
get stars
   0.000568
home tab
   0.012520
render right panel
   0.000069
render ads
   0.000150
fetch current hosters
   0.036027
Done