Lerman Twitter 2010 Dataset
Kristina Lerman

folder twitter (3 files)
filelink_status_search_with_ordering_real_csv.zip 70.04MB
filedistinct_users_from_search_table_real_map.csv 28.08MB
fileactive_follower_real_sql.zip 194.05MB
Type: Dataset
Tags: twitter

Bibtex:
@article{,
title= {Lerman Twitter 2010 Dataset},
journal= {},
author= {Kristina Lerman },
year= {2010},
license= {This data is made available to the community for research purposes only},
url= {http://www.isi.edu/~lerman/downloads/twitter/twitter2010.html},
abstract= {Twitter_2010 data set contains tweets containing URLs that have been posted on Twitter during October 2010. In addition to tweets, we also the followee links of tweeting users, allowing us to reconstruct the follower graph of active (tweeting) users.
URLs	66,059
tweets	2,859,764
users	736,930
links	36,743,448
Tweets

Table (in csv format) link_status_search_with_ordering_real_csv contains tweets with the following information

link: URL within the text of the tweet
id: tweet id
create_at: date added to the db
create_at_long
inreplyto_screen_name: screen name of user this tweet is replying to
inreplyto_user_id: user id of user this tweet is replying to
source: device from which the tweet originated
bad_user_id: alternate user id
user_screen_name: tweeting user screen name
order_of_users: tweet's index within sequence of tweets of the same URL
user_id: user id
Table (in csv format) distinct_users_from_search_table_real_map contains names of tweeting users, and the following information for each user:

user_id: user id
user_screen_name: user name
indegree: number of followers
outdegree: number of friends/followees
bad_user_id: alternate user id
Follower graph

File active_follower_real_sql contains zipped SQL dump of links between tweeting users in the form:

user_id: user id
follower_id: user id of the follower
Empirical characterization of this data is described in 
Kristina Lerman, Rumi Ghosh, Tawan Surachawala (2012) "Social Contagion: An Empirical Study of Information Spread on Digg and Twitter Follower Graphs." This data is made available to the community for research purposes only. If you use the data in a publication, please cite the above paper.},
keywords= {twitter},
terms= {}
}


Send Feedback Start
   0.000007
DB Connect
   0.000478
Lookup hash in DB
   0.000447
Get torrent details
   0.000162
Get torrent details, finished
   0.000265
Get authors
   0.000048
Parse bibtex
   0.000173
Write header
   0.000243
get stars
   0.000112
home tab
   0.000307
render right panel
   0.000010
render ads
   0.000427
fetch current hosters
   0.000297
related datasets
   0.007130
Done