Twitter Data - NIPS 2012
J. McAuley and J. Leskovec

twitter.tar.gz 22.34MB
Type: Dataset
Tags: twitter, social networks, NIPS
Abstract:

This dataset consists of 'circles' (or 'lists') from Twitter. Twitter data was crawled from public sources. The dataset includes node features (profiles), circles, and ego networks.

##Dataset statistics

Attribute Value
Nodes 81306
Edges 1768149
Nodes in largest WCC 81306 (1.000)
Edges in largest WCC 1768149 (1.000)
Nodes in largest SCC 68413 (0.841)
Edges in largest SCC 1685163 (0.953)
Average clustering coefficient 0.5653
Number of triangles 13082506
Fraction of closed triangles 0.06415
Diameter (longest shortest path) 7
90-percentile effective diameter 4.5

##Source (citation)

J. McAuley and J. Leskovec. Learning to Discover Social Circles in Ego Networks. NIPS, 2012.

##Files:

Attribute Value
nodeId.edges The edges in the ego network for the node 'nodeId'. Edges are undirected for facebook, and directed (a follows b) for twitter and gplus. The 'ego' node does not appear, but it is assumed that they follow every node id that appears in this file.
nodeId.circles The set of circles for the ego node. Each line contains one circle, consisting of a series of node ids. The first entry in each line is the name of the circle.
nodeId.feat The features for each of the nodes that appears in the edge file.
nodeId.egofeat The features for the ego user.
nodeId.featnames The names of each of the feature dimensions. Features are '1' if the user has this property in their profile, and '0' otherwise. This file has been anonymized for facebook users, since the names of the features would reveal private data.


URL: http://snap.stanford.edu/data/egonets-Twitter.html
License: No license specified, the work may be protected by copyright.

Bibtex:
@article{,
title= {Twitter Data - NIPS 2012},
journal= {},
author= {J. McAuley and J. Leskovec},
year= {},
url= {http://snap.stanford.edu/data/egonets-Twitter.html},
license= {},
abstract= {This dataset consists of 'circles' (or 'lists') from Twitter. Twitter data was crawled from public sources. The dataset includes node features (profiles), circles, and ego networks.


##Dataset statistics

|Attribute|Value|
|---------|-------|
|Nodes|	81306|
|Edges	|1768149|
|Nodes in largest WCC	|81306 (1.000)|
|Edges in largest WCC|	1768149 (1.000)|
|Nodes in largest SCC	|68413 (0.841)|
|Edges in largest SCC|	1685163 (0.953)|
|Average clustering coefficient	|0.5653|
|Number of triangles|	13082506|
|Fraction of closed triangles|	0.06415|
|Diameter (longest shortest path)|	7|
|90-percentile effective diameter	|4.5|

##Source (citation)

	J. McAuley and J. Leskovec. Learning to Discover Social Circles in Ego Networks. NIPS, 2012.

##Files:

|Attribute|Value|
|---------|-------|
|nodeId.edges |The edges in the ego network for the node 'nodeId'. Edges are undirected for facebook, and directed (a follows b) for twitter and gplus. The 'ego' node does not appear, but it is assumed that they follow every node id that appears in this file.|
|nodeId.circles |The set of circles for the ego node. Each line contains one circle, consisting of a series of node ids. The first entry in each line is the name of the circle.|
|nodeId.feat |The features for each of the nodes that appears in the edge file.|
|nodeId.egofeat |The features for the ego user.|
|nodeId.featnames |The names of each of the feature dimensions. Features are '1' if the user has this property in their profile, and '0' otherwise. This file has been anonymized for facebook users, since the names of the features would reveal private data.|},
keywords= {twitter, social networks, NIPS},
terms= {}
}


Send Feedback Start
   0.000003
DB Connect
   0.000255
Lookup hash in DB
   0.000496
Get torrent details
   0.000450
Get torrent details, finished
   0.000460
Get authors
   0.000055
Parse bibtex
   0.000453
Write header
   0.000420
get stars
   0.000311
home tab
   0.000375
render right panel
   0.000006
render ads
   0.000029
fetch current hosters
   0.000697
Done