Documents and Dependencies: an Exploration of Vector Space Models for Semantic Composition
Alona Fyshe

conll2013.zip 1.32GB
Type: Dataset
Tags: Dataset

Bibtex:
@article{,
title = {Documents and Dependencies: an Exploration of Vector Space Models for Semantic Composition},
journal = {},
author = {Alona Fyshe},
year = {2013},
url = {https://www.cs.cmu.edu/~afyshe/},
abstract = {This zip should contain 4 files:
- README.txt (this file)
- doc2Dep20MWU57k_1000concat2000.tab
- doc2Dep20MWU57k_1000concat2000.txt
- doc2Dep20MWU57k_1000concat2000.mat

****doc2Dep20MWU57k_1000concat2000.tab****
This file contains the 54975 word-units with POS tags.  The order of the words in this file corresponds to the order of the rows in doc2Dep20MWU57k_1000concat2000.tab

****doc2Dep20MWU57k_1000concat2000.tab****
This tab-separated-value file contains the concatenated SVD matrices as created described in  "Documents and Dependencies: an Exploration of Vector Space Models for Semantic Composition"(Fyshe 2013).  The size of the matrix is 54975x2000.  The first 1000 dimensions are Document dimensions, the second 1000 (1001-2000) are Dependency dimensions.  The rows appear in the same order as the word-units in doc2Dep20MWU57k_1000concat2000.txt

****doc2Dep20MWU57k_1000concat2000.mat****
For convenience, this is the data contained in doc2Dep20MWU57k_1000concat2000.tab & doc2Dep20MWU57k_1000concat2000.txt saved into two matlab variables.  count_matrix is the concatenated SVD matrices (tab file), words are the words (txt file).

Questions may be directed to Alona Fyshe, afyshe at cs dot cmu dot edu.
}
}

Send Feedback Start
   0.000006
DB Connect
   0.000497
Lookup hash in DB
   0.003838
Get torrent details
   0.000673
Get torrent details, finished
   0.000803
Get authors
   0.000006
Select authors
   0.000496
Parse bibtex
   0.000602
Write header
   0.000628
get stars
   0.000520
home tab
   0.000539
render right panel
   0.000009
render ads
   0.000043
fetch current hosters
   0.001994
Done