30M Factoid Question-Answer Corpus (30MQA)
Iulian Vlad Serban and Alberto García-Durán and Caglar Gulcehre and Sungjin Ahn and Sarath Chandar and Aaron Courville and Yoshua Bengio

folder 30MQA (2 files)
file30MQA_1.tar.gz 315.96MB
file30MQA_2.tar.gz 213.39MB
Type: Dataset
Tags:

Bibtex:
@article{,
title= {30M Factoid Question-Answer Corpus (30MQA)},
keywords= {},
author= {Iulian Vlad Serban and Alberto García-Durán and Caglar Gulcehre and Sungjin Ahn and Sarath Chandar and Aaron Courville and Yoshua Bengio},
abstract= {The 30M Factoid Question-Answer Corpus consists of 30M natural language questions in English and their corresponding facts in the knowledge base Freebase.

The dataset is formatted as a text file, where each line contains:

```
<subject> \t <relationship> \t <object> \t natural language question,
```
 
where <subject>, <relationship> and <object> are  the subject, relationship and object identifier in Freebase corresponding to the natural language question.

For a more detailed description, have a look at our paper:

Generating Factoid Questions With Recurrent Neural Networks: The 30M Factoid Question-Answer Corpus
http://arxiv.org/abs/1603.06807

Sample:

```
<http://rdf.freebase.com/ns/m.04whkz5>	www.freebase.com/book/written_work/subjects	<http://rdf.freebase.com/ns/m.01cj3p>	what is the book e about ?
<http://rdf.freebase.com/ns/m.0tp2p24>	www.freebase.com/music/release_track/release	<http://rdf.freebase.com/ns/m.0sjc7c1>	in what release does the release track cardiac arrest come from ?
<http://rdf.freebase.com/ns/m.04j0t75>	www.freebase.com/film/film/country	<http://rdf.freebase.com/ns/m.07ssc>	what country is the debt from ?
<http://rdf.freebase.com/ns/m.0ftqr>	www.freebase.com/music/producer/tracks_produced	<http://rdf.freebase.com/ns/m.0p600l>	what songs have nobuo uematsu produced ?
<http://rdf.freebase.com/ns/m.036p007>	www.freebase.com/music/release/producers	<http://rdf.freebase.com/ns/m.0677ng>	who produced eve-olution ?
<http://rdf.freebase.com/ns/m.0ms5mg>	www.freebase.com/music/recording/artist	<http://rdf.freebase.com/ns/m.0mjn2>	which artist recorded most of us are sad ?
```
},
terms= {},
license= {Creative Commons Attribution 3.0 Unported},
superseded= {},
url= {}
}

Hosted by users:

Send Feedback Start
   0.000005
DB Connect
   0.000447
Lookup hash in DB
   0.000632
Get torrent details
   0.000661
Get torrent details, finished
   0.000679
Get authors
   0.000136
Parse bibtex
   0.001246
Write header
   0.000608
get stars
   0.000592
home tab
   0.002154
render right panel
   0.000009
render ads
   0.000041
fetch current hosters
   0.001872
Done