Elastic Malware Benchmark for Empowering Researchers 2017 Part 2

ember_dataset_2017_2.tar.bz2 1.75GB
Type: Dataset
Tags:
Abstract:

The EMBER dataset is a collection of features from PE files that serve as a benchmark dataset for researchers. The EMBER2017 dataset contained features from 1.1 million PE files scanned in or before 2017 and the EMBER2018 dataset contains features from 1 million PE files scanned in or before 2018. This repository makes it easy to reproducibly train the benchmark models, extend the provided feature set, or classify new PE files with the benchmark models.

This paper describes many more details about the dataset: https://arxiv.org/abs/1804.04637

Cite

H. Anderson and P. Roth, "EMBER: An Open Dataset for Training Static PE Malware Machine Learning Models”, in ArXiv e-prints. Apr. 2018.


@ARTICLE{2018arXiv180404637A,
  author = {{Anderson}, H.~S. and {Roth}, P.},
  title = "{EMBER: An Open Dataset for Training Static PE Malware Machine Learning Models}",
  journal = {ArXiv e-prints},
  archivePrefix = "arXiv",
  eprint = {1804.04637},
  primaryClass = "cs.CR",
  keywords = {Computer Science - Cryptography and Security},
  year = 2018,
  month = apr,
  adsurl = {http://adsabs.harvard.edu/abs/2018arXiv180404637A},
}



URL: https://github.com/elastic/ember
License: https://opensource.org/licenses/MIT

Bibtex:
@article{,
title= {Elastic Malware Benchmark for Empowering Researchers 2017 Part 2},
keywords= {},
author= {},
abstract= {The EMBER dataset is a collection of features from PE files that serve as a benchmark dataset for researchers. The EMBER2017 dataset contained features from 1.1 million PE files scanned in or before 2017 and the EMBER2018 dataset contains features from 1 million PE files scanned in or before 2018. This repository makes it easy to reproducibly train the benchmark models, extend the provided feature set, or classify new PE files with the benchmark models.

This paper describes many more details about the dataset: https://arxiv.org/abs/1804.04637

# Cite
H. Anderson and P. Roth, "EMBER: An Open Dataset for Training Static PE Malware Machine Learning Models”, in ArXiv e-prints. Apr. 2018.

```

@ARTICLE{2018arXiv180404637A,
  author = {{Anderson}, H.~S. and {Roth}, P.},
  title = "{EMBER: An Open Dataset for Training Static PE Malware Machine Learning Models}",
  journal = {ArXiv e-prints},
  archivePrefix = "arXiv",
  eprint = {1804.04637},
  primaryClass = "cs.CR",
  keywords = {Computer Science - Cryptography and Security},
  year = 2018,
  month = apr,
  adsurl = {http://adsabs.harvard.edu/abs/2018arXiv180404637A},
}
```


https://i.imgur.com/eor0Szg.png
},
terms= {},
license= {https://opensource.org/licenses/MIT},
superseded= {},
url= {https://github.com/elastic/ember}
}


Send Feedback Start
   0.000010
DB Connect
   0.000386
Lookup hash in DB
   0.006501
Get torrent details
   0.006784
Get torrent details, finished
   0.000770
Get authors
   0.000020
Select authors
   0.003260
Parse bibtex
   0.000965
Write header
   0.000694
get stars
   0.008090
home tab
   0.003209
render right panel
   0.000069
render ads
   0.000130
fetch current hosters
   0.024923
Done