MS-Celeb-1M: {A} Dataset and Benchmark for Large-Scale Face Recognition
Yandong Guo and Lei Zhang and Yuxiao Hu and Xiaodong He and Jianfeng Gao

folder MS-Celeb-1M (7 files)
fileREADME.md 1.16kB
fileREADME.txt 1.16kB
filedata/aligned_face_images/FaceImageCroppedWithAlignment.tsv 91.12GB
filedata/croped_face_images/FaceImageCroppedWithOutAlignment.tsv 155.27GB
filesamples_0.jpg 28.81kB
filesamples_1.jpg 28.81kB
filesamples_2.jpg 28.81kB
Type: Dataset
Tags:

Bibtex:
@article{dblp:journals/corr/guozhhg16,
author= {Yandong Guo and               Lei Zhang and               Yuxiao Hu and               Xiaodong He and               Jianfeng Gao},
title= {MS-Celeb-1M: {A} Dataset and Benchmark for Large-Scale Face Recognition},
journal= {CoRR},
volume= {abs/1607.08221},
year= {2016},
url= {http://arxiv.org/abs/1607.08221},
archiveprefix= {arXiv},
eprint= {1607.08221},
timestamp= {Mon, 13 Aug 2018 16:46:27 +0200},
biburl= {https://dblp.org/rec/bib/journals/corr/GuoZHHG16},
bibsource= {dblp computer science bibliography, https://dblp.org},
abstract= {In this paper, we design a benchmark task and provide the associated datasets for recognizing face images and link them to corresponding entity keys in a knowledge base. More specifically, we propose a benchmark task to recognize one million celebrities from their face images, by using all the possibly collected face images of this individual on the web as training data. The rich information provided by the knowledge base helps to conduct disambiguation and improve the recognition accuracy, and contributes to various real-world applications, such as image captioning and news video analysis. Associated with this task, we design and provide concrete measurement set, evaluation protocol, as well as training data. We also present in details our experiment setup and report promising baseline results. Our benchmark task could lead to one of the largest classification problems in computer vision. To the best of our knowledge, our training dataset, which contains 10M images in version 1, is the largest publicly available one in the world.
},
keywords= {},
terms= {},
license= {},
superseded= {}
}


Send Feedback Start
   0.000006
DB Connect
   0.000488
Lookup hash in DB
   0.000725
Get torrent details
   0.000718
Get torrent details, finished
   0.000715
Get authors
   0.000087
Parse bibtex
   0.000669
Write header
   0.000715
get stars
   0.000565
home tab
   0.001130
render right panel
   0.000037
render ads
   0.000087
fetch current hosters
   0.001120
Done