NLP
IIIT 5K word数据集,包含广告牌、招牌、门牌号、门牌、电影海报等查询词

101M

910

0

IIIT 5K word数据集,包含广告牌、招牌、门牌号、门牌、电影海报等查询词

NLP

Classification

IIIT 5K word数据集,包含广告牌、招牌、门牌号、门牌、电影海报等查询词前往PC端下载数据

Description

The IIIT 5K-word dataset is harvested from Google image search. Query words like billboards, signboard, house numbers, house name plates, movie posters were used to collect images. The dataset contains 5000 cropped word images from Scene Texts and born-digital images. The dataset is divided into train and test parts. This dataset can be used for large lexicon cropped word recognition. We also provide a lexicon of more than 0.5 million dictionary words with this dataset. 

Bibtex

If you use this dataset, please cite:

@InProceedings{MishraBMVC12,
  author    = "Mishra, A. and Alahari, K. and Jawahar, C.~V.",
  title     = "Scene Text Recognition using Higher Order Language Priors",
  booktitle = "BMVC",
  year      = "2012",
}

Contact

For any queries about the dataset feel free to contact Anand Mishra.         Email:1stName.LastName@research.iiit.ac.in                          


发表评论
0评