Select Language

公开数据集

Kannada-MNIST

Kannada-MNIST

Scene:

MNIST

Data Type:

Classification
所需积分:10 去赚积分?
  • 298浏览
  • 2下载
  • 0点赞
  • 收藏
  • 分享

贡献者查看主页

小小程序员

致力于人工智能业务的研究、数据集处理。

Data Preview ? 64.19M

    Data Structure ?

    *数据结构实际以真实数据为准

    Here, we disseminate a new handwritten digits-dataset, termed Kannada-MNIST, for the Kannada script, that can potentially serve as a direct drop-in replacement for the original MNIST dataset.

    Data Collection

    This dataset is based off of the efforts of 65 volunteers from Bangalore, India, who are native speakers and users of the Kannada language and the script. This was curated to serve as a direct one-to-one drop-in replacement for the original MNIST dataset (akin to Fashion-MNIST and K-MNIST datasets).

    65 volunteers were recruited in Bangalore, India, who were native speakers of the language as well as day-to-day users of the numeral script. Each volunteer filled out an A3 sheet containing a 32 × 40 grid. This yielded filled-out A3 sheets containing 128 instances of each number which we posit is large enough to capture most of the natural intra-volunteer variations of the glyph shapes. All of the sheets thus collected were scanned at 600 dots-per-inch resolution using the Konica Accurio-Press-C6085 scanner that yielded 65 4963 × 3509 png images.

    Data Format

    The main Kannada-MNIST dataset that consists of a training set of 60000 28 × 28 gray-scale sample images.

    Citation

    Please use the following citation when referencing the dataset:

    @article{prabhu2019kannada,
      title={Kannada-MNIST: A new handwritten digits dataset for the Kannada language},
      author={Prabhu, Vinay Uday},
      journal={arXiv preprint arXiv:1908.01242},
      year={2019}
    }


    0相关评论
    ×

    帕依提提提温馨提示

    该数据集正在整理中,为您准备了其他渠道,请您使用

    注:部分数据正在处理中,未能直接提供下载,还请大家理解和支持。