HI-MIA: Fixed Wake-up Word Database

Scene: Music Analysis

Data Type: Audio

Contributor: 小小程序员

Dedicated to research on artificial intelligence applications and to dataset processing.

Data Preview (45.8 GB)

    The data was used in the AISHELL Speaker Verification Challenge 2019. It is extracted from a larger database called AISHELL-WakeUp-1.

    The recordings contain the wake-up words "Hi, Mia" in both Chinese and English. The data was collected in real home environments using microphone arrays and a Hi-Fi microphone. The collection process and the development of a baseline system are described in the paper cited below. The data used in the challenge was recorded with one Hi-Fi microphone and with 16-channel circular microphone arrays at distances of 1, 3, and 5 meters, and its contents are the Chinese wake-up words. The whole set is divided into train (254 speakers), dev (42 speakers), and test (44 speakers) subsets. The test subset is provided with paired target/non-target answers for evaluating verification results.
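As an illustration of how the target/non-target answers might be used, the sketch below computes an equal error rate (EER), a common summary metric for speaker verification. It is a minimal example under stated assumptions: the listing does not name the scoring metric or the trial file format, so the function name `equal_error_rate`, the 1/0 label encoding, and the toy scores are all hypothetical.

```python
def equal_error_rate(scores, labels):
    """Return an approximate EER for a list of verification trials.

    scores: similarity scores; higher means "more likely the same speaker".
    labels: 1 for a target trial, 0 for a non-target trial (assumed encoding).
    """
    targets = [s for s, y in zip(scores, labels) if y == 1]
    nontargets = [s for s, y in zip(scores, labels) if y == 0]
    if not targets or not nontargets:
        raise ValueError("need at least one target and one non-target trial")

    best_gap, eer = None, None
    # Try every observed score as the accept threshold (accept if score >= t).
    for t in sorted(set(scores)):
        far = sum(s >= t for s in nontargets) / len(nontargets)  # false accepts
        frr = sum(s < t for s in targets) / len(targets)         # false rejects
        gap = abs(far - frr)
        # Keep the threshold where the two error rates are closest.
        if best_gap is None or gap < best_gap:
            best_gap, eer = gap, (far + frr) / 2
    return eer


# Toy trial list (made-up scores, not taken from HI-MIA).
toy_scores = [0.9, 0.8, 0.7, 0.6, 0.4, 0.3, 0.2, 0.1]
toy_labels = [1, 1, 0, 1, 0, 1, 0, 0]
print(equal_error_rate(toy_scores, toy_labels))  # 0.25
```

The threshold sweep is quadratic in the number of trials, which is fine for a small illustration; a real evaluation over many trials would typically sort the scores once or use an existing ROC routine.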

    You can cite the data using the following BibTeX entry:

    @misc{himia,
        title={HI-MIA : A Far-field Text-Dependent Speaker Verification Database and the baselines},
        author={Xiaoyi Qin and Hui Bu and Ming Li},
        year={2019},
        eprint={1912.01231},
        archivePrefix={arXiv},
        primaryClass={cs.SD}
    }

