Select Language

AI社区

公开数据集

CN-Celeb 一个室外收集的大规模说话人识别数据集 This is a large-scale speaker recognition dataset collected 'in the wild'. The dataset consists of two subsets,...Common Audio
29.66G 1121
阿拉伯自然音频数据集 这是第一个用于识别3种离散情感的阿拉伯自然音频数据集(ANAD):快乐,愤怒和惊讶。从在线阿拉伯脱口秀节目中下载了演播室外一...Music Analysis Audio
1.4G 343
DARPA TIMIT声学语音连续语音语料库 TIMIT(英语:The DARPA TIMIT Acoustic-Phonetic Continuous Speech Corpus),是由德州仪器、麻省理工学院和SRI International...Music Analysis Audio
812.64M 499
牛津大学 VGG Condensed Movies 数据集 牛津大学 VGG 组学者创建了 Condensed Movies 数据集(CMD),由 3K 多部电影中的关键场景组成:每个关键场景都附有场景的高级语...Music Analysis Audio
6.24M 504
免费 ST 美国英语语料库 Thiscorpuswererecordedinsilencein-doorenvironmentusingcellphone.Ithas10speakers.Eachspeakerhasabout350utterances.Allutte...Music Analysis Audio
351M 490
免费 ST 中文普通话语料库 Thiscorpuswererecordedinsilencein-doorenvironmentusingcellphone.Ithas855speakers.Eachspeakerhas120utterances.Allutteranc...Music Analysis Audio
8.2G 528
大型爪哇 ASR 训练数据集 This data set contains transcribed audio data for Javanese. The data set consists of wave files, and a TSV file. The fil...Music Analysis Audio
1.1G 455
高质量古吉拉特语(女性)多说话者语音数据集 This data set contains transcribed high-quality audio of Gujarati sentencesrecorded by volunteers. The data set consists...Music Analysis Audio
917M 411
高质量古吉拉特语(男性)多说话者语音数据集 This data set contains transcribed high-quality audio of Gujarati sentencesrecorded by volunteers. The data set consists...Music Analysis Audio
825M 397
英国中部女性录音数据集 This data set contains transcribed high-quality audio of English sentencesrecorded by volunteers speaking different dial...Music Analysis Audio
103M 407
爱尔兰男性录音的数据集 This data set contains transcribed high-quality audio of English sentencesrecorded by volunteers speaking different dial...Music Analysis Audio
164M 404
Deeply Korean read speech corpus 深度韩语阅读语料库 about this resource:Recording environment: Studio apartment(moderate reverb), Dance studio(high reverb), Anechoic chambe...Music Analysis Audio
281M 352
Kazakh Speech Corpus (KSC) 哈萨克语语料库(KSC) A crowdsourced open-source speech corpus for the Kazakh language. The KSC contains around 332 hoursof transcribed audio...Music Analysis Audio
19G 895
Thorsten Müller(德国情感-TTS 数据集) I contribute my personal voice as a person believing in a world where all people are equal. No matter of gender, se...Music Analysis Audio
399M 389
Hi-Fi 多​​扬声器英语 TTS 数据集 (Hi-Fi TTS) 用于训练文本到语音模型的多说话者英语数据集about this resource:Hi-Fi Multi-Speaker English TTS Dataset (Hi-Fi TTS) is a m...Music Analysis Audio
41G 441
Nonverbal Vocalization Dataset 深度非言语发声数据集 about this resource:Volume(full set): ~0.6(~57) hours, ~800(~70,000) utterances, ~500(~1500) speakersFormat: 16kHz, 16-b...Music Analysis Audio
43.7M 419
LibriSpeech ASR corpus 语音数据 LibriSpeech ASR corpus 是一个语音数据,包括 1000小时 的英文发音和对应文字。标识符:SLR12摘要:大规模(1000小时)阅读英语...NLP Audio
8.14G 768
Google Audioset 音频数据集 AudioSet 包含了 632 类的音频类别以及 2084320 条人工标记的每段 10 秒长度的声音剪辑片段(片段来自 YouTube 视频)。音频本体...NLP Audio
2.41G 1215
TIMIT语音识别数据 TIMIT语音读取语料库旨在为声学语音研究以及自动语音识别系统的开发和评估提供语音数据。TIMIT包含由八种主要美国英语方言组成的...NLP Audio
419.81M 617
THUYG-20 维吾尔语语音数据 摘要:免费的维吾尔语言数据库由CSLT @清华大学和新疆大学发布类别:演讲许可证:Apache License v.2.0介绍THUGY20是由语音和语...NLP Audio
6.12G 917