CN-Celeb: a large-scale speaker recognition dataset collected in the wild
This is a large-scale speaker recognition dataset collected 'in the wild'. The dataset consists of two subsets,...
Common Audio
29.66G
1503
openslr
1.4G
668
Samira klaylat
DARPA TIMIT Acoustic-Phonetic Continuous Speech Corpus
TIMIT (The DARPA TIMIT Acoustic-Phonetic Continuous Speech Corpus) was built by Texas Instruments, the Massachusetts Institute of Technology, and SRI International...
Music Analysis Audio
812.64M
987
robot
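University of Oxford VGG Condensed Movies dataset
Researchers in the University of Oxford's VGG group created the Condensed Movies Dataset (CMD), which consists of key scenes from more than 3K movies: each key scene is accompanied by a high-level...
Music Analysis Audio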
6.24M
836
Max Bain
Free ST American English Corpus
This corpus was recorded in a silent indoor environment using a cellphone. It has 10 speakers. Each speaker has about 350 utterances. All utte...
Music Analysis Audio
351M
786
surfing.ai
Free ST Chinese Mandarin Corpus
This corpus was recorded in a silent indoor environment using a cellphone. It has 855 speakers. Each speaker has 120 utterances. All utteranc...
Music Analysis Audio
8.2G
1043
surfing.ai
Large Javanese ASR training dataset
This data set contains transcribed audio data for Javanese. The data set consists of wave files, and a TSV file. The fil...
Music Analysis Audio
1.1G
794
Google, Inc.
High-quality Gujarati (female) multi-speaker speech dataset
This data set contains transcribed high-quality audio of Gujarati sentences recorded by volunteers. The data set consists...
Music Analysis Audio
917M
718
Google, Inc.
High-quality Gujarati (male) multi-speaker speech dataset
This data set contains transcribed high-quality audio of Gujarati sentences recorded by volunteers. The data set consists...
Music Analysis Audio
825M
727
Google, Inc.
Dataset of recordings by female speakers from the English Midlands
This data set contains transcribed high-quality audio of English sentences recorded by volunteers speaking different dial...
Music Analysis Audio
103M
777
Google, Inc.
Dataset of recordings by Irish male speakers
This data set contains transcribed high-quality audio of English sentences recorded by volunteers speaking different dial...
Music Analysis Audio
164M
702
Google, Inc.
Deeply Korean read speech corpus
About this resource: Recording environment: Studio apartment (moderate reverb), Dance studio (high reverb), Anechoic chambe...
Music Analysis Audio
281M
818
Deeply Inc
Kazakh Speech Corpus (KSC)
A crowdsourced open-source speech corpus for the Kazakh language. The KSC contains around 332 hours of transcribed audio...
Music Analysis Audio
19G
1336
NET
Thorsten Müller (German emotional TTS dataset)
I contribute my personal voice as a person believing in a world where all people are equal. No matter of gender, se...
Music Analysis Audio
399M
723
Thorsten Müller
Hi-Fi Multi-Speaker English TTS Dataset (Hi-Fi TTS)
A multi-speaker English dataset for training text-to-speech models. About this resource: Hi-Fi Multi-Speaker English TTS Dataset (Hi-Fi TTS) is a m...
Music Analysis Audio
41G
774
LibriVox
Deeply Nonverbal Vocalization Dataset
About this resource: Volume (full set): ~0.6 (~57) hours, ~800 (~70,000) utterances, ~500 (~1500) speakers. Format: 16kHz, 16-b...
Music Analysis Audio
43.7M
734
Deeply Inc
LibriSpeech ASR corpus
The LibriSpeech ASR corpus is a speech dataset comprising 1000 hours of English speech with the corresponding text. Identifier: SLR12. Abstract: large-scale (1000 hours) read English...
NLP Audio
8.14G
1303
Vassil Panayotov
Google AudioSet audio dataset
AudioSet contains 632 audio classes and 2,084,320 human-labeled 10-second sound clips (drawn from YouTube videos). The audio ontology...
NLP Audio
2.41G
1690
Google
419.81M
982
University of Pennsylvania
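THUYG-20 Uyghur speech data
Abstract: a free Uyghur speech database released by CSLT@Tsinghua University and Xinjiang University. Category: Speech. License: Apache License v.2.0. Introduction: THUYG-20 is by the speech and...
NLP Audio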
6.12G
1541
Tsinghua University