公开数据集
相关数据分类
10
553
5
6
9
13
19
2
3
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
ATIS数据集清洁重新点燃,ATIS数据集的清理和平衡分割
ATIS DataSetThe ATIS dataset is a standard benchmark dataset widely used as an intent classification and slot filling ta...NLP,Classification,Earth and Nature,Computer Science,Health Classification
1.02M
853
kpe
Dmoztools分类数据, 包含艺术、商业、计算机、游戏、健康、科学购物、社会等
# DatasetThis dataset was created by Patanjali ChintalapatiReleased under Other (specified in description)# ContentsIt c...NLP,Text Mining,Websites Classification
279.6M
709
Patanjali Chintalapati
Machado de Assis的116部小说和其他文本数据
este repositório estão contidas 116 obras de ficção e outros textos de Machado de Assis nos formatos pdf e txt nas c...NLP,Business,Literature,Art,Brazil Classification
40.38M
1214
Luiz Amaral
命名实体识别(NER)从临床提取感兴趣的实体(例如,疾病名称、药物名称
Problem StatementClinical studies often require detailed patients’ information documented in clinical narratives. Named...NLP,Health,Health Conditions,Model Comparison,Statistical Analysis,Artificial Intelligence Classification
249.01M
694
Ramashankar Nayak
CoNLL003 命名实体识别(NER)问题的注释数据集
This is an annotated dataset for Named Entity Recognition (NER) problemContentThis dataset is divided into train.txt, te...NLP,Arts and Entertainment,Computer Science,Text Data,Games,Text Mining Classification
4.63M
850
AlaaKhaled
有毒嵌入物,拼图有毒评论挑战中的通用句子编码文本
There's no need for everyone to encode the same text with the Universal Sentence EmbeddingThis data set contains the...NLP,Deep Learning,Earth and Nature Classification
610.81M
1092
Liling Tan
星际迷航脚本,所有《星际迷航》系列脚本的原始文本脚本和处理行
Star Trek Scripts TextData scraped from data from http://www.chakoteya.net/StarTrek/index.htmlCode here: https://github....NLP,Movies and TV Shows,Text Data,Text Mining Classification
42.63M
611
Gary Broughton
NLP 数据
# DatasetThis dataset was created by AbiyuGReleased under CC BY-NC-SA 4.0# ContentsIt contains the following files:...NLP,Psychology Classification
3.14M
503
AbiyuG
自然语言处理中的情感分析
#数据集此数据集由NowYSM在Database:Open Database,Contents:Database Contents#Contents下创建。它包含以下文件:...NLP,Arts and Entertainment Classification
2.52M
538
NowYSM
俄罗斯电报聊天记录,公开俄罗斯电报聊天中解析的数据
Russian Telegram chats history Data parsed from must popular public Russian Telegram chats...NLP,Text Data,Russia Classification
11.08G
551
Nick
带有偏差数据集的毒性清理版本
cleaned tox bias cleaned up version of toxicity with bias data set...NLP,Data Cleaning,Health Classification
535.39M
1092
Ilya Evenbach
134.5M
996
utsav
Hearthstone Hearthstone卡名称和描述的翻译数据
Translation of Hearthstone card names and descriptions.Languages: German, English, Spanish, French, Italian, Japanese, K...NLP,Arts and Entertainment,Video Games,Games,Comics and Animation,Card Games Classification
54.7M
911
Liling Tan
129M
260
Liling Tan
DARPA TIMIT 声学语音连续语音
#DARPA TIMIT声学语音连续语音语料库-**特别感谢**:**https://github.com/philipperemy/timit/edit/master/README.md**-**下载...NLP,Audio Data Classification
1198.1M
488
Michael Fekadu
性别猜测的推文文件
Online Communities,Social Science,Social Networks,NLP,Binary Classification,Gender Classification
5.74M
849
Andy Harless



















