Public Datasets

Donald Trump's 42-Minute Rant | Text Generation Business,Online Communities,Politics,NLP Classification
0.02M 415
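Ten Thousand German News Articles Dataset (10kGNAD), based on the One Million Posts Corpus. The 10kGNAD dataset is intended to solve part of the problem as the first German topic classification dataset. It consists of 10273 German-language news articles from an Austrian online newspaper, divided into 9 topics... NLP,Classification,Computer Science,Programming,News,Social Science Classification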
51.81M 495
Ecuador (Nationwide) Tweets Internet,Social Networks,NLP,Demographics,People and Society Classification
32.59M 365
Tweet Sentiment Extraction Extended Internet,Online Communities,NLP,Text Data,Text Mining Classification
15.9M 407
Erowid Experience Reports word2vec Vectors Online Communities,NLP,Psychology,Clustering Classification
19.95M 351
Shakespeare Plays Software,NLP,Text Data,Text Mining,Statistical Analysis Classification
4.78M 401
Word Cloud NLP,Text Data,Data Visualization,Languages Classification
2.53M 875
COVID-19 Open Research Dataset Challenge (CORD-19) Business,Computer Science,Biology,Coronavirus,NLP,Public Health Classification
27148.7M 361
Medal Dataset Computer Science,NLP,Deep Learning,Healthcare,Artificial Intelligence,Transformers Classification
20085.3M 335
Sentiment Analysis of Tunisian Comments NLP,Text Data,Text Mining,spaCy,NLTK Classification
6.17M 300
BERT English Uncased Unigrams Music,NLP Classification
94.36M 309
Nepali Language NLP,Artificial Intelligence Classification
218M 311
Fake News News,NLP,Classification Classification
94.06M 739
Goodreads' Best Books Ever Image Data,NLP,Literature,Tabular Data Classification
2263.86M 374
Scientific Pretrained Models Earth and Nature,Health,NLP,Text Data,Healthcare,spaCy Classification
586.45M 392
UCI News Aggregator Dataset with Content Earth and Nature,Education,Online Communities,News,NLP,Linguistics Classification
16.11M 497
The GDELT Project Business,Arts and Entertainment,Social Science,NLP,Languages,Bigquery Classification
0M 375
BERT Pretrained Arts and Entertainment,NLP,Text Data Classification
4477.46M 720
Sports Articles for Objectivity Analysis NLP,Classification,Deep Learning Classification
6.95M 445
Wiki 2 NLP Classification
98.32M 292