公开数据集

波斯维基百科数据集,波斯语(波斯语)维基百科语料库 Persian(Farsi) Wikipedia Dataset | دیتاست ویکی پدیا فارسی شامل تمامی مقالات فارسی...NLP,Deep Learning,Text Data,Data Analytics Classification
804.48M 510
Cal多音节语料库 Education,Universities and Colleges,NLP,Text Data,Text Mining,spaCy Classification
15.26M 874
KcBERT训练前语料库(韩国新闻评论) Computer Science,Education,News,NLP,Text Data Classification
11899.2M 620
山区项目论坛 Earth and Nature,Internet,Online Communities,NLP,Transformers Classification
223.86M 376
SCOTUS意见 Earth and Nature,Software,Law,Politics,NLP,Crime,History Classification
716.62M 1026
巴黎迪士尼乐园 Facebook评论 Online Communities,Video Games,NLP,Text Data,Languages,Text Mining Classification
35.35M 1207
将这些公司分组 Business,NLP Classification
0.06M 318
新闻媒体的RSS源 News,NLP Classification
0.34M 892
Proza ru中文翻译 Biology,NLP,Text Data Classification
6494.7M 310
Whatsapp状态的情感数据 Email and Messaging,NLP,Deep Learning,spaCy Classification
2.49M 427
Catatan Cakrawala模型 Business,Clothing and Accessories,Image Data,NLP Classification
1595.43M 392
来自维基百科的800万个德语句子 Internet,NLP,Text Data Classification
1099.53M 376
孟加拉报纸数据集 Internet,News,NLP,Text Data,Multiclass Classification Classification
6524.76M 438
商店购物方式Shopgram.io公司 Internet,NLP,Classification,Text Data,Clustering,Multilabel Classification Classification
248.26M 485
美国2020年总统大选演讲 Politics,NLP,Text Data,Text Mining Classification
8.82M 391
GitHub Bugs预测挑战(机器黑客) Computer Science,Programming,NLP,Classification,Deep Learning,Multiclass Classification Classification
298.85M 393
Ar QAG数据集 NLP,Text Data Classification
137.66M 345
员工评价的主题建模 Business,NLP,Ratings and Reviews,Deep Learning,Jobs and Career,Subject Classification
0.61M 366
验证Seti数据集 Business,Arts and Entertainment,Software,NLP,Artificial Intelligence Classification
0.89M 298
朱杜尔阿蒂克尔媒体在线登干标签点击诱饵 Business,Arts and Entertainment,Online Communities,News,NLP Classification
0.22M 315