公开数据集

波斯维基百科数据集,波斯语(波斯语)维基百科语料库 Persian(Farsi) Wikipedia Dataset | دیتاست ویکی پدیا فارسی شامل تمامی مقالات فارسی...NLP,Deep Learning,Text Data,Data Analytics Classification
804.48M 538
Cal多音节语料库 Education,Universities and Colleges,NLP,Text Data,Text Mining,spaCy Classification
15.26M 948
KcBERT训练前语料库(韩国新闻评论) Computer Science,Education,News,NLP,Text Data Classification
11899.2M 650
山区项目论坛 Earth and Nature,Internet,Online Communities,NLP,Transformers Classification
223.86M 430
SCOTUS意见 Earth and Nature,Software,Law,Politics,NLP,Crime,History Classification
716.62M 1058
巴黎迪士尼乐园 Facebook评论 Online Communities,Video Games,NLP,Text Data,Languages,Text Mining Classification
35.35M 1298
将这些公司分组 Business,NLP Classification
0.06M 323
新闻媒体的RSS源 News,NLP Classification
0.34M 919
Proza ru中文翻译 Biology,NLP,Text Data Classification
6494.7M 339
Whatsapp状态的情感数据 Email and Messaging,NLP,Deep Learning,spaCy Classification
2.49M 431
Catatan Cakrawala模型 Business,Clothing and Accessories,Image Data,NLP Classification
1595.43M 441
来自维基百科的800万个德语句子 Internet,NLP,Text Data Classification
1099.53M 433
孟加拉报纸数据集 Internet,News,NLP,Text Data,Multiclass Classification Classification
6524.76M 508
商店购物方式Shopgram.io公司 Internet,NLP,Classification,Text Data,Clustering,Multilabel Classification Classification
248.26M 541
美国2020年总统大选演讲 Politics,NLP,Text Data,Text Mining Classification
8.82M 398
GitHub Bugs预测挑战(机器黑客) Computer Science,Programming,NLP,Classification,Deep Learning,Multiclass Classification Classification
298.85M 457
Ar QAG数据集 NLP,Text Data Classification
137.66M 348
员工评价的主题建模 Business,NLP,Ratings and Reviews,Deep Learning,Jobs and Career,Subject Classification
0.61M 429
验证Seti数据集 Business,Arts and Entertainment,Software,NLP,Artificial Intelligence Classification
0.89M 351
朱杜尔阿蒂克尔媒体在线登干标签点击诱饵 Business,Arts and Entertainment,Online Communities,News,NLP Classification
0.22M 315