Public Datasets

Persian (Farsi) Wikipedia Dataset | Persian Wikipedia dataset including all Persian articles... NLP,Deep Learning,Text Data,Data Analytics Classification
804.48M 464
Cal Polysyllabic Corpus Education,Universities and Colleges,NLP,Text Data,Text Mining,spaCy Classification
15.26M 813
KcBERT Pre-Training Corpus (Korean News Comments) Computer Science,Education,News,NLP,Text Data Classification
11899.2M 568
Mountain Project Forums Earth and Nature,Internet,Online Communities,NLP,Transformers Classification
223.86M 356
SCOTUS Opinions Earth and Nature,Software,Law,Politics,NLP,Crime,History Classification
716.62M 960
Disneyland Paris Facebook Comments Online Communities,Video Games,NLP,Text Data,Languages,Text Mining Classification
35.35M 1144
Group These Companies Business,NLP Classification
0.06M 310
RSS Feeds of News Media News,NLP Classification
0.34M 855
Proza ru Chinese Translation Biology,NLP,Text Data Classification
6494.7M 300
Emotion Data for WhatsApp Statuses Email and Messaging,NLP,Deep Learning,spaCy Classification
2.49M 416
Catatan Cakrawala Model Business,Clothing and Accessories,Image Data,NLP Classification
1595.43M 373
8 Million German Sentences from Wikipedia Internet,NLP,Text Data Classification
1099.53M 360
Bengali Newspaper Dataset Internet,News,NLP,Text Data,Multiclass Classification Classification
6524.76M 405
Shopgram.io Store Shopping Methods Internet,NLP,Classification,Text Data,Clustering,Multilabel Classification Classification
248.26M 476
US 2020 Presidential Election Speeches Politics,NLP,Text Data,Text Mining Classification
8.82M 376
GitHub Bugs Prediction Challenge (MachineHack) Computer Science,Programming,NLP,Classification,Deep Learning,Multiclass Classification Classification
298.85M 371
Ar QAG Dataset NLP,Text Data Classification
137.66M 335
Topic Modeling of Employee Reviews Business,NLP,Ratings and Reviews,Deep Learning,Jobs and Career,Subject Classification
0.61M 346
Seti Validation Dataset Business,Arts and Entertainment,Software,NLP,Artificial Intelligence Classification
0.89M 291
Judul Artikel Media Online dengan Label Clickbait Business,Arts and Entertainment,Online Communities,News,NLP Classification
0.22M 307