公开数据集

波斯维基百科数据集,波斯语(波斯语)维基百科语料库 Persian(Farsi) Wikipedia Dataset | دیتاست ویکی پدیا فارسی شامل تمامی مقالات فارسی...NLP,Deep Learning,Text Data,Data Analytics Classification
804.48M 433
Cal多音节语料库 Education,Universities and Colleges,NLP,Text Data,Text Mining,spaCy Classification
15.26M 761
KcBERT训练前语料库(韩国新闻评论) Computer Science,Education,News,NLP,Text Data Classification
11899.2M 514
山区项目论坛 Earth and Nature,Internet,Online Communities,NLP,Transformers Classification
223.86M 345
SCOTUS意见 Earth and Nature,Software,Law,Politics,NLP,Crime,History Classification
716.62M 908
巴黎迪士尼乐园 Facebook评论 Online Communities,Video Games,NLP,Text Data,Languages,Text Mining Classification
35.35M 1081
将这些公司分组 Business,NLP Classification
0.06M 305
新闻媒体的RSS源 News,NLP Classification
0.34M 803
Proza ru中文翻译 Biology,NLP,Text Data Classification
6494.7M 296
Whatsapp状态的情感数据 Email and Messaging,NLP,Deep Learning,spaCy Classification
2.49M 401
Catatan Cakrawala模型 Business,Clothing and Accessories,Image Data,NLP Classification
1595.43M 363
来自维基百科的800万个德语句子 Internet,NLP,Text Data Classification
1099.53M 348
孟加拉报纸数据集 Internet,News,NLP,Text Data,Multiclass Classification Classification
6524.76M 390
商店购物方式Shopgram.io公司 Internet,NLP,Classification,Text Data,Clustering,Multilabel Classification Classification
248.26M 459
美国2020年总统大选演讲 Politics,NLP,Text Data,Text Mining Classification
8.82M 368
GitHub Bugs预测挑战(机器黑客) Computer Science,Programming,NLP,Classification,Deep Learning,Multiclass Classification Classification
298.85M 362
Ar QAG数据集 NLP,Text Data Classification
137.66M 329
员工评价的主题建模 Business,NLP,Ratings and Reviews,Deep Learning,Jobs and Career,Subject Classification
0.61M 338
验证Seti数据集 Business,Arts and Entertainment,Software,NLP,Artificial Intelligence Classification
0.89M 276
朱杜尔阿蒂克尔媒体在线登干标签点击诱饵 Business,Arts and Entertainment,Online Communities,News,NLP Classification
0.22M 298