公开数据集

专利摘要 Computer Science,Law,NLP,Deep Learning,LSTM,RNN Classification
3.2M 511
arxiv数据集,过去18个月的存档数据集 arxiv dataset arxiv dataset for the past 18 months...NLP Classification
94.28M 1007
亚马逊Alexa评论 NLP Classification
0.49M 978
中国古代文字(文言文) Business,NLP,Text Data,Text Mining Classification
1572.11M 1194
WikiText长期依赖性语言建模数据集 WikiText语言建模数据集是从维基百科上一组经过验证的好文章和特色文章中提取的超过1亿个令牌的集合。与宾夕法尼亚树库(PTB)的...NLP,Deep Learning,Text Data Classification
1.11G 641
经典英语文学语料库与元数据,经典英语书籍及其作者 This is a dataset about classic readings in English, some cases other language translated to English.Dickens, Plato, Sha...NLP,Arts and Entertainment,Literature Classification
431.55M 524
提交100万份 Internet,Online Communities,Social Networks,NLP,Popular Culture Classification
700.01M 352
丰富的数据 NLP,Text Mining Classification
25.32M 384
产品分类 NLP,Multiclass Classification Classification
1.04M 991
贾纳塔哈克:独立日 Earth and Nature,Internet,Education,Sports,NLP,Beginner Classification
451.85M 392
矛盾的是,我亲爱的华生 Computer Science,NLP,Text Data,Beginner Classification
18.02M 447
印尼名字 Earth and Nature,Education,NLP,Deep Learning,Text Data,People,Gender Classification
0.03M 371
神经链接Tweets Business,Online Communities,News,NLP,Artificial Intelligence Classification
1.08M 350
2020年共和党大会演讲 Social Science,Politics,NLP,Languages Classification
0.24M 347
新闻杂志 Education,News,NLP,Classification,Literature Classification
5.12M 367
德国自由职业者文学学士 Business,Internet,NLP,Classification,Exploratory Data Analysis,Data Cleaning,Clustering Classification
5.76M 361
乔·拜登2020年DNC演讲 Politics,NLP Classification
0.02M 392
印尼圣经 Religion and Belief Systems,NLP,Text Data Classification
10.02M 329
亚马逊数据科学书评 Business,NLP,Ratings and Reviews Classification
11.05M 335
字体数据集 NLP,Data Visualization Classification
25.84M 970