公开数据集

专利摘要 Computer Science,Law,NLP,Deep Learning,LSTM,RNN Classification
3.2M 485
arxiv数据集,过去18个月的存档数据集 arxiv dataset arxiv dataset for the past 18 months...NLP Classification
94.28M 982
亚马逊Alexa评论 NLP Classification
0.49M 950
中国古代文字(文言文) Business,NLP,Text Data,Text Mining Classification
1572.11M 1158
WikiText长期依赖性语言建模数据集 WikiText语言建模数据集是从维基百科上一组经过验证的好文章和特色文章中提取的超过1亿个令牌的集合。与宾夕法尼亚树库(PTB)的...NLP,Deep Learning,Text Data Classification
1.11G 620
经典英语文学语料库与元数据,经典英语书籍及其作者 This is a dataset about classic readings in English, some cases other language translated to English.Dickens, Plato, Sha...NLP,Arts and Entertainment,Literature Classification
431.55M 509
提交100万份 Internet,Online Communities,Social Networks,NLP,Popular Culture Classification
700.01M 346
丰富的数据 NLP,Text Mining Classification
25.32M 380
产品分类 NLP,Multiclass Classification Classification
1.04M 965
贾纳塔哈克:独立日 Earth and Nature,Internet,Education,Sports,NLP,Beginner Classification
451.85M 381
矛盾的是,我亲爱的华生 Computer Science,NLP,Text Data,Beginner Classification
18.02M 425
印尼名字 Earth and Nature,Education,NLP,Deep Learning,Text Data,People,Gender Classification
0.03M 361
神经链接Tweets Business,Online Communities,News,NLP,Artificial Intelligence Classification
1.08M 342
2020年共和党大会演讲 Social Science,Politics,NLP,Languages Classification
0.24M 333
新闻杂志 Education,News,NLP,Classification,Literature Classification
5.12M 362
德国自由职业者文学学士 Business,Internet,NLP,Classification,Exploratory Data Analysis,Data Cleaning,Clustering Classification
5.76M 354
乔·拜登2020年DNC演讲 Politics,NLP Classification
0.02M 378
印尼圣经 Religion and Belief Systems,NLP,Text Data Classification
10.02M 324
亚马逊数据科学书评 Business,NLP,Ratings and Reviews Classification
11.05M 327
字体数据集 NLP,Data Visualization Classification
25.84M 937