公开数据集
相关数据分类
10
553
5
6
9
13
19
2
3
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
TED终极数据集
NLP,Classification,Text Data,Recommender Systems Classification
544.94M
411
Miguel Corral Jr
Reddit印度NLP数据集,数据集包括2017-2020年从R/India子版块的帖子
[](https://www.python.org/) [!...NLP,Classification,Multiclass Classification,India Classification
117.86M
444
Pranav Hari
JigSaw有毒评论分类清理数据,竖锯评论,带感情,评论长度和翻译文本
I've been working on the JigSaw Multilingual Toxic Comment classification competition and found that the data requir...NLP,Deep Learning,Feature Engineering,Text Data Classification
263.44M
614
Sleeba Paul
乌克兰语词汇描述
Earth and Nature,Education,NLP,Classification,Text Data Classification
0.04M
940
Yaroslav Isaienkov
OSCAR尼泊尔语语料库,尼泊尔语文本语料库,用于训练NLP的无监督语言模型
The files are from [OSCAR Corpus](https://oscar-corpus.com/). Please visit their site for more information.The dataset i...NLP,Computer Science,Movies and TV Shows,Text Data,Languages Classification
3.1G
538
Prabesh Dhakal
零售交易[于2020年7月17日发布]
Online Communities,Retail and Shopping,NLP,Data Visualization,Tabular Data,Data Cleaning Classification
1.3M
367
Jahnic Beck-Joseph
推特预测灾难
NLP,Classification,Text Data,Geospatial Analysis,Binary Classification Classification
1.34M
902
Ghanender Pahuja
CORD-19完整索引,在完整CORD-19数据集上嵌入索引
Sentence embeddings index over full CORD-19 dataset. Includes both COVID-19 and non-COVID-19 tagged literature on Corona...NLP,Computer Science,Coronavirus Classification
7.61G
291
David Mezzetti
塞思·戈丁的博客数据集
Business,Internet,Online Communities,NLP,Literature,Text Data,Text Mining,Marketing Classification
16.49M
381
Roman Glushko



















