公开数据集
相关数据分类
10
553
5
6
9
13
19
2
3
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
CC-100 卡纳达语单语言数据集:来自Web爬网数据的1300万条单语言数据集
This monolingual dataset includes roughly 13 million uncleaned Kannada sentences crawled from numerous websites....NLP,Text Data,Languages Classification
3.51G
411
Darshan
文本中的情感,句子中表达主要情感的文本数据
I was looking for a well labeled dataset to perform a multiclass classification. I wanted to do something more than just...NLP,Earth and Nature,Text Data,Multiclass Classification Classification
2.15M
457
Ishant
OZON产品类别
Business,NLP,Text Data,Multiclass Classification,Marketing Classification
181.16M
320
Andrew Bezborodov
来自AskUbuntu的意图识别聊天机器人语料库
Context190 questions and answers from https://askubuntu.com. ContentWhat's inside is more than just rows and columns...NLP,Artificial Intelligence Classification
0.23M
921
Elvin Aghammadzada
带有语言标签的文本数据。它可以用于语言检测。
Language Detection Dataset Text data with language labels. It can be used for language detection....NLP,Classification,Computer Science,Multiclass Classification,Languages Classification
31.7M
936
Ishant
测试用例数据集,软件测试中使用的样本数据集的集合
There are lots of datasets available for different machine learning tasks like NLP, Computer vision etc. However I could...NLP,Deep Learning,Earth and Nature Classification
1.3M
615
sapal6
NLP:报告和新闻分类
Social Science,Investing,NLP,Literature,Environment,Binary Classification,Multilabel Classification,Water Bodies Classification
0.03M
343
Vitalii Mokin
科研论文主题建模
Business,Earth and Nature,Education,NLP,Psychology Classification
21.96M
384
Abishek Sudarshan
COVID19相关常见问题,此数据包含与新冠肺炎相关的问答集19
What is this?This data contains collection of question and answers related to COVID19.Where does this come from?Thi...NLP,Health,Coronavirus,Psychology,Diseases Classification
0.1M
440
Deepan.N
GENIA生物医学事件数据集
ContextBio-medical texts have a lot of information which can be used for developments in the medical field. Traditionall...NLP,Biology,Text Mining,Medicine Classification
2.67M
755
Nishanth
Tanglish情绪分析推文,使用了4个标签来描述推特的情绪
So it all started when I was looking for Abusive Tamil tweets in the Roman Script to use for a project and instead of fi...NLP,Deep Learning,Online Communities,People Classification
0.85M
451
vyom bhatia



















