Select Language

AI社区

公开数据集

德国新闻数据集 Computer Science,Internet,Education,Software,News,NLP Classification
726.72M 135
纯文本维基百科,每个文件都包含维基百科文章的集合 Wikipedia dumps contain a tremendous amount of markup. WikiMedia Text is a hybrid of markdown and HTML, making it very d...NLP,Computer Science,Text Data,Text Mining Classification
23.71G 153
WebMD药物评论数据集,各种药物的用户评论数据集 The dataset provides user reviews on specific drugs along with related conditions, side effects, age, sex, and ratings r...NLP,Computer Science,Education,Tabular Data,Drugs and Medications Classification
168.58M 164
一个数据集,包含带有条件的评论中的标记和未标记的句子 This dataset was created during my PhD (http://www.tdg-seville.info/fogallego/Personal%20Info) at the University of Sevi...NLP,Text Data,Universities and Colleges,Ratings and Reviews Classification
794.68M 254
绕口令数据集,带绕口令的数据集(英文) This is a dataset consisting of tongue twisters (in English), mostly from Web Scraping.This dataset contains about 600 s...NLP,TensorFlow,Languages Classification
0.16M 155
一百万条新闻标题 Format: CSV ; Single Filepublish_date: Date of publishing for the article in yyyyMMdd formatheadline_text: Text of the h...NLP,News Classification
57.43M 189
曼基巴特 Business,News,Government,Politics,NLP,Psychology,India,Languages Classification
1.44M 149
研究文章数据集 Earth and Nature,NLP,Research Classification
31.44M 145
印度尼西亚普伊斯 NLP,Literature,Text Data,Art,Languages Classification
10.07M 132
印尼潘屯 NLP,Literature,Text Data,Art,Languages Classification
0.06M 124
灾难微博 NLP,Text Data,Binary Classification Classification
1.54M 190
知名品牌口号与风险评估 NLP,Text Data,E-Commerce Services,Text Mining,Marketing Classification
0.17M 205
人类mRNA序列数据(仅种子) human mrna sequence (seed only)...NLP,Biology Classification
2.09G 136
IITJEE NEET AIIMS学生提问数据 In India, every year lacs of students sit for competitive examinations like JEE Advanced, JEE Mains, NEET, etc. These ex...NLP,Classification,Education,Standardized Testing,Universities and Colleges,Multiclass Classification Classification
28.51M 167
问题章节分类,你能把问题分成正确的章节吗? 在印度,每年都有少数学生参加竞争性考试,如JEE Advanced、JEE Mains、NEET等。据说,这些考试是进入印度一流学院(如IIT、NIT...NLP,Classification,Text Data,Multiclass Classification,NLTK Classification
36.2M 251
推特分类拥抱脸 Computer Science,NLP,Classification,Text Data,Beginner,PyTorch Classification
0.02M 138
德雷克歌词 Arts and Entertainment,Music,NLP,Text Data,Popular Culture Classification
2.33M 132
航空公司审查数据进行情绪分析 NLP,Deep Learning,Time Series Analysis,Text Mining,Data Analytics Classification
3.26M 119
印度新闻数据集,包含了《印度时报》发布的大约360万个事件 This news dataset is a persistent historical archive of noteable events in the Indian subcontinent from start-2001 to q1...NLP,Arts and Entertainment,News,Cities and Urban Areas Classification
226.84M 187
这是有争议的 Online Communities,News,Social Networks,NLP,Text Data Classification
0.58M 171