相关搜索
您是不是在找?
今日排行
本周排行
本月排行
万篇德国新闻文章数据集,10kGNAD基于一百万篇文章语料库
10kGNAD数据集旨在作为第一个德国主题分类数据集解决部分问题。它由一家奥地利在线报纸的10273篇德语新闻文章组成,分为9个主题...NLP,Classification,Computer Science,Programming,News,Social Science Classification
51.81M
391
Timo Block
唐纳德·特朗普(Donald Trump)42分钟的咆哮|文本生成
Business,Online Communities,Politics,NLP Classification
0.02M
346
vyom bhatia
IITJEE NEET AIIMS学生提问数据
In India, every year lacs of students sit for competitive examinations like JEE Advanced, JEE Mains, NEET, etc. These ex...NLP,Classification,Education,Standardized Testing,Universities and Colleges,Multiclass Classification Classification
28.51M
398
Ultron
2.09G
247
Display Name Unavailable
来自202个Stackexchange站点的标记集合
This data is extracted from StackExchange for over 200+ websites under the Umbrella. This data consists of all possible...NLP,Business,Online Communities,Text Data Classification
16.75M
336
Shiv Kumar Ganesh
来自AskUbuntu的意图识别聊天机器人语料库
Context190 questions and answers from https://askubuntu.com. ContentWhat's inside is more than just rows and columns...NLP,Artificial Intelligence Classification
0.23M
703
Elvin Aghammadzada
Steam官方网站的大约 80000 款游戏数据集
这是一个数据集,包含任何可抓取的信息,关于来自 Steam 官方网站的大约 80000 款游戏。大多数列包含有价值的信息,可以让您更好...Video Games,Games Classification
98.8M
1039
Deepan.N
GENIA生物医学事件数据集
ContextBio-medical texts have a lot of information which can be used for developments in the medical field. Traditionall...NLP,Biology,Text Mining,Medicine Classification
2.67M
663
Nishanth
Tanglish情绪分析推文,使用了4个标签来描述推特的情绪
So it all started when I was looking for Abusive Tamil tweets in the Roman Script to use for a project and instead of fi...NLP,Deep Learning,Online Communities,People Classification
0.85M
375
vyom bhatia
用户评级为10M的Goodreads图书数据集
Arts and Entertainment,Social Science,NLP,Literature,Recommender Systems Classification
1128.5M
734
Bahram Jannesar
Septuagint
Earth and Nature,Religion and Belief Systems,NLP,Text Data,Languages Classification
7.39M
315
Abbrivia
来自wallstreetbets等的Subreddit数据,用于后验量化交易算法的情绪分析
All of the submissions to each of the r/wallstreetbets, r/investing, r/options, and r/SecurityAnalysis subreddits since...NLP,Online Communities,Investing Classification
1.49G
339
Sheridan Green
日语-英语字幕语料库(JESC)[CLEANED],由280万个句子组成的大型语料库
This dataset is cleaned version of JESC by handling misplelled English words and doing word segmentation using:English=...NLP,Business,Computer Science,Languages Classification
220.08M
382
Wahyu Setianto
Stackoverflow问题分类挑战
ContextAsking questions is a part of learning. There's no shame in not knowing something and coming to others for he...NLP Classification
6.37M
774
Nasser Boan
Zeki MFC;任15E;ark131;SF6;zleri |歌词
Music,NLP,Artificial Intelligence,LSTM Classification
0.33M
298
ferhatmetin34
IMBD情绪分类数据集,用spacy标记并以JSON格式存储
ContextIMDB sentiment classification dataset from derived from torchtext, tokenized using spacy and then stored as JSON...NLP,Beginner,Earth and Nature,Movies and TV Shows,Text Data,Binary Classification,spaCy Classification
104.31M
327
Manoj Patra