公开数据集

相关搜索
您是不是在找?
今日排行
本周排行
本月排行
冠状病毒(推特数据) Health,NLP,Classification,Deep Learning Classification
4.18M 526
冠状病毒(covid19)推特 4月下旬 Internet,Online Communities,Email and Messaging,Coronavirus,NLP,Diseases Classification
2087.86M 391
冠状病毒(covid19)推特 4月初 Internet,Online Communities,Email and Messaging,Coronavirus,NLP,Diseases Classification
2977.65M 850
hck ml酒店 NLP,Text Mining Classification
25.45M 692
冠状病毒英国报纸 Internet,Health,News,Biology,NLP,Healthcare Classification
0.25M 450
COVID 19开放研究数据集句子聚类 Coronavirus,NLP,Drugs and Medications,Clustering Classification
258.38M 866
CORD 19知识图 Earth and Nature,Internet,Education,Biology,Coronavirus,NLP Classification
5963.63M 360
COVID 19文章 Education,Coronavirus,NLP Classification
79.6M 334
用于语音克隆的英语多说话人语料库 CSTR-VCTK语料库 This CSTR VCTK Corpus includes speech data uttered by 109 native speakers of English with various accents. Each speaker...NLP,Audio Data Classification
15.22G 585
Bracia Figo Fagot analiza sentymentu餐厅 Arts and Entertainment,NLP Classification
0.09M 379
语言生成数据集:2亿个样本,用于语言生成的已处理Amazon Review数据集 Amazon Customer Reviews Dataset is a dataset of user-generated product reviews on the shopping website Amazon. It contai...NLP,Business,Deep Learning,Classification,Artificial Intelligence Classification
20.51G 582
EmojifyData数据集:1800万条英文推文,全部包含表情符号 So, me and my friend was participating IPavlov course on deep learning in NLP. As out final project we want to work on s...NLP,Online Communities,Text Data,Social Networks Classification
2.58G 857
CONLL2003杂项词重新标记 Earth and Nature,NLP,Text Data,Text Mining,spaCy Classification
0.01M 432
NLP Word2Vec 现有的word2vec嵌入,包括手套和谷歌新闻,用于被训练来重建单词的语言上下文 Word2vec is a group of related models that are used to produce word embeddings. These models are shallow, two-layer neur...NLP,Computer Science Classification
5.89G 518
BioCreativeVI PM跟踪文档分类任务中的训练模型 The trained models in BioCreativeVI-PM-Track Document Triage Task....NLP Classification
2.16G 784
Facebook发布的300维预训练FastText英语单词向量 300-dimensional pretrained FastText English word vectors released by Facebook.The first line of the file contains the nu...NLP,Arts and Entertainment,Games Classification
4.52G 508
用土耳其语编写的数据,可以训练word2vec或n-gram模型 This data contains each document written in Turkish and contains wiki document id. You can train word2vec or n-gram mode...NLP,Text Data,Text Mining Classification
463.02M 777
来自CNTK女士的ATIS Business,Earth and Nature,NLP Classification
2.35M 838
纽约时报评论,对《纽约时报》发表文章的评论,超过200万条评论 New York Times has a wide audience and plays a prominent role in shaping people's opinion and outlook on current aff...NLP,Computer Science,Programming,News Classification
1.55G 587
预测Reddit社区参与度数据集,GDELT帖子分类以及Sirocco文本分析(意见和实体提取) 该数据集包含3个月(2017年6月至8月)的Reddit新闻帖子,以及GDELT帖子分类以及Sirocco文本分析(意见和实体提取)的结果。它用...NLP,Computer Science,Online Communities Classification
174.09M 649