相关搜索
您是不是在找?
今日排行
本周排行
本月排行
语言生成数据集:2亿个样本,用于语言生成的已处理Amazon Review数据集
Amazon Customer Reviews Dataset is a dataset of user-generated product reviews on the shopping website Amazon. It contai...NLP,Business,Deep Learning,Classification,Artificial Intelligence Classification
20.51G
659
Abhishek Chatterjee
EmojifyData数据集:1800万条英文推文,全部包含表情符号
So, me and my friend was participating IPavlov course on deep learning in NLP. As out final project we want to work on s...NLP,Online Communities,Text Data,Social Networks Classification
2.58G
965
Daniil Larionov
NLP Word2Vec 现有的word2vec嵌入,包括手套和谷歌新闻,用于被训练来重建单词的语言上下文
Word2vec is a group of related models that are used to produce word embeddings. These models are shallow, two-layer neur...NLP,Computer Science Classification
5.89G
602
pkugoodspeed
BioCreativeVI PM跟踪文档分类任务中的训练模型
The trained models in BioCreativeVI-PM-Track Document Triage Task....NLP Classification
2.16G
899
lingluo
Facebook发布的300维预训练FastText英语单词向量
300-dimensional pretrained FastText English word vectors released by Facebook.The first line of the file contains the nu...NLP,Arts and Entertainment,Games Classification
4.52G
575
Vladimir Demidov
用土耳其语编写的数据,可以训练word2vec或n-gram模型
This data contains each document written in Turkish and contains wiki document id. You can train word2vec or n-gram mode...NLP,Text Data,Text Mining Classification
463.02M
890
MustafaKeskin
纽约时报评论,对《纽约时报》发表文章的评论,超过200万条评论
New York Times has a wide audience and plays a prominent role in shaping people's opinion and outlook on current aff...NLP,Computer Science,Programming,News Classification
1.55G
661
Aashita Kesarwani
预测Reddit社区参与度数据集,GDELT帖子分类以及Sirocco文本分析(意见和实体提取)
该数据集包含3个月(2017年6月至8月)的Reddit新闻帖子,以及GDELT帖子分类以及Sirocco文本分析(意见和实体提取)的结果。它用...NLP,Computer Science,Online Communities Classification
174.09M
738
Sergei Sokolenko
亚马逊Alexa的评论
Business,NLP,Deep Learning,Beginner,Naive Bayes Classification
0.49M
356
Manu Siddhartha
Word2vec在维基百科上训练数据(单字母+双字母),以捕捉unigram和bigram
这是一个单词嵌入模型,创建于维基百科+各种来源的评论。与从基于短语的方法(不考虑相邻词的短语/双词上下文)创建双词不同,这...NLP,Computer Science,Software,Programming,Neural Networks Classification
8.62G
669
aintnosunshine
标记为 ML/DL/AI 的中型文章,文章描述、标题、作者和其他元数据
Medium Articles tagged under ML/DL/AI scraped using Beautifulsoup and seleniumContent1.Tag : Tagged under AI/ML or DL2.N...NLP,Education,Online Communities,Artificial Intelligence Classification
55.49K
1005
Sangarshanan
Facebook 发布的300维预训练,在 Common Crawl 上训练的200万个词向量
300-dimensional pretrained FastText English word vectors released by Facebook.The first line of the file contains the nu...NLP,Arts and Entertainment Classification
650M
679
Manish Maharjan
FastText 一个用于学习词嵌入和文本分类的库
fastText is a library for learning of word embeddings and text classification created by Facebook's AI Research (FAI...NLP,Computer Science Classification
6.6G
1273
Jia Yang
SAVEE 数据库 用于情感识别系统的语音情感注释数据
The SAVEE database was recorded from four native English male speakers (identified as DC, JE, JK, KL), postgraduate stud...NLP,Business,Social Science Classification
162.57M
800
Tarun Sunkaraneni



















