公开数据集
数据结构 ?
1.24G
README.md
Reddit submissions from finance/investment/stock related posts:
r/wallstreetbets: #775326 (2021-01-01 00:02:06 - 2021-12-31 23:55:51)
r/gme: #273327 (2021-01-01 04:08:51 - 2021-12-31 23:59:44)
r/personalfinance: #131181 (2021-01-24 19:30:31 - 2021-12-31 23:55:49)
r/stocks: #75857 (2021-01-01 00:05:17 - 2021-12-31 22:34:41)
r/pennystocks: #54785 (2021-01-01 00:13:41 - 2021-12-31 23:30:46)
r/stockmarket: #43809 (2021-01-01 02:42:42 - 2021-12-31 23:48:27)
r/investing: #41912 (2021-01-01 00:18:40 - 2021-12-31 23:37:54)
r/options: #28782 (2021-01-01 01:39:43 - 2021-12-31 23:38:00)
r/robinhoodpennystocks: #23304 (2021-01-01 00:27:36 - 2021-12-31 22:00:14)
r/robinhood: #18893 (2021-01-01 00:22:48 - 2021-12-31 23:12:52)
r/forex: #14643 (2021-01-01 00:07:45 - 2021-12-31 23:21:17)
r/financialindependence: #10338 (2021-01-01 00:26:15 - 2021-12-31 21:45:26)
r/finance: #7130 (2021-01-01 02:33:23 - 2021-12-31 23:35:09)
r/securityanalysis: #1510 (2021-01-01 12:21:09 - 2021-12-30 12:56:24)
Data
See collection methodology, all times in UTC:
id(string): The id of the submission.author(string): The redditors username.created(datetime): Time the submission was created.retrieved(datetime): Time the submission was retrieved.edited(datetime): Time the submission was modified.pinned(integer): Whether or not the submission is pinned.archived(integer): Whether or not the submission is archived.locked(integer): Whether or not the submission is locked.removed(integer): Whether or not the submission is mod removed.deleted(integer): Whether or not the submission is user deleted.is_self(integer): Whether or not the submission is a text.is_video(integer): Whether or not the submission is a video.is_original_content(integer): Whether or not the submission has been set as original content.title(string): The title of the submission.link_flair_text(string): The submission link flairs text content.upvote_ratio(number): The percentage of upvotes from all votes on the submission.score(integer): The number of upvotes for the submission.gilded(integer): The number of gilded awards on the submission.total_awards_received(integer): The number of awards on the submission.num_comments(integer): The number of comments on the submission.num_crossposts(integer): The number of crossposts on the submission.selftext(string): The submission selftext on text posts.thumbnail(string): The submission thumbnail on image posts.shortlink(string): The submission short url.
Usage
See getting started, data available as csv and hdf:
submissions_reddit.csv: Load file usingpandasor any other framework.submissions_reddit.h5: Load file usingpandas >= 1.2.1andpython >= 3.8.5.
Legal
Provided "as is" without guarantee of completeness.
Photo by Lorenzo from Pexels.
- 分享你的想法
全部内容
数据使用声明:
- 1、该数据来自于互联网数据采集或服务商的提供,本平台为用户提供数据集的展示与浏览。
- 2、本平台仅作为数据集的基本信息展示、包括但不限于图像、文本、视频、音频等文件类型。
- 3、数据集基本信息来自数据原地址或数据提供方提供的信息,如数据集描述中有描述差异,请以数据原地址或服务商原地址为准。
- 1、本站中的所有数据集的版权都归属于原数据发布者或数据提供方所有。
- 1、如您需要转载本站数据,请保留原数据地址及相关版权声明。
- 1、如本站中的部分数据涉及侵权展示,请及时联系本站,我们会安排进行数据下线。
VIP下载(最低0.24/天)
1057浏览
2下载
0点赞
收藏
分享