GitHub Issue Titles and Descriptions for NLP Analysis: Over 8 million GitHub issue titles and descriptions

Scene:

NLP, Software

Data Type:

Classification

Contributor

小小程序员

Dedicated to research on artificial intelligence applications and to dataset processing.

Data Preview: 2.85 GB

    Data Structure

    *The actual data structure is determined by the data itself.

    Over 8 million GitHub issue titles and descriptions from 2017. Prepared following the instructions in "How To Create Data Products That Are Magical Using Sequence-to-Sequence Models".
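    Because the archive is several gigabytes, it may help to stream the issues rather than load them all at once. The following is a minimal sketch, assuming the data ships as a CSV file with one issue per row. The column names `issue_title` and `body` and the helper name `iter_issue_pairs` are hypothetical and not confirmed by this page; adjust them to match the actual file.

```python
import csv
from typing import Iterator, Tuple

# Issue descriptions can be long; raise the csv module's per-field limit.
csv.field_size_limit(10**9)


def iter_issue_pairs(
    path: str,
    title_col: str = "issue_title",  # assumed column name
    body_col: str = "body",          # assumed column name
) -> Iterator[Tuple[str, str]]:
    """Yield (title, description) pairs one row at a time.

    Rows with an empty title or an empty description are skipped.
    """
    with open(path, newline="", encoding="utf-8") as f:
        for row in csv.DictReader(f):
            title = (row.get(title_col) or "").strip()
            body = (row.get(body_col) or "").strip()
            if title and body:
                yield title, body
```

    Reading row by row keeps memory use roughly constant regardless of file size, which suits a sequence-to-sequence setup where each description is the input and its title is the target.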

    Original Source

    The data was adapted from GitHub data accessible from GitHub Archive.  The constructocat image is from https://octodex.github.com/constructocat-v2.

    License

    MIT License

    Copyright (c) 2018 David Shinn

    Permission is hereby granted, free of charge, to any person obtaining a copy of this software and associated documentation files (the "Software"), to deal in the Software without restriction, including without limitation the rights to use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies of the Software, and to permit persons to whom the Software is furnished to do so, subject to the following conditions:

    The above copyright notice and this permission notice shall be included in all copies or substantial portions of the Software.

    THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.

    Usability

    8.75

    License

    Other (specified in description)

    Expected update frequency

