Phishing Website Dataset


Scene: Computer

Data Type: Classification



    Neda Abdelhamid
    Auckland Institute of Studies
    nedah '@' ais.ac.nz


    Data Set Information:

    The phishing problem is considered a vital issue in the “.COM” industry, especially in e-banking and e-commerce, given the number of online transactions involving payments.
    We have identified different features related to legitimate and phishy websites and collected 1353 different websites from different sources. Phishing websites were collected from the Phishtank data archive (www.phishtank.com), which is a free community site where users can submit, verify, track and share phishing data. The legitimate websites were collected from Yahoo and starting point directories using a web script developed in PHP. The PHP script was plugged into a browser, and we collected 548 legitimate websites out of the 1353 websites. There are 702 phishing URLs and 103 suspicious URLs.

    When a website is considered SUSPICIOUS, it can be either phishy or legitimate: the website exhibited both legitimate and phishy features.
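The class counts quoted above can be checked with a few lines of arithmetic. The following is a minimal sketch that uses only the figures stated in this description; the dictionary keys and the helper name `class_shares` are illustrative and are not part of the dataset.

```python
# Class counts as stated in the dataset description.
CLASS_COUNTS = {"legitimate": 548, "phishing": 702, "suspicious": 103}
STATED_TOTAL = 1353


def class_shares(counts):
    """Return each class's fraction of the total number of websites."""
    total = sum(counts.values())
    return {label: n / total for label, n in counts.items()}


total = sum(CLASS_COUNTS.values())
shares = class_shares(CLASS_COUNTS)

# The three class counts should add up to the stated number of websites.
print(f"total websites: {total} (stated: {STATED_TOTAL})")
for label, share in shares.items():
    print(f"{label:>10}: {share:.1%}")
```

The counts sum to the stated total, and the shares show that the classes are imbalanced: phishing is the largest class and suspicious the smallest, which may be worth keeping in mind when evaluating a classifier on this data.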


    Attribute Information:

    URL Anchor
    Request URL
    SFH
    URL Length
    Having “@”
    Prefix/Suffix
    IP
    Sub Domain
    Web traffic
    Domain age
    Class



    The collected features hold the categorical values “Legitimate”, “Suspicious” and “Phishy”; these values have been replaced with the numerical values 1, 0 and -1, respectively.
    Details of each feature are given in the research paper cited below.
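The label encoding described above can be sketched as a simple lookup table. This is an illustrative sketch only: the mapping itself (Legitimate = 1, Suspicious = 0, Phishy = -1) comes from the description, while the function names `encode` and `decode` and the example values are hypothetical and do not come from the dataset files.

```python
# Mapping stated in the dataset description.
LABEL_TO_CODE = {"Legitimate": 1, "Suspicious": 0, "Phishy": -1}
CODE_TO_LABEL = {code: label for label, code in LABEL_TO_CODE.items()}


def encode(labels):
    """Convert categorical labels to their numerical codes."""
    return [LABEL_TO_CODE[label] for label in labels]


def decode(codes):
    """Convert numerical codes (ints or numeric strings) back to labels."""
    return [CODE_TO_LABEL[int(code)] for code in codes]


# Hypothetical values for illustration, not rows from the dataset.
example_labels = ["Legitimate", "Phishy", "Suspicious", "Phishy"]
example_codes = encode(example_labels)

print(example_codes)                            # [1, -1, 0, -1]
print(decode(example_codes) == example_labels)  # True
```

Decoding accepts numeric strings as well as integers because values read from a text file typically arrive as strings.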


    Relevant Papers:

    Citations of the paper that applied this data (listed below) can be viewed at [Web link].



    Citation Request:

    Abdelhamid et al. (2014a). Phishing Detection based Associative Classification Data Mining. Expert Systems With Applications (ESWA), 41 (2014) 5948–5959.
