Select Language






Data Type:

所需积分:15 去赚积分?
  • 300浏览
  • 2下载
  • 0点赞
  • 收藏
  • 分享

Data Preview ? 13K

    Data Structure ?


    Data Set Information:

    This file concerns credit card applications.  All attribute names and values have been changed to meaningless symbols to protect confidentiality of the data.
    This dataset is interesting because there is a good mix of attributes -- continuous, nominal with small numbers of values, and nominal with larger numbers of values.  There are also a few missing values.

    Attribute Information:

    A1: b, a.
    A2: continuous.
    A3: continuous.
    A4: u, y, l, t.
    A5: g, p, gg.
    A6: c, d, cc, i, j, k, m, r, q, w, x, e, aa, ff.
    A7: v, h, bb, j, n, z, dd, ff, o.
    A8: continuous.
    A9: t, f.
    A10: t, f.
    A11: continuous.
    A12: t, f.
    A13: g, p, s.
    A14: continuous.
    A15: continuous.
    A16: +,-         (class attribute)

    Relevant Papers:

    Quinlan. "Simplifying decision trees", Int J Man-Machine Studies 27, Dec 1987, pp. 221-234.
    [Web link]

    Quinlan. "C4.5: Programs for Machine Learning", Morgan Kaufmann, Oct 1992
    [Web link]

    Papers That Cite This Data Set1:

    Xiaoming Huo. FBP: A Frontier-based Tree-Pruning Algorithm. Seoung Bum Kim. 2002.  [View Context].

    Lorne Mason and Peter L. Bartlett and Jonathan Baxter. Improved Generalization Through Explicit Optimization of Margins. Machine Learning, 38. 2000.  [View Context].

    Kagan Tumer and Joydeep Ghosh. Robust Combining of Disparate Classifiers through Order Statistics. CoRR, csLG/9905013. 1999.  [View Context].

    Lorne Mason and Peter L. Bartlett and Jonathan Baxter. Direct Optimization of Margins Improves Generalization in Combined Classifiers. NIPS. 1998.  [View Context].

    Citation Request:

    Please refer to the Machine Learning Repository's citation policy

    (confidential source)

    Submitted by quinlan '@'