Attribute Selection for Numerical Databases that Contain Correlations

Clc Number:

Fund Project:

  • Article
  • |
  • Figures
  • |
  • Metrics
  • |
  • Reference
  • |
  • Related
  • |
  • Cited by
  • |
  • Materials
  • |
  • Comments

    There are many correlated attributes in a database. Conventional attribute selection methods are not able to handle such correlations and tend to eliminate important rules that exist in correlated attributes. In this paper, we propose an attribute selection method that preserves important rules on correlated attributes. We rst compute a ranking of attributes by using conventional attribute selection methods. In addition, we compute two-dimensional rules for each pair of attributes and evaluate their importance for predicting a target attribute. Then, we evaluate the shapes of important two-dimensional rules to pick up hidden important attributes that are under-estimated by conventional attribute selection methods. After the shape evaluation, we re-calculate the ranking so that we can preserve the important correlations. Intensive experiments show that the proposed method can select important correlated attributes that are eliminated by conventional methods.

    Cited by
Get Citation

Taufik Djatna, Yasuhiko Morimoto. Attribute Selection for Numerical Databases that Contain Correlations. International Journal of Software and Informatics, 2008,2(2):125~139

Article Metrics
  • Abstract:
  • PDF:
  • HTML:
  • Cited by:
  • Received:October 14,2008
  • Revised:December 19,2008
  • Adopted:
  • Online:
  • Published: