ebooks logo journals logo reference works logo abstract databases logo
bullet  SIGN IN Register | Why Register? | Got a Voucher? alerts   marked lists   shopping cart 

informaworld

HOME   |   SEARCH   |   BROWSE
    Issues List       Latest Issue       Volume 17 Issue 5 & 6       Subscribe       Article       Related articles      
<< firstfirst   < prevprev   Table of contentstoc   next >next   last >>last
Publisher Logo Publication Cover
Search within this journal

PRE-PROCESSING OF HIGH-DIMENSIONAL CATEGORICAL PREDICTORS IN CLASSIFICATION SETTINGS 

Authors: Eugene Tuv a; George Runger b
Affiliations:   a Analysis Control Technology, Intel Corporation, Chandler, AZ, USA.
b Department of Industrial Engineering, Arizona State University, Tempe, AZ, USA.
DOI: 10.1080/713827172
Publication Frequency: 10 issues per year
Published in: journal Applied Artificial Intelligence, Volume 17, Issue 5 & 6 May 2003 , pages 419 - 429
Formats available: PDF (English)
Article Requests: Order Reprints : Request Permissions
View Article: View Article (PDF) View Article (PDF)


Abstract

Models in industrial applications can encounter categorical predictors with a large number of categories (hundreds or thousands). An example is the lot identifier of product in semiconductor manufacturing. Such variables represent a serious problem for practically all modern classification techniques. The goal is an efficient, computationally fast way to discover a small number of natural partitions of values for such variables that have similar statistical properties in terms of categorical response. Such partitions (interesting by itself) can be used then as an input to standard learning algorithms, such as decision trees, support vector machines, etc. The proposed approach introduces a data transformation on derived sparse frequency tables. Application of even simplest non-hierarchical metric clustering method to the transformed coordinates shows significant improvement both in speed and quality of partition in comparison to currently used methods.
Bookmark with:
  • CiteULike
  • Del.icio.us
  • BibSonomy
  • Connotea
  • More bookmarks
Privacy Policy | Terms & Conditions | Accessibility | RSS
FAQs in: English . Français . Español . 中文(简体和繁體)
© 2010 Informa plc