Show simple item record

dc.contributor.authorNguyen, Tran Dang
dc.date.accessioned2013-07-15T08:49:26Z
dc.date.accessioned2018-05-17T03:44:31Z
dc.date.available2013-07-15T08:49:26Z
dc.date.available2018-05-17T03:44:31Z
dc.date.issued2009
dc.identifier.urihttp://10.8.20.7:8080/xmlui/handle/123456789/228
dc.descriptionOther Author: Tram, Tran Maien_US
dc.description.abstractMissing value is very popular in data acquisition. When handling data sets having missing values, classification methods have some difficulties in their learning process. Several pre-processing techniques, such as data replacing or data imputation, have been introduced to remove missing values before being processed by classification methods. However, when the percentage of missing values in the data set goes up (sometimes to 60-70%), such pre-processing techniques cannot be successfully used. This thesis introduces a new version of algorithm CN2 and Rules6 which represented for Separate and Conquer method in Rule Induction algorithms that has abilities to directly handling missing values and processing data set with large percentages of missing values. Tested on benchmarking data sets from UCI, the new algorithms achieved a better performance than that of popular methods Decision Table and C4.5 in directly handling missing values. These new algorithms also achieved significant results when compare with one common preprocessing missing value data set method, filled with the mode and means of attribute value.en_US
dc.description.sponsorshipDr. Nguyen Duc Cuongen_US
dc.language.isoenen_US
dc.publisherInternational University HCMC, Vietnamen_US
dc.relation.ispartofseries;022000190
dc.subjectProgrammingen_US
dc.titleProcessing missing value for rule induction algorithmsen_US
dc.typeThesisen_US


Files in this item

Thumbnail

This item appears in the following Collection(s)

Show simple item record