A Novel Statistical Feature Selection Approach for Text Categorization

Loading...
Thumbnail Image

Date

2017-10

Journal Title

Journal ISSN

Volume Title

Type

Article

Publisher

KOREA INFORMATION PROCESSING SOC

Series Info

JOURNAL OF INFORMATION PROCESSING SYSTEMS;Volume: 13 Issue: 5 Pages: 1397-1409

Doi

Abstract

For text categorization task, distinctive text features selection is important due to feature space high dimensionality. It is important to decrease the feature space dimension to decrease processing time and increase accuracy. In the current study, for text categorization task, we introduce a novel statistical feature selection approach. This approach measures the term distribution in all collection documents, the term distribution in a certain category and the term distribution in a certain class relative to other classes. The proposed method results show its superiority over the traditional feature selection methods.

Description

Accession Number: WOS:000418488900025

Keywords

University for MODEL, FREQUENCY, ALGORITHM, IDENTIFICATION, CLASSIFICATION, SENTIMENT ANALYSIS, E-mail Filte FEATURE SUBSET-SELECTION, Text Categorization, SMS Spam Filtering, Feature Selection, Electronic Texts

Citation