A Novel Statistical Feature Selection Approach for Text Categorization

Fattah, Mohamed Abdel

MSAR Home
→
MSA University Academic Research
→
Research Papers, Articles and Books Chapters.
→
Faculty Of Engineering Research Paper
→
View Item

A Novel Statistical Feature Selection Approach for Text Categorization

Fattah, Mohamed Abdel

Full Text link: https://pdfs.semanticscholar.org/d75c/65273c03aa91b9a924a3b866894ec11e260e.pdf

Date issued: 2017-10

Scientific Journal Rankings: Click Here

Publisher: KOREA INFORMATION PROCESSING SOC

Series Info: JOURNAL OF INFORMATION PROCESSING SYSTEMS;Volume: 13 Issue: 5 Pages: 1397-1409

Type: Article

Keywords: University for MODEL , FREQUENCY , ALGORITHM , IDENTIFICATION , CLASSIFICATION , SENTIMENT ANALYSIS , E-mail Filte FEATURE SUBSET-SELECTION , Text Categorization , SMS Spam Filtering , Feature Selection , Electronic Texts

Abstract:

For text categorization task, distinctive text features selection is important due to feature space high dimensionality. It is important to decrease the feature space dimension to decrease processing time and increase accuracy. In the current study, for text categorization task, we introduce a novel statistical feature selection approach. This approach measures the term distribution in all collection documents, the term distribution in a certain category and the term distribution in a certain class relative to other classes. The proposed method results show its superiority over the traditional feature selection methods.