題名: | A Discriminant based Document Analysis for Text Classification |
作者: | Lin, Yi-Xian Chien, Been-Chian |
關鍵字: | text classification classification coefficient discriminant coefficient correlation measure document analysis |
期刊名/會議名稱: | 2010 ICS會議 |
摘要: | Text classification technologies rely heavily on the distribution of features, and the selection of discriminant features with regards to the classes as the main basis for classification. In this paper, we propose the discriminant coefficient to represent the features of a document. Based on the discriminant coefficient, the classification coefficient for each document class is defined and computed. Then, a correlation measure approach is designed for text classification. The experimental results show that the proposed approach of document analysis has good effectiveness in comparison with the method of TF-IDF with cosine similarity for a single class text classification. Especially, as a document set with nearly equivalent number of documents for each class, the proposed approach can achieve better results than the traditional vector based methods. |
日期: | 2011-01-26T00:55:00Z |
分類: | 2010年 ICS 國際計算機會議(如需查看全文,請連結至IEEE Xplore網站) |
文件中的檔案:
沒有與此文件相關的檔案。
在 DSpace 系統中的文件,除了特別指名其著作權條款之外,均受到著作權保護,並且保留所有的權利。