Text Categorization Using Latent Topics as Additional Features

完整後設資料紀錄

DC 欄位	值	語言
dc.contributor.author	Mizugai, Hiroshi
dc.contributor.author	Paik, Incheon
dc.contributor.author	Kanemoto, Shigeru
dc.date.accessioned	2009-06-02T07:05:32Z
dc.date.accessioned	2020-05-25T06:48:36Z	-
dc.date.available	2009-06-02T07:05:32Z
dc.date.available	2020-05-25T06:48:36Z	-
dc.date.issued	2009-02-12T02:15:47Z
dc.date.submitted	2009-02-12
dc.identifier.uri	http://dspace.lib.fcu.edu.tw/handle/2377/11205	-
dc.description.abstract	In feature selection of text categorization, there are methods which handle word sense disambiguation by extracting synonymy and polysemy among words in documents. One of the methods utilizes latent topics underlying documents by using a topic model. PLSA and LDA have been proposed as representative models. In this paper, two features which include both TF-IDF and the latent topic values which extracted automatically from topic models were utilized for text categorization using AdaBoost. Then, the performances were compared with the ones of only TF-IDF features. As a result, this study evaluates effectiveness and weakness of the augmented features.
dc.description.sponsorship	淡江大學，台北縣
dc.format.extent	6p.
dc.relation.ispartofseries	2008 ICS會議
dc.subject	Machine Learning
dc.subject	Text Categorization
dc.subject	Latent Topics
dc.subject	AdaBoost
dc.subject.other	Artificial Intelligence
dc.title	Text Categorization Using Latent Topics as Additional Features
分類:	2008年 ICS 國際計算機會議

文件中的檔案：

檔案	描述	大小	格式
ce07ics002008000154.pdf		181.08 kB	Adobe PDF	檢視/開啟

在 DSpace 系統中的文件，除了特別指名其著作權條款之外，均受到著作權保護，並且保留所有的權利。

逢甲大學校園典藏知識庫