題名: | A Level-wise Clustering Algorithm on Structured Documents |
作者: | Wang, Ching-Yao |
關鍵字: | document clustering structured document tree structure clustering |
期刊名/會議名稱: | 中華民國92年全國計算機會議 |
摘要: | Document clustering is the process of applying clustering technique for document management [4][5]. Similar documents are grouped together so that both managing and searching the documents is efficient. However, since traditional document clustering algorithms do not take the structure information of documents into consideration, the clustering results can not reflect the characteristics of the documents fully. As the result, we represent each document as a tree structure and propose a level-wise clustering algorithm to solve this issue. The clustering process applies the level property of the tree and run level by level by the concept generalization operation. In order to store the clustering results and search interesting clusters efficiently, a multistage graph called Level-wise Document Clustering Graph (LDC-Graph) is proposed. Based on LDC-Graph, three search strategies are provided to meet the different requirements for uses. Finally, the experimental results show that the similarity search is efficient and the accuracy of the search is acceptable |
日期: | 2006-06-28T07:21:58Z |
分類: | 2003年 NCS 全國計算機會議 |
文件中的檔案:
檔案 | 描述 | 大小 | 格式 | |
---|---|---|---|---|
OT_1372003295.pdf | 108.21 kB | Adobe PDF | 檢視/開啟 |
在 DSpace 系統中的文件,除了特別指名其著作權條款之外,均受到著作權保護,並且保留所有的權利。