題名: | Improving the Syntax-based Retrieval System Using Collocation Indexing |
作者: | Chen, Ruey-Jinng Kuo, Chin-Hwa Tsao, Nai-Lung Hung, Tsung-Fu |
關鍵字: | POS tagging Lemmatizing Collocation k-gram Indexing |
期刊名/會議名稱: | 2008 ICS會議 |
摘要: | The purpose of this paper is to design a syntax search system and to apply it to a movie search system. The concepts applied include those in the field of linguistics and collocation, to increase the speed of the syntax search system. First, we must process the keywords in the database by labeling them according to their part of speech. From the results of the process, we will construct a K-gram index and Collocation index.In this proposal we bring out a few examples of common English syntax rules and sentence structures as test models. After the run through, the K-gram index and the Collocation index are compared. We have found that part of the sentence, after having gone through the Collocation index search, has a far smaller sample space that the K-gram index alone, which is to say that the Collocation index is able to find the most correct result from fewer samples, thus minimizing the time cost in Query Match. |
日期: | 2009-01-03T09:13:49Z |
分類: | 2008年 ICS 國際計算機會議 |
文件中的檔案:
檔案 | 描述 | 大小 | 格式 | |
---|---|---|---|---|
ce07ics002008000003.pdf | 264.4 kB | Adobe PDF | 檢視/開啟 |
在 DSpace 系統中的文件,除了特別指名其著作權條款之外,均受到著作權保護,並且保留所有的權利。