完整後設資料紀錄
DC 欄位 | 值 | 語言 |
---|---|---|
dc.contributor.author | Huang, Feng-Long | |
dc.contributor.author | Yu, Ming-Shing | |
dc.contributor.author | Chiang, Yang-Kua | |
dc.date.accessioned | 2009-06-02T08:41:30Z | |
dc.date.accessioned | 2020-07-05T06:32:05Z | - |
dc.date.available | 2009-06-02T08:41:30Z | |
dc.date.available | 2020-07-05T06:32:05Z | - |
dc.date.issued | 2006-06-14T03:26:50Z | |
dc.date.submitted | 2003-12-18 | |
dc.identifier.uri | http://dspace.fcu.edu.tw/handle/2376/1783 | - |
dc.description.abstract | We survey several frequent smoothing methods used by language models for Mandarin. Due to the problem of data sparseness, smoothing techniques are employed to re-estimate the probability for all events while calculating the probability of occurrence. Among well-known smoothing methods, Good-Turing is employed widely. We have proposed a set of properties to analyze the behaviors of Good-Turing in this paper. Two novel smoothing methods are proposed. Finally, we implement three n-gram for Mandarin and then analyze the entropy and related problems of the Good-Turing; such as cut-off value and types of events . | |
dc.description.sponsorship | 逢甲大學,台中市 | |
dc.format.extent | 6P. | |
dc.format.extent | 73465 bytes | |
dc.format.mimetype | application/pdf | |
dc.language.iso | zh_TW | |
dc.relation.ispartofseries | 中華民國92年全國計算機會議 | |
dc.subject | Language models | |
dc.subject | smoothing methods | |
dc.subject | statistical behavior | |
dc.subject | entropy | |
dc.subject.other | 其他領域 | |
dc.title | Survey of the Smoothing Issues on Mandarin Language Models | |
分類: | 2003年 NCS 全國計算機會議 |
文件中的檔案:
檔案 | 描述 | 大小 | 格式 | |
---|---|---|---|---|
OT_1342003236.pdf | 71.74 kB | Adobe PDF | 檢視/開啟 |
在 DSpace 系統中的文件,除了特別指名其著作權條款之外,均受到著作權保護,並且保留所有的權利。