Full metadata record
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Wu, I-Chen | |
dc.contributor.author | Su, Jui-Yuan | |
dc.contributor.author | Chen, Loon-Been | |
dc.date.accessioned | 2009-06-02T06:39:41Z | |
dc.date.accessioned | 2020-05-25T06:41:22Z | - |
dc.date.available | 2009-06-02T06:39:41Z | |
dc.date.available | 2020-05-25T06:41:22Z | - |
dc.date.issued | 2006-10-16T05:35:58Z | |
dc.date.submitted | 2004-12-15 | |
dc.identifier.uri | http://dspace.lib.fcu.edu.tw/handle/2377/1548 | - |
dc.description.abstract | Traditionally, most researchers have used the URL-oriented data extraction model. In this model, a system extracts URLs from pages and then uses the extracted URLs to access the next pages. However, more and more pages now use script functions to access the next pages. Since it is hard to extract URLs from script programs, this model is inappropriate for such pages. To solve this problem, this paper proposes a new data extraction model, named the browser-oriented data extraction model. In this model, a system built on top of browsers accesses pages by simulating users’ operations on browsers, which can also trigger script functions. In addition, this paper defines a scripting language, named the BODED (Browser-Oriented Data Extraction Description) Language, which instructs the system to perform data extraction. | |
dc.description.sponsorship | Tatung University, Taipei City | |
dc.format.extent | 6p. | |
dc.format.extent | 800116 bytes | |
dc.format.mimetype | application/pdf | |
dc.language.iso | zh_TW | |
dc.relation.ispartofseries | 2004 ICS Conference | |
dc.subject | data extraction | |
dc.subject | Internet | |
dc.subject | BODED | |
dc.subject.other | Miscellaneous | |
dc.title | Browser-Oriented Data Extraction | |
Appears in Collections: | 2004 ICS International Computer Symposium |
Files in This Item:
File | Description | Size | Format | |
---|---|---|---|---|
ce07ics002004000100.pdf | | 781.36 kB | Adobe PDF | View/Open |
Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.