Browser-Oriented Data Extraction

Wu, I-Chen; Su, Jui-Yuan; Chen, Loon-Been

Full metadata record

DC Field	Value	Language
dc.contributor.author	Wu, I-Chen
dc.contributor.author	Su, Jui-Yuan
dc.contributor.author	Chen, Loon-Been
dc.date.accessioned	2009-06-02T06:39:41Z
dc.date.accessioned	2020-05-25T06:41:22Z	-
dc.date.available	2009-06-02T06:39:41Z
dc.date.available	2020-05-25T06:41:22Z	-
dc.date.issued	2006-10-16T05:35:58Z
dc.date.submitted	2004-12-15
dc.identifier.uri	http://dspace.lib.fcu.edu.tw/handle/2377/1548	-
dc.description.abstract	Traditionally, most researchers have used the URL-oriented data extraction model. In this model, the system extracts URLs from pages and then uses the extracted URLs to access the next pages. However, more and more pages now use script functions to access the next pages. Since it is hard to extract URLs from script programs, this model is inappropriate for such pages. To solve this problem, this paper proposes a new data extraction model, named the browser-oriented data extraction model. In this model, the system, built on top of a browser, accesses pages by simulating users’ operations on the browser, which can also trigger script functions. In addition, this paper defines a scripting language, named the BODED (Browser-Oriented Data Extraction Description) Language, which instructs the system on how to perform data extraction.
dc.description.sponsorship	Tatung University, Taipei City
dc.format.extent	6p.
dc.format.extent	800116 bytes
dc.format.mimetype	application/pdf
dc.language.iso	zh_TW
dc.relation.ispartofseries	2004 ICS Conference
dc.subject	data extraction
dc.subject	Internet
dc.subject	BODED
dc.subject.other	Miscellaneous
dc.title	Browser-Oriented Data Extraction
Appears in Collections:	2004 ICS International Computer Symposium

Files in this item:

File	Description	Size	Format
ce07ics002004000100.pdf		781.36 kB	Adobe PDF

Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.

Feng Chia University Institutional Repository