題名: | Transition from ANSI to Unicode: Multilingual Support in Operating Systems and Programming Languages |
作者: | Wu, Pei-Chi |
關鍵字: | character sets text files control codes byte order string literals source files |
期刊名/會議名稱: | 1998 ICS會議 |
摘要: | Character sets are one of basic issues for information interchange. Most current character sets extend ANSI's 7-bit character set. These extensions are conflicted with each other and make the design of multilingual information systems complicated. Unicode or Universal Character Set (UCS) is a character set that covers symbols in major written languages. Text files and strings usually have no header to indicate which character set is in use, and they currently use ANSI by default. The transition from ANSI to Unicode may last a longer time than expected. This paper presents the following methods to help the transition: 1) A text file format of fixedwidth characters 2) Atagged string storage: Each string has a tag representing which character set or coding format is in use. 3) A method for assigning the format of string literals. These methods can improve multilingual support without introducing muchcomplexity. |
日期: | 2006-10-22T09:06:15Z |
分類: | 1998年 ICS 國際計算機會議 |
文件中的檔案:
檔案 | 描述 | 大小 | 格式 | |
---|---|---|---|---|
ce07ics001998000149.pdf | 532.27 kB | Adobe PDF | 檢視/開啟 |
在 DSpace 系統中的文件,除了特別指名其著作權條款之外,均受到著作權保護,並且保留所有的權利。