Wednesday, December 10, 2008

Tutorial Notes on Unicode

Tutorial Notes on Unicode

About This Book

Character Sets and Encodings

  • Definitions
  • Commonly Used Character Sets and Encodings

GB2312 Character Set and Encodings

  • GB2312 Character Set
  • EUC-CN Encoding

GB18030 Character Set and Encoding

  • History of GB Character Set
  • GB18030-2000 Encoding

JIS X0208 Character Set and Encodings

  • JIS X0208 Character Set
  • EUC-JP Encoding
  • ISO-2022-JP Encoding
  • Shift-JIS Encoding

Unicode Character Set

  • What is Unicode

Unicode Transformation Formats (UTF)

  • UTF-8 Encoding
  • UTF-16LE Encoding
  • UTF-16BE Encoding

JDK - Character Set and Encoding

  • What is a Character Encoding
  • Supported Character Encodings
  • Methods to Encode Characters
  • Methods to Decode Byte Sequences

JDK - Encoding Maps

  • Encoding Map Analyzer
  • US-ASCII
  • ISO-8859-1 - Latin 1
  • CP1252 - Windows-1252
  • UTF-8
  • UTF-16
  • UTF-16LE
  • UTF-16BE

JDK - Encoding Map Counts

  • Encoding Map Counter
  • Comparison of Encoding Maps

JDK - Encoding Conversion

  • Unicode Data Entry
  • Unicode Encoding Conversion
  • Viewing Unicode Text
  • Unicode Signs in Different Encodings

References

Labels:

0 Comments:

Post a Comment

<< Home