Oracle 5.0 Reference Manual page 823

Character Sets and Collations That MySQL Supports
• The sjis character set does not support the conversion of these extension characters.
• There are several conversion rules from so-called "SHIFT JIS" to Unicode, and some characters
are converted to Unicode differently depending on the conversion rule. MySQL supports only one of
these rules (described later).
The MySQL cp932 character set is designed to solve these problems. It is available as of MySQL
5.0.3.
Because MySQL supports character set conversion, it is important to separate IANA Shift_JIS and
cp932 into two different character sets because they provide different conversion rules.
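The difference between the two conversion rules can be observed outside MySQL as well. The
following is a minimal sketch that uses Python's built-in shift_jis and cp932 codecs as stand-ins;
it is an assumption that they are close enough to MySQL's sjis and cp932 for illustration, and the
helper names decode_both and encodable are hypothetical.

```python
# Illustration only: Python's codecs, not MySQL's own conversion tables.

def decode_both(raw):
    """Decode the same bytes under Shift_JIS and under cp932."""
    return raw.decode("shift_jis"), raw.decode("cp932")


def encodable(text, encoding):
    """Return True if text can be converted from Unicode to the encoding."""
    try:
        text.encode(encoding)
    except UnicodeEncodeError:
        return False
    return True


# 0x8160 is defined in both encodings but maps to different Unicode values.
sjis_char, cp932_char = decode_both(b"\x81\x60")
print("shift_jis: U+%04X" % ord(sjis_char))   # U+301C (WAVE DASH)
print("cp932:     U+%04X" % ord(cp932_char))  # U+FF5E (FULLWIDTH TILDE)

# U+2460 (CIRCLED DIGIT ONE) is an NEC special character: an extension
# character that only the cp932 codec can convert.
print(encodable("\u2460", "shift_jis"))  # False
print(encodable("\u2460", "cp932"))      # True
```

Treating the two encodings as one character set would force a single choice for cases like
0x8160, which is why keeping them separate matters for conversion.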
How does cp932 differ from sjis?
The cp932 character set differs from sjis in the following ways:
• cp932 supports NEC special characters, NEC selected—IBM extended characters, and IBM
selected characters.
• Some cp932 characters have two different code points, both of which convert to the same Unicode
code point. When converting from Unicode back to cp932, one of the code points must be
selected. For this "round trip conversion," the rule recommended by Microsoft is used. (See
http://support.microsoft.com/kb/170559/EN-US/.)
The conversion rule works like this:
• If the character is in both JIS X 0208 and NEC special characters, use the code point of JIS X
0208.
• If the character is in both NEC special characters and IBM selected characters, use the code point
of NEC special characters.
• If the character is in both IBM selected characters and NEC selected—IBM extended characters,
use the code point of IBM extended characters.
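The first of these rules can be illustrated with U+2252, which has both a JIS X 0208 code
(0x81E0) and an NEC special character code (0x8790). The sketch below uses Python's built-in cp932
codec rather than MySQL, on the assumption that it follows the same Microsoft recommendation; the
helper name round_trip is hypothetical.

```python
# Illustration only: Python's cp932 codec, not MySQL's own conversion tables.

JIS_X_0208_CODE = b"\x81\xe0"   # JIS X 0208 code for U+2252
NEC_SPECIAL_CODE = b"\x87\x90"  # NEC special character code for U+2252


def round_trip(raw, encoding="cp932"):
    """Convert bytes to Unicode and back again in the same encoding."""
    return raw.decode(encoding).encode(encoding)


for raw in (JIS_X_0208_CODE, NEC_SPECIAL_CODE):
    unicode_value = ord(raw.decode("cp932"))
    # Print the source code, its Unicode value, and the code chosen on the way back.
    print("%s -> U+%04X -> %s" % (raw.hex(), unicode_value, round_trip(raw).hex()))
```

Both codes decode to the same Unicode code point, so the conversion back has to settle on one of
them. Under the rule described above, the JIS X 0208 code would be the one selected.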
The table shown at http://www.microsoft.com/globaldev/reference/dbcs/932.htm provides information
about the Unicode values of cp932 characters. For cp932 table entries with characters under which
a four-digit number appears, the number represents the corresponding Unicode (ucs2) encoding.
For table entries with an underlined two-digit value, there is a range of cp932 character
values that begin with those two digits. Clicking such a table entry takes you to a page that displays
the Unicode value for each of the cp932 characters that begin with those digits.
The following links are of special interest. They correspond to the encodings for the following sets of
characters:
• NEC special characters:
http://www.microsoft.com/globaldev/reference/dbcs/932/932_87.htm
• NEC selected—IBM extended characters:
http://www.microsoft.com/globaldev/reference/dbcs/932/932_ED.htm
http://www.microsoft.com/globaldev/reference/dbcs/932/932_EE.htm
• IBM selected characters:
http://www.microsoft.com/globaldev/reference/dbcs/932/932_FA.htm
http://www.microsoft.com/globaldev/reference/dbcs/932/932_FB.htm
http://www.microsoft.com/globaldev/reference/dbcs/932/932_FC.htm
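Each page name above ends in the two hexadecimal digits with which the cp932 codes on that page
begin, so a double-byte code can be assigned to one of these sets by its first byte. A minimal
Python sketch of that lookup follows; the function name classify_cp932 and the set labels are
hypothetical, and the mapping is taken only from the links listed above.

```python
# Lead byte -> extension set, taken from the page names listed above
# (932_87, 932_ED, 932_EE, 932_FA, 932_FB, 932_FC).
EXTENSION_SETS = {
    0x87: "nec_special",
    0xED: "nec_selected_ibm_extended",
    0xEE: "nec_selected_ibm_extended",
    0xFA: "ibm_selected",
    0xFB: "ibm_selected",
    0xFC: "ibm_selected",
}


def classify_cp932(code):
    """Return the extension set for a two-byte cp932 code, or 'other'."""
    if len(code) != 2:
        raise ValueError("expected a two-byte cp932 code")
    return EXTENSION_SETS.get(code[0], "other")


print(classify_cp932(b"\x87\x40"))  # nec_special
print(classify_cp932(b"\xfa\x40"))  # ibm_selected
print(classify_cp932(b"\x81\x60"))  # other
```

The lookup only inspects the lead byte; it does not check that the second byte forms a valid
cp932 code.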
• Starting from version 5.0.3, cp932 supports conversion of user-defined characters in combination
with eucjpms, and solves the problems with sjis/ujis conversion. For details, please refer to
http://www.opengroup.or.jp/jvc/cde/sjis-euc-e.html.
