Oracle 5.0 Reference Manual page 815


Character Sets and Collations That MySQL Supports
You can always list the available character sets and their default collations with the
SHOW CHARACTER SET statement:
mysql> SHOW CHARACTER SET;
+----------+-----------------------------+---------------------+
| Charset  | Description                 | Default collation   |
+----------+-----------------------------+---------------------+
| big5     | Big5 Traditional Chinese    | big5_chinese_ci     |
| dec8     | DEC West European           | dec8_swedish_ci     |
| cp850    | DOS West European           | cp850_general_ci    |
| hp8      | HP West European            | hp8_english_ci      |
| koi8r    | KOI8-R Relcom Russian       | koi8r_general_ci    |
| latin1   | cp1252 West European        | latin1_swedish_ci   |
| latin2   | ISO 8859-2 Central European | latin2_general_ci   |
| swe7     | 7bit Swedish                | swe7_swedish_ci     |
| ascii    | US ASCII                    | ascii_general_ci    |
| ujis     | EUC-JP Japanese             | ujis_japanese_ci    |
| sjis     | Shift-JIS Japanese          | sjis_japanese_ci    |
| hebrew   | ISO 8859-8 Hebrew           | hebrew_general_ci   |
| tis620   | TIS620 Thai                 | tis620_thai_ci      |
| euckr    | EUC-KR Korean               | euckr_korean_ci     |
| koi8u    | KOI8-U Ukrainian            | koi8u_general_ci    |
| gb2312   | GB2312 Simplified Chinese   | gb2312_chinese_ci   |
| greek    | ISO 8859-7 Greek            | greek_general_ci    |
| cp1250   | Windows Central European    | cp1250_general_ci   |
| gbk      | GBK Simplified Chinese      | gbk_chinese_ci      |
| latin5   | ISO 8859-9 Turkish          | latin5_turkish_ci   |
| armscii8 | ARMSCII-8 Armenian          | armscii8_general_ci |
| utf8     | UTF-8 Unicode               | utf8_general_ci     |
| ucs2     | UCS-2 Unicode               | ucs2_general_ci     |
| cp866    | DOS Russian                 | cp866_general_ci    |
| keybcs2  | DOS Kamenicky Czech-Slovak  | keybcs2_general_ci  |
| macce    | Mac Central European        | macce_general_ci    |
| macroman | Mac West European           | macroman_general_ci |
| cp852    | DOS Central European        | cp852_general_ci    |
| latin7   | ISO 8859-13 Baltic          | latin7_general_ci   |
| cp1251   | Windows Cyrillic            | cp1251_general_ci   |
| cp1256   | Windows Arabic              | cp1256_general_ci   |
| cp1257   | Windows Baltic              | cp1257_general_ci   |
| binary   | Binary pseudo charset       | binary              |
| geostd8  | GEOSTD8 Georgian            | geostd8_general_ci  |
| cp932    | SJIS for Windows Japanese   | cp932_japanese_ci   |
| eucjpms  | UJIS for Windows Japanese   | eucjpms_japanese_ci |
+----------+-----------------------------+---------------------+
In cases where a character set has multiple collations, it might not be clear which collation is most
suitable for a given application. To avoid choosing the wrong collation, it can be helpful to perform
some comparisons with representative data values to make sure that a given collation sorts values the
way you expect.
Collation-Charts.Org is a useful site for information that shows how one collation compares to another.
10.1.13.1. Unicode Character Sets
MySQL 5.0 has two Unicode character sets:
• ucs2, the UCS-2 encoding of the Unicode character set using 16 bits per character
• utf8, a UTF-8 encoding of the Unicode character set using one to three bytes per character
You can store text in about 650 languages using these character sets. This section lists the collations
available for each Unicode character set and describes their differentiating properties. For general
information about the character sets, see Section 10.1.10, "Unicode Support".
A similar set of collations is available for each Unicode character set. These are shown in the following
list, where xxx represents the character set name. For example, xxx_danish_ci represents the
Danish collations, the specific names of which are ucs2_danish_ci and utf8_danish_ci.
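As a rough illustration of the statements and advice on this page, the following sketch narrows the character set and collation listings with a LIKE pattern and then compares how two collations order the same values. The table name collation_test and the sample words are hypothetical, and the session is assumed to use utf8 as its connection character set.

```sql
-- Assumption: the client sends and receives utf8 data.
SET NAMES utf8;

-- Show only the character sets whose names start with 'u'.
SHOW CHARACTER SET LIKE 'u%';

-- Show the concrete collations behind the xxx_danish_ci naming pattern.
SHOW COLLATION LIKE '%danish%';

-- Hypothetical scratch table holding representative data values.
CREATE TEMPORARY TABLE collation_test (
  word VARCHAR(20) CHARACTER SET utf8
);

INSERT INTO collation_test (word)
VALUES ('Aalborg'), ('Zebra'), ('Århus'), ('apple');

-- Sort the same values under two candidate collations and compare the results.
SELECT word FROM collation_test ORDER BY word COLLATE utf8_general_ci;
SELECT word FROM collation_test ORDER BY word COLLATE utf8_danish_ci;

DROP TEMPORARY TABLE collation_test;
```

If the two SELECT statements return the rows in different orders, the collations treat some of the sample values differently. Repeating the check with values taken from the real application data is the kind of comparison recommended above.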
