Character Sets and Collations That MySQL Supports
•
hp8_bin
•
hp8_english_ci
•
(cp1252 West European) collations:
latin1
•
latin1_bin
•
latin1_danish_ci
•
latin1_general_ci
•
latin1_general_cs
•
latin1_german1_ci
•
latin1_german2_ci
•
latin1_spanish_ci
•
latin1_swedish_ci
is the default character set. MySQL's
latin1
character set. This means it is the same as the official
Numbers Authority) latin1, except that IANA
as "undefined," whereas cp1252, and therefore MySQL's latin1, assign characters for
0x9f
those positions. For example,
translates
to Unicode 0x0081,
0x81
to 0x009d.
The
latin1_swedish_ci
customers. Although it is frequently said that it is based on the Swedish/Finnish collation rules, there
are Swedes and Finns who disagree with this statement.
The
latin1_german1_ci
DIN-2 standards, where DIN stands for Deutsches Institut für Normung (the German equivalent of
ANSI). DIN-1 is called the "dictionary collation" and DIN-2 is called the "phone book collation." For
an example of the effect this has in comparisons or when doing searches, see
"Examples of the Effect of
•
latin1_german1_ci
Ä = A
Ö = O
Ü = U
ß = s
•
latin1_german2_ci
Ä = AE
Ö = OE
Ü = UE
ß = ss
In the
latin1_spanish_ci
•
(Mac West European) collations:
macroman
•
macroman_bin
•
macroman_general_ci
•
(7bit Swedish) collations:
swe7
(default)
(default)
is the Euro sign. For the "undefined" entries in cp1252, MySQL
0x80
to 0x008d,
0x8d
collation is the default that probably is used by the majority of MySQL
and
latin1_german2_ci
Collation".
(dictionary) rules:
(phone-book) rules:
collation, "ñ" (n-tilde) is a separate letter between "n" and "o".
(default)
798
is the same as the Windows
latin1
or IANA (Internet Assigned
ISO 8859-1
treats the code points between
latin1
to 0x008f,
0x8f
collations are based on the DIN-1 and
cp1252
and
0x80
to 0x0090, and
0x90
Section 10.1.7.8,
0x9d