Character Sets; Dec Multinational Character Set (Dec_Mcs); Categories Of Ascii Character Set Characters; Categories Of Dec Multinational Character Set Characters - Compaq DEC Text Processing Utility (DECTPU) Guide Manual

Lexical Elements of the DEC Text Processing Utility Language

4.3 Character Sets

When you invoke DECTPU, you can use one of the following keywords with the
/CHARACTER_SET qualifier to specify the character set that you want DECTPU
to use:

    DEC_MCS (for the DEC Multinational Character Set)
    ISO_LATIN1 (for the ISO Latin1 Character Set)
    GENERAL (for other general character sets)
    TPU$CHARACTER_SET (see the DCL help topic for this logical name)

Each character set is an 8-bit character set with 256 characters. Each character
in a set is assigned a decimal code ranging from 0 to 255. Each character set
is an extension of the American Standard Code for Information Interchange
(ASCII) character set: the first 128 characters (codes 0 to 127) are the ASCII
characters. Table 4–1 shows the categories into which you can group the ASCII
characters.

Table 4–1 Categories of ASCII Character Set Characters

Category    Meaning
0–31        Nonprinting characters such as tab, line feed, carriage return,
            and bell
32          Space
33–64       Special characters such as the ampersand ( & ), question mark ( ? ),
            equal sign ( = ), and the numbers 0 through 9
65–122      The uppercase and lowercase letters A through Z and a through z
            (codes 91–96 within this range are the special characters
            [ \ ] ^ _ `)
123–126     Special characters such as the left brace ( { ) and the tilde ( ~ )
127         Delete

The following sections discuss the types of character sets supported by DECTPU.

4.3.1 DEC Multinational Character Set (DEC_MCS)

The DEC Multinational Character Set characters from 128 to 255 are extended
control characters and supplemental multinational characters. Table 4–2 shows
the categories into which you can group these characters.

Table 4–2 Categories of DEC Multinational Character Set Characters

Category    Meaning
128–159     Extended control characters
160         Reserved
161–191     Supplemental special graphics characters such as the copyright
            sign ( © ) and the degree sign ( ° )
192–254     The supplemental multinational uppercase and lowercase letters such
            as the Spanish Ñ and ñ
255         Reserved

For a complete list of characters in the DEC Multinational Character Set, see the
OpenVMS documentation.
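
The category ranges in Tables 4–1 and 4–2 can be expressed as a small lookup. The following Python sketch is an illustration only and is not part of DECTPU; the names ASCII_CATEGORIES, DEC_MCS_CATEGORIES, and categorize are hypothetical, and the short category labels are paraphrases of the table entries. It follows the tables' ranges as written, so the 65–122 range is treated as one category even though codes 91–96 are punctuation in ASCII.

```python
# Illustrative sketch only: hypothetical names, not a DECTPU interface.
# Each entry is (first_code, last_code, label), transcribed from the
# category ranges in Table 4-1 (ASCII) and Table 4-2 (DEC_MCS).
ASCII_CATEGORIES = [
    (0, 31, "nonprinting character"),
    (32, 32, "space"),
    (33, 64, "special character or digit"),
    (65, 122, "letter"),  # coarse range, as the table groups it
    (123, 126, "special character"),
    (127, 127, "delete"),
]

DEC_MCS_CATEGORIES = [
    (128, 159, "extended control character"),
    (160, 160, "reserved"),
    (161, 191, "supplemental special graphics character"),
    (192, 254, "supplemental multinational letter"),
    (255, 255, "reserved"),
]


def categorize(code: int) -> str:
    """Return the table category label for a decimal character code."""
    if not 0 <= code <= 255:
        raise ValueError(f"code must be in the range 0-255, got {code}")
    for first, last, label in ASCII_CATEGORIES + DEC_MCS_CATEGORIES:
        if first <= code <= last:
            return label
    # The ranges above cover 0-255 without gaps, so this is never reached.
    raise AssertionError("unreachable")


if __name__ == "__main__":
    for code in (9, 32, 63, 65, 126, 127, 128, 160, 169, 209, 255):
        print(code, categorize(code))
```

Because the two lists together cover every code from 0 to 255 exactly once, each valid code maps to one category, and any value outside the 8-bit range is rejected.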
