Work With Character Sets - BEA WebLogic User Manual

Mobility server

Hide thumbs Also See for WebLogic:

User manual (294 pages)

Manual (88 pages)

User manual (94 pages)

Table Of Contents

100

101

102

103

104

105

106

107

108

109

110

111

112

113

114

115

116

117

118

119

120

121

122

123

124

125

126

127

128

129

130

131

132

133

134

135

136

137

138

139

140

141

142

143

144

145

146

147

148

149

150

151

152

153

154

155

156

157

158

159

160

161

162

163

164

165

166

167

168

169

170

171

172

173

174

175

176

177

178

179

180

181

182

183

184

185

186

187

188

189

190

191

192

193

194

195

196

197

198

199

200

201

202

203

204

205

206

207

208

209

210

211

page of 211

/ 211
Contents
Table of Contents
Bookmarks

Table of Contents

Part IV Presentation of Mobile Content

Note: You must include the alt attribute on each

where="ImgWBMPSupported" ...>

Work with Character Sets

Character encoding is an algorithmic process that specifies how human-readable characters are

converted into bytes for storage or transmission. Characters in a language (or set of languages)

are mapped to numbers represented by bytes (or octets). Character decoding is the process of

converting bytes into characters.

To avoid encoding errors during the process of storing, transmitting and displaying a document

on the web, a single consistent method of encoding / decoding should be used throughout.

This document explains how to avoid and resolve encoding problems.

About Character Encoding/Decoding

Character encoding is a method of converting characters into bytes and decoding is a method of

converting bytes into characters.

The standard character set for computers has traditionally been ASCII (American Standard Code

for Information Interchange). No provision is made in ASCII for foreign characters or specialized

symbols. Hence, various so-called "extended ASCII" sets have been developed to provide these

symbols. However, the Web has adopted an extended character set, ISO 8859-1 (otherwise

known as ISO Latin-1), as its standard.

In addition, to avoid a preference for one language over another, HTML 4.0 has adopted Unicode

as its official document character set. Unicode is attempting to create a single character set under

which every character, from every language in every region can be represented.

Encode Mechanisms

An application must select a character encoding / decoding method when it is opening, validating

or displaying a HTML document. For documents in English and most other Western European

languages, the character encoding ISO-8859-1 is typically used.

There are a number of mechanisms within the HTTP, XML and HTML protocols for specifying

the character encoding:

•

Unicode encoded-documents commonly use Byte Order Marks (BOM) to inform the

decoding software which algorithm needs to be used to decode the byte stream correctly. This

is simply a set of defined lead bytes that mark the stream as being of a particular type.

•

The HTTP protocol defines a response header called "Content-Type" which can include the

character set name as part of its value (when the mime-type is text/*). The HTTP server needs

to be configured to set this header. For example, to specify that an HTML document uses

ISO-8859-1, a server would send the following header:

Content-Type: text/html; charset=ISO-8859-1

In XML, the XML declaration can contain the document encoding:

<?xml version="1.0" encoding="ISO-8859-1"?>

•

In HTML, a <meta> tag can be used to define the document encoding

charset=ISO-8859-1" />

108 - BEA WebLogic Mobility Server User Guide

<mm:img>

as the final image in the list.

tag, and you must specify

<mm:img

Table of Contents

Need help?

Do you have a question about the WebLogic and is the answer not in the manual?

This manual is also suitable for:

Weblogic mobility server

Work With Character Sets - BEA WebLogic User Manual

Work with Character Sets

Need help?

Subscribe to Our Youtube Channel

Related Manuals for BEA WebLogic

This manual is also suitable for:

Table of Contents