Intel PXA255 User Manual: Intel® XScale™ Microarchitecture, Optimization Guide
A.4.1 Instruction Cache

The Intel® XScale™ core has separate instruction and data caches. Only fetched instructions are held in the instruction cache, even though data and instructions may reside in the same memory space. Functionally, the instruction cache is either enabled or disabled, and there is no performance benefit in leaving it disabled. The one exception is the routine that locks code into the instruction cache: that routine must itself execute from non-cached memory.
A.4.1.1 Cache Miss Cost

Intel® XScale™ core performance depends heavily on keeping the cache miss rate low. The cache miss penalty becomes significant when the core runs much faster than external memory; in that case, executing non-cached instructions severely curtails performance, so it is important to do everything possible to minimize cache misses.
A.4.1.2 Round Robin Replacement Cache Policy

Both the data and the instruction caches use a round robin replacement policy to evict a cache line. The simple consequence is that, for any non-trivial program, every line will eventually be evicted. The less obvious consequence is that predicting when evictions take place, and which cache lines they affect, is very difficult; this information must be gained by experimentation using performance profiling.
A.4.1.3 Code Placement to Reduce Cache Misses

Code placement can greatly affect cache misses. One way to view the cache is as 32 sets of 32 bytes, spanning an address range of 1024 bytes. When running, the code maps modulo 1024 bytes into this cache space (see Figure 6-1 on page 6-2). Any sets that are overused will thrash the cache. The ideal situation is for the software tools to distribute the code evenly, in time, over this space.
This is very difficult, if not impossible, for a compiler to do. Most of the input needed to estimate how best to distribute the code comes from profiling, followed by compiler-based two-pass optimization.
A.4.1.4 Locking Code into the Instruction Cache

One very important instruction cache feature is the ability to lock code into the instruction cache. Once locked, the code is always available for fast execution. Another reason for locking critical code into the cache is that, under the round robin replacement policy, the code will eventually be evicted even if it is a very frequently executed function. Key code components to consider for locking are:
- Interrupt handlers
- Real-time clock handlers
- OS critical code
- Time-critical application code
The disadvantage of locking code into the cache is that it reduces the cache size available to the rest of the program. How much code to lock is very application dependent and requires experimentation to optimize.
Intel® XScale™ Microarchitecture User's Manual: Optimization Guide
