Stall/Writeback Statistics Mode - Intel PXA255 User Manual

XScale microarchitecture

is high, possibly due to starvation, these Data Cache buffers will become full. This performance monitoring mode is provided to see if the Intel® XScale™ core is being starved of the bus external to the Intel® XScale™ core.

PMN0 accumulates the number of clock cycles the processor is stalled due to this condition, and PMN1 counts the number of times this condition occurs.

Statistics derived from these two events:

• The average number of cycles the processor stalled on a data-cache access that may overflow the data-cache buffers. This is calculated by dividing PMN0 by PMN1. This statistic shows whether the stall cycles are spread across many requests or are attributable to just a few. If the average is high, the Intel® XScale™ core may be starved of access to the application processor internal bus by other bus activity, e.g. companion chip bus cycles.

• The percentage of total execution cycles the processor stalled because a Data Cache request buffer was not available. This is calculated by dividing PMN0 by CCNT, which was used to measure total execution time.
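
The two statistics above can be sketched as a short calculation. This is a minimal illustration, assuming the raw PMN0, PMN1 and CCNT values have already been read out and that no counter overflowed during the measurement; the function name and the sample values are hypothetical and do not come from the manual.

```python
def data_cache_buffer_full_stats(pmn0, pmn1, ccnt):
    """Return (average stall cycles per occurrence, percent of cycles stalled).

    pmn0: stall cycles counted while the data cache buffers were full
    pmn1: number of times the buffers-full condition occurred
    ccnt: total clock cycles over the same measurement interval
    """
    # Guard against division by zero when an event never occurred.
    avg_stall = pmn0 / pmn1 if pmn1 else 0.0
    pct_stalled = 100.0 * pmn0 / ccnt if ccnt else 0.0
    return avg_stall, pct_stalled


# Illustrative counter values only (not measured data).
avg, pct = data_cache_buffer_full_stats(pmn0=1200, pmn1=40, ccnt=60000)
print(f"average stall: {avg:.1f} cycles, stalled: {pct:.1f}% of execution")
```

A high average combined with a low occurrence count points to a few long stalls, which is the pattern the text associates with bus starvation.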

8.5.5 Stall/Writeback Statistics Mode

When an instruction requires the result of a previous instruction and that result is not yet available, the Intel® XScale™ core stalls in order to preserve the correct data dependencies. PMN0 counts the number of stall cycles due to data dependencies. Not all data dependencies cause a stall; only the following dependencies cause such a stall penalty:

• Load-use penalty: attempting to use the result of a load before the load completes. To avoid the penalty, software should delay using the result of a load until it is available. This penalty shows the latency effect of data-cache access.

• Multiply/Accumulate-use penalty: attempting to use the result of a multiply or multiply-accumulate operation before the operation completes. Again, to avoid the penalty, software should delay using the result until it is available.

• ALU-use penalty: there are a few isolated cases where back-to-back ALU operations may result in a one-cycle delay in execution. These cases are defined in Chapter 11, "Performance Considerations".

PMN1 counts the number of writeback operations emitted by the data cache. These writebacks occur when the data cache evicts a dirty line of data to make room for a newly requested line, or as the result of a clean operation (CP15, register 7).

Statistics derived from these two events:

• The percentage of total execution cycles the processor stalled because of a data dependency. This is calculated by dividing PMN0 by CCNT, which was used to measure total execution time. Often a compiler can reschedule code to avoid these penalties when given the right optimization switches.

• The total number of data writeback requests to external memory, which can be derived solely from PMN1.
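
As a rough sketch of how these two statistics might be computed from raw counter values: the function name and the sample numbers below are hypothetical, and the code assumes the counters were read over the same interval without overflowing.

```python
def stall_writeback_stats(pmn0, pmn1, ccnt):
    """Summarize Stall/Writeback Statistics Mode counters.

    pmn0: stall cycles caused by data dependencies
    pmn1: writeback operations emitted by the data cache
    ccnt: total clock cycles over the same measurement interval
    """
    # Percentage of execution cycles lost to data-dependency stalls.
    stall_pct = 100.0 * pmn0 / ccnt if ccnt else 0.0
    # The writeback total needs no further arithmetic: it is PMN1 itself.
    return {"stall_pct": stall_pct, "writebacks": pmn1}


# Illustrative counter values only (not measured data).
stats = stall_writeback_stats(pmn0=4500, pmn1=320, ccnt=90000)
print(f"dependency stalls: {stats['stall_pct']:.1f}% of cycles, "
      f"writebacks: {stats['writebacks']}")
```

Comparing the stall percentage before and after changing compiler optimization switches is one way to check whether rescheduling actually removed the penalties.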

Intel® XScale™ Microarchitecture User's Manual

Performance Monitoring

