Cache Misses - CS 61C Course Notes

1Learning Outcomes¶

Define the three types of cache misses: compulsory miss, capacity miss, and conflict miss.
Explain how conflict miss helps evaluate our cache associativity.

🎥 Lecture Video

The classical approach to improving cache behavior is to reduce miss rates.

To gain better insights into the causes of misses, we first categorize misses into three categories:

Compulsory Miss: Caused by the first access to a line that has never been in the cache. The very first access to a block cannot be in the cache, so the block must be brought in. Also called cold-start misses or first-reference misses.
Capacity Miss: Caused when the cache cannot contain all the lines needed during the execution of a program. Occur when lines were in the cache, replaced, and later retrieved.
Conflict Miss: Multiple lines compete for the same location in the cache, even when the cache has not reached full capacity. These misses are also called collision misses.

2Categorizing Misses¶

To categorize misses, we can first run an address trace against a set of caches, then perform the following steps^[1]:

First, consider an infinite-size, fully-associative cache. For every miss that occurs now, consider it a compulsory miss.
Next, consider a finite-sized cache (of the size you want to examine) with full-associativity. Every miss that is not in #1 is a capacity miss.
Finally, consider a finite-size cache with finite-associativity. All of the remaining misses that are not #1 or #2 are conflict misses.
Conflict misses occur only with set-associative or direct-mapped caches. Conflict misses are those that occur going from fully associative to, say, 8-way associative, 4-way associative, and so on.

Show Answer

Compulsory misses
Capacity misses

Show Answer

Compulsory misses
Capacity misses
Conflict misses

A slightly more precise definition of conflict misses is therefore:

Misses that would not occur if the cache was fully-associative and had LRU replacement, (with all other noncompulsory misses being capacity).

We will see in the next section that this definition does not precisely hold water for specific memory accesses, but it is sufficient to categorize conflict miss rates in general.

3Conflict misses, in detail¶

This section is useful if you are still confused about the difference between the two non-compulsory misses: capacity misses and conflict misses. However, the precise details of the example are out of scope.

3.1Example¶

Let’s compare the hit patterns of a few cache types. The below diagram shows the hit/miss pattern of various caches when run on a Matrix Multiply Example.

FA 1M (Fully Associative, 1 million blocks. This is our “infinite” cache with as many hits as possible)
FA 4 blocks (Fully Associative)
2SA 4 blocks (Set Associative)
DM 4 blocks (Direct Mapped)

Figure 1 illustrates each memory access as columns going from left to right. Each column is a hit or a miss in each of the four caches.

"Cache-access sequence graphic marking misses in red and hits in white to contrast miss categories for fully associative cache with 1M blocks, fully associative cache with 4 blocks, 2-way-set-associative cache with 4 blocks, and direct mapped cache with 4 blocks." — Figure 1:Cache misses are in red, and cache hits are transparent (in white).

As we go through each of the three cache misses by definition, we realize there is a “fourth” category of misses that we have not covered. See the animation below.

Our current definition of conflict misses assumes that all misses in equivalent FA cache also cause misses in lower associativities. As we see in our example, sometimes we get “conflict hits” because a block that would have been replaced ends up staying in the cache longer. This behavior depends on the chosen replacement policy and the precise ordering of memory accesses.

3.2In practice: Conflict miss rates¶

In practice, computer architects do not look at every single instruction to analyze the type of miss. This is expensive and seems unnecessary. Rather, memory access performance is analyzed as aggregate miss rates against program benchmarks.

Define the conflict miss rate as the total measured miss rate, minus the miss rate due to compulsory misses, minus the fully associative LRU non-compulsory miss rate.

This is the definition that most benchmarkers use for several reasons:

It is easy to calculate.
The definition hinges not on the misses of any given memory access, but rather the aggregate misses incurred on our program benchmark across different cache designs.
This definition truly characterizes what we care about: how associativity impacts cache performance via cache misses: not a miss because of cold caches, neither a miss because we maxed out associativity.

For some examples, see our section on cache optimizations.

3.3Possible redefinition¶

This section is out of scope. If we truly wanted to be precise about misses, we could discuss a second definition of conflict misses by memory access:

Define a conflict miss as a miss that would not have happened if the cache was fully associative under at least one “consistent” replacement policy. Define a “consistent” replacement policy as one that keeps all the data in the lower-associativity cache, i.e., any replacement policy such that the data in the fully associative cache is a superset of the data in our target cache. This definition guarantees that we don’t have “conflict hits”, but also tends to classify misses more as capacity misses than as conflict misses.

In practice, this second definition is not used. Again, in performance we care more about average runtime and overall categories of misses. So the first definition is the one you should take away should you strive to evaluate memory access performance.

Footnotes¶

Thanks Professor Kubiatowicz for this algorithm.
↩