This paper presents and analyzes two architectural techniques that aim to remove the Translation Lookaside Buffer (TLB) access delay from the critical path of scalar processors with physically addressed caches. The first technique, parallel address translation, masks the TLB access delay by using a set-associative virtual memory map to extend the cache size beyond the product of the cache associativity and the virtual memory page size. The second technique, lazy address translation, bypasses the TLB access completely by using the base register and offset of a memory reference as a caching mechanism for the corresponding physical page; consequently, a TLB access is needed only when this caching scheme fails. A trace-driven simulation study shows that, under the given workload, the parallel address translation scheme works best when the virtual memory is 16-way set associative, and that the penalty on the average cycles per instruction (CPI) due to lazy address translation is less than 1.3%.
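The size limit that parallel address translation is meant to relax can be illustrated with a little arithmetic. When a physically addressed cache is indexed in parallel with the TLB lookup, the set index must come from the page-offset bits, which caps the cache at associativity times page size. The sketch below is only an illustration of that well-known constraint, not the paper's mechanism; the function names and the 4 KiB page size are assumptions chosen for the example.

```python
def max_parallel_cache_size(page_size: int, associativity: int) -> int:
    """Largest cache (in bytes) whose set index fits entirely in the page offset."""
    return page_size * associativity


def index_bits_above_page_offset(cache_size: int, page_size: int, associativity: int) -> int:
    """Number of set-index bits that lie above the page offset (0 if none).

    Assumes all sizes are powers of two. These are the bits that would
    normally have to wait for address translation.
    """
    way_size = cache_size // associativity  # bytes covered by one way
    if way_size <= page_size:
        return 0
    return (way_size // page_size).bit_length() - 1  # log2 for powers of two


if __name__ == "__main__":
    PAGE = 4096  # hypothetical 4 KiB page, not taken from the paper
    print(max_parallel_cache_size(PAGE, 2))
    print(index_bits_above_page_offset(64 * 1024, PAGE, 2))
```

With these assumed numbers, a 2-way cache is capped at 8 KiB, and a 64 KiB 2-way cache would need 3 index bits that come from the page number rather than the page offset.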
Chiueh, T.-c., & Katz, R. H. (1992). Eliminating the address translation bottleneck for physical address cache. In International Conference on Architectural Support for Programming Languages and Operating Systems - ASPLOS (Vol. 27, pp. 137–148). ACM. https://doi.org/10.1145/143371.143501