IBM®
Skip to main content
    Country/region [change]    Terms of use
 
 
 
    Home    Products    Services & solutions    Support & downloads    My account    

IBM Journal of Research and Development

Exploratory Systems Research   Volume 50, Number 2/3, 2005
Table of contents: HTMLPDF This article: HTMLPDF   Copyright info

Decomposing the load–store queue by function for power reduction and scalability - References

by L. Baugh
and C. Zilles
References

  1. A. Roth, “A High-Bandwidth Load-Store Unit for Single- and Multi-Threaded Processors,” Technical Report MS-CIS-04-09, University of Pennsylvania, Philadelphia, PA 19104, 2004.
  2. H. W. Cain and M. H. Lipasti, “Memory Ordering: A Value-Based Approach,” Proceedings of the 31st Annual International Symposium on Computer Architecture, 2004, pp. 90–101.
  3. A. Roth, “Store Vulnerability Window (SVW): Re-Execution Filtering for Enhanced Load Optimization,” Proceedings of the 32nd Annual International Symposium on Computer Architecture, 2005, pp. 458–468.
  4. Il Park, C.-L. Ooi, and T. N. Vijaykumar, “Reducing Design Complexity of the Load-Store Queue,” Proceedings of the 36th Annual IEEE/ACM International Symposium on Microarchitecture, 2003, pp. 411–422.
  5. G. Z. Chrysos and J. S. Emer, “Memory Dependence Prediction Using Store Sets,” Proceedings of the 25th Annual International Symposium on Computer Architecture, 1998, pp. 142–153.
  6. S. Sethumadhavan, R. Desikan, D. Burger, C. R. Moore, and S. W. Keckler, “Scalable Hardware Memory Disambiguation for High-ILP Processors,” Proceedings of the 36th Annual IEEE/ACM International Symposium on Microarchitecture, 2003, pp. 399–410.
  7. H. Akkary, R. Rajwar, and S. T. Srinivasan, “Checkpoint Processing and Recovery: Towards Scalable Large Instruction Window Processors,” Proceedings of the 36th Annual IEEE/ACM International Symposium on Microarchitecture, 2003, pp. 423–434.
  8. E. F. Torres, P. Ibanez, V. Vinals, and J. M. Llaberia, “Store Buffer Design in First-Level Multibanked Data Caches,” Proceedings of the 32nd Annual International Symposium on Computer Architecture, 2005, pp. 469–480.
  9. R. E. Kessler, “The Alpha 21264 Microprocessor,” IEEE Micro 19, No. 2, 24–36 (1999).
  10. M. F. Chowdhury and D. M. Carmean, “Method, Apparatus, and System for Maintaining Processor Ordering by Checking Load Addresses of Unretired Load Instructions Against Snooping Store Addresses,” U.S. Patent Application 6484254, November 2002.
  11. V. Adve, C. Lattner, M. Brukman, A. Shukla, and B. Gaeke, “LLVA: A Low-Level Virtual Instruction Set Architecture,” Proceedings of the 36th Annual IEEE/ACM International Symposium on Microarchitecture, 2003, pp. 205–216.
  12. K. Ebcioglu and E. R. Altman, “DAISY: Dynamic Compilation for 100% Architectural Compatibility,” Proceedings of the 24th Annual International Symposium on Computer Architecture, 1997, pp. 26–37.
  13. B. Fahs, S. Bose, M. Crum, B. Slechta, F. Spadini, T. Tung, S. J. Patel, and S. S. Lumetta, “Performance Characterization of a Hardware Framework for Dynamic Optimization,” Proceedings of the 34th Annual IEEE/ACM International Symposium on Microarchitecture, 2001, pp. 16–27.
  14. A. Klaiber, “The Technology Behind Crusoe Processors,” white paper, Transmeta Corporation, 3990 Freedom Circle, Santa Clara, CA 95054, January 2000.
  15. C. Zilles and G. Sohi, “Master/Slave Speculative Parallelization,” Proceedings of the 35th Annual IEEE/ACM International Symposium on Microarchitecture, 2002, pp. 85–96.
  16. C. N. Keltcher, K. J. McGrath, A. Ahmed, and P. Conway, “The AMD Opteron Processor for Multiprocessor Servers,” IEEE Micro 23, No. 2, 66–76 (2003).
  17. T. Austin, E. Larson, and D. Ernst, “SimpleScalar: An Infrastructure for Computer System Modeling,” IEEE Computer 35, No. 2, 59–67 (2002).
  18. P. Shivakumar and N. Jouppi, “CACTI 3.0: An Integrated Cache Timing, Power, and Area Model,” Technical Report 2001/2, COMPAQ Western Research Laboratory, Palo Alto, CA 94301, 2001.


    About IBMPrivacyContact