dc.contributor.author |
Singh, Sarabjeet |
|
dc.contributor.author |
Surana, Neelam |
|
dc.contributor.author |
Prasad, Kailash |
|
dc.contributor.author |
Jain, Pranjali |
|
dc.contributor.author |
Mekie, Joycee |
|
dc.contributor.author |
Awasthi, Manu |
|
dc.coverage.spatial |
United States of America |
|
dc.date.accessioned |
2023-07-06T15:05:54Z |
|
dc.date.available |
2023-07-06T15:05:54Z |
|
dc.date.issued |
2023-06 |
|
dc.identifier.citation |
Singh, Sarabjeet; Surana, Neelam; Prasad, Kailash; Jain, Pranjali; Mekie, Joycee and Awasthi, Manu, "HyGain: high-performance, energy-efficient hybrid gain cell-based cache hierarchy", ACM Transactions on Architecture and Code Optimization, DOI: 10.1145/3572839, vol. 20, no. 2, Jun. 2023. |
|
dc.identifier.issn |
1544-3566 |
|
dc.identifier.issn |
1544-3973 |
|
dc.identifier.uri |
https://doi.org/10.1145/3572839 |
|
dc.identifier.uri |
https://repository.iitgn.ac.in/handle/123456789/8999 |
|
dc.description.abstract |
In this article, we propose a "full-stack" solution to designing high-apacity and low-latency on-chip cache hierarchies by starting at the circuit level of the hardware design stack. We propose a novel half VDD precharge 2T Gain Cell (GC) design for the cache hierarchy. The GC has several desirable characteristics, including ~50% higher storage density and ~50% lower dynamic energy as compared to the traditional 6T SRAM, even after accounting for peripheral circuit overheads. We also demonstrate data retention time of 350 us (~17.5� of eDRAM) at 28 nm technology with VDD = 0.9V and temperature = 27�C that, combined with optimizations like staggered refresh, makes it an ideal candidate to architect all levels of on-chip caches. We show that compared to 6T SRAM, for a given area budget, GC-based caches, on average, provide 30% and 36% increase in IPC for single- and multi-programmed workloads, respectively, on contemporary workloads, including SPEC CPU 2017. We also observe dynamic energy savings of 42% and 34% for single- and multi-programmed workloads, respectively. Finally, in a quest to utilize the best of all worlds, we combine GC with STT-RAM to create hybrid hierarchies. We show that a hybrid hierarchy with GC caches at L1 and L2 and an LLC split between GC and STT-RAM is able to provide a 46% benefit in energy-delay product (EDP) as compared to an all-SRAM design, and 13% as compared to an all-GC cache hierarchy, averaged across multi-programmed workloads. |
|
dc.description.statementofresponsibility |
by Sarabjeet Singh, Neelam Surana, Kailash Prasad, Pranjali Jain, Joycee Mekie and Manu Awasthi |
|
dc.format.extent |
vol. 20, no. 2 |
|
dc.language.iso |
en_US |
|
dc.publisher |
Association for Computing Machinery |
|
dc.subject |
HyGain |
|
dc.subject |
eDRAM |
|
dc.subject |
EDP |
|
dc.subject |
GC cache |
|
dc.subject |
SPEC CPU |
|
dc.title |
HyGain: high-performance, energy-efficient hybrid gain cell-based cache hierarchy |
|
dc.type |
Article |
|
dc.relation.journal |
ACM Transactions on Architecture and Code Optimization |
|