4.6 Article

RAW-Tag: Replicating in Altered Cache Ways for Correcting Multiple-Bit Errors in Tag Array

Journal

Publisher

IEEE COMPUTER SOC
DOI: 10.1109/TDSC.2017.2706263

Keywords

Cache memories; error-checking; multiple-bit upset; redundant design; soft errors; tag replication

Ask authors/readers for more resources

Tag array in on-chip caches is one of the most vulnerable components to radiation-induced soft errors. Protecting the tag array in some processors is limited to error detection using the parity check, since the overheads of error correcting codes are not affordable in this component. State-of-the-art tag protection schemes combine the parity check with replication to provide error correction capability. Classifying these replication-based schemes into partial-replication and full-replication, the former offers a low overhead protection in which a large fraction of detectable errors remain uncorrectable, whereas the latter imposes a significant overhead to correct all of the errors. This paper proposes a low overhead full-replication scheme, so called Replicating in Altered Ways of Tag (RAW-Tag), to correct all detectable errors. RAW-Tag manipulates the cache replacement algorithm and keeps track of the incoming/evicting cache lines to not only provide a replica for all tags, but also eliminate the simultaneous susceptibility of both a tag and its replica to a single Multiple-Bit Upset (MBU). The simulation results show that RAW-Tag imposes no performance overhead and increases the energy consumption of L1 and L2 caches by only 6.6 and 0.3 percent, respectively, as compared with the baseline.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.6
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

Article Computer Science, Hardware & Architecture

TAMER: an adaptive task allocation method for aging reduction in multi-core embedded real-time systems

Faezeh Sadat Saadatmand, Nezam Rohbani, Farshad Baharvand, Hamed Farbeh

Summary: This paper proposes a system-level aging mitigation method, TAMER, which smooths the temperature pattern inside the chip by considering core utilization and internal units' activity, preventing the occurrence of hotspots. Experimental results show that TAMER reduces the temperature standard deviation of the cores by 56% and 37% compared to the previous algorithm, without imposing any additional overhead on the system.

JOURNAL OF SUPERCOMPUTING (2021)

Article Computer Science, Hardware & Architecture

A System-Level Framework for Analytical and Empirical Reliability Exploration of STT-MRAM Caches

Elham Cheshmikhani, Hamed Farbeh, Hossein Asadi

IEEE TRANSACTIONS ON RELIABILITY (2020)

Article Computer Science, Hardware & Architecture

ECC-United Cache: Maximizing Efficiency of Error Detection/Correction Codes in Associative Cache Memories

Hamed Farbeh, Leila Delshadtehrani, Hyeonggyu Kim, Soontae Kim

Summary: The ECC-United Cache (EUC) architecture improves the efficiency of Error Detection/Correction Codes (EDCs/ECCs) in set-associative L1 caches by extending the data protection granularity and providing flexible protection capabilities. EUC can reduce the number of check bits or increase burst error detection/correction capability, offering a trade-off between overhead and protection capability.

IEEE TRANSACTIONS ON COMPUTERS (2021)

Article Engineering, Electrical & Electronic

A link adaptation scheme for reliable downlink communications in narrowband IoT

Farshid Sanei, Hamed Farbeh

Summary: NB-IoT is a new technology introduced by 3GPP to meet the requirements of LPWAN. The proposed link adaptation scheme aims to enhance network coverage by repeating data transmission and control signals, reducing resource consumption and active time.

MICROELECTRONICS JOURNAL (2021)

Article Computer Science, Hardware & Architecture

LETHOR: a thermal-aware proactive routing algorithm for 3D NoCs with less entrance to hot regions

Maede Safari, Zahra Shirmohammadi, Nezam Rohbani, Hamed Farbeh

Summary: This paper proposes an efficient proactive thermal-aware routing algorithm called LETHOR to reduce the temperature of 3D NoCs. By considering the temperature information of all nodes in each layer, LETHOR can decrease the standard deviation of chip temperature, statistical traffic load distribution, and global average delay compared to the state-of-the-art routing algorithm.

JOURNAL OF SUPERCOMPUTING (2022)

Article Computer Science, Hardware & Architecture

Data block manipulation for error rate reduction in STT-MRAM based main memory

Nooshin Mahdavi, Farhad Razaghian, Hamed Farbeh

Summary: This paper proposes a low-cost microarchitectural technique to mitigate write failure and read disturbance in Spin-Transfer Torque Magnetic Random-Accesses Memory (STT-MRAM). By prewriting the blocks and using effective encoding, the reliability of STT-MRAM is improved.

JOURNAL OF SUPERCOMPUTING (2022)

Article Computer Science, Hardware & Architecture

An Architectural-Level Reliability Improvement Scheme in STT-MRAM Main Memory

Nooshin Mahdavi, Farhad Razaghian, Hamed Farbeh

Summary: This study improves the reliability of computer system main memory with minimal changes in architecture, reducing the probability of write and retention failures.

MICROPROCESSORS AND MICROSYSTEMS (2022)

Article Engineering, Electrical & Electronic

CRP: Conditional Replacement Policy for Reliability Enhancement of STT-MRAM Caches

Marjan Rahbari, Hamed Farbeh

Summary: This article introduces an error-aware cache replacement policy (CRP) to enhance the reliability of STT-MRAM caches. By reducing read disturbance and write failure rate, this policy effectively decreases the overall error rate while improving performance and energy efficiency.

IEEE TRANSACTIONS ON MAGNETICS (2022)

Article Computer Science, Hardware & Architecture

An adaptive data coding scheme for energy consumption reduction in SDN-based Internet of Things

Shahab Salehi, Hamed Farbeh, Alireza Rokhsari

Summary: The Internet of Things (IoT) and the Internet of Everything (IoE) are rapidly expanding, but their energy consumption and network management pose challenges. This study proposes a data manipulation method that reduces energy consumption and network traffic by minimizing data exchange. The efficiency of this method is enhanced using Software-Defined Networking (SDN). Simulation and experimental results demonstrate the effectiveness of the proposed method.

COMPUTER NETWORKS (2023)

Article Computer Science, Hardware & Architecture

GraphA: An efficient ReRAM-based architecture to accelerate large scale graph processing

Seyed Ali Ghasemi, Belal Jahannia, Hamed Farbeh

Summary: This paper proposes a ReRAM-based PIM architecture, GraphA, with a novel reordering algorithm and data mapping to improve performance and energy efficiency in graph analytics.

JOURNAL OF SYSTEMS ARCHITECTURE (2022)

Article Engineering, Electrical & Electronic

A fault-tolerant resource locking protocol for multiprocessor real-time systems

Seyede Sahebeh Nabavi, Hamed Farbeh

Summary: This paper presents the first fault-tolerant resource locking protocol for multiprocessor real-time systems. The protocol considers transient faults in shared resources and proposes a checkpointing-based fault-tolerant mechanism. It also solves the problem of priority inversion in periodic tasks. Simulation results show that the protocol can tolerate at least one transient fault with a 15% blocking time overhead compared to its non-fault-tolerant configuration.

MICROELECTRONICS JOURNAL (2023)

Proceedings Paper Automation & Control Systems

A Novel Neuromorphic Processors Realization of Spiking Deep Reinforcement Learning for Portfolio Management

Seyyed Amirhossein Saeidi, Forouzan Fallah, Soroush Barmaki, Hamed Farbeh

Summary: A spiking deep reinforcement learning (SDRL) algorithm optimized for Intel's Loihi neuromorphic processor is proposed in this study. It can predict financial markets in unpredictable environments and achieve portfolio management goals with reduced energy consumption and increased processing speed.

PROCEEDINGS OF THE 2022 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE 2022) (2022)

Article Computer Science, Hardware & Architecture

3RSeT: Read Disturbance Rate Reduction in STT-MRAM Caches by Selective Tag Comparison

Elham Cheshmikhani, Hamed Farbeh, Hossein Asadi

Summary: This article introduces Spin-Transfer Torque Magnetic RAM (STT-MRAM) as a promising replacement for SRAM in on-chip cache memories and proposes a low-cost scheme called 3RSeT to reduce the occurrence of read disturbance errors in STT-MRAM caches. The evaluations show that 3RSeT significantly reduces the read disturbance rate in the tag array, improves the Mean Time To Failure (MTTF), and reduces energy consumption.

IEEE TRANSACTIONS ON COMPUTERS (2022)

Proceedings Paper Automation & Control Systems

Enhancing Reliability of STT-MRAM Caches by Eliminating Read Disturbance Accumulation

Elham Cheshmikhani, Hamed Farbeh, Hossein Asadi

2019 DESIGN, AUTOMATION & TEST IN EUROPE CONFERENCE & EXHIBITION (DATE) (2019)

No Data Available