4.7 Article

BIC Codes: Bit Insertion-Based Constrained Codes With Error Correction for DNA Storage

Journal

IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTING
Volume 11, Issue 3, Pages 764-777

Publisher

IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC
DOI: 10.1109/TETC.2023.3268274

Keywords

Bit insertion technique; constrained coding; DNA storage; GC-content; low-density parity-check (LDPC) code; maximum homopolymer run; rate-compatible code


In this article, a new coding algorithm for DNA storage is proposed that applies to both error-free and error-prone channels. For the error-free case, a constrained code called the bit insertion-based constrained (BIC) code is proposed, which converts a binary data sequence into multiple oligo sequences satisfying the maximum homopolymer run (run-length, RL) constraint by inserting dummy bits. Combining a balancing technique with BIC codes yields a constrained coding algorithm that satisfies the GC-content constraint as well as the RL constraint. For the DNA storage channel with errors, the proposed constrained coding algorithm is integrated with a rate-compatible low-density parity-check (LDPC) code to correct errors and erasures. Simulation results show that the integrated coding algorithm outperforms existing coding algorithms in terms of information density and error correctability.
In this article, we propose a new coding algorithm for DNA storage over both error-free and error channels. For the error-free case, we propose a constrained code called the bit insertion-based constrained (BIC) code. BIC codes convert a binary data sequence to multiple oligo sequences satisfying the maximum homopolymer run (i.e., run-length (RL)) constraint by inserting dummy bits. We show that BIC codes nearly achieve the capacity in terms of information density, while their simple structure allows linear-time encoding and fast parallel decoding. Also, by combining a balancing technique with the BIC codes, we obtain a constrained coding algorithm that satisfies the GC-content constraint as well as the RL constraint. Next, for the DNA storage channel with errors, we integrate the proposed constrained coding algorithm with a rate-compatible low-density parity-check (LDPC) code to correct errors and erasures. Specifically, we incorporate the LDPC codes adopted in the 5G New Radio standard because they have powerful error-correction capability and appealing features for the integration. Simulation results show that the proposed integrated coding algorithm outperforms existing coding algorithms in terms of information density and error correctability.
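
As a rough illustration of the bit-insertion idea described in the abstract, the Python sketch below maps bits to nucleotides two at a time and, whenever a homopolymer run has reached the allowed maximum, spends one bit of the next nucleotide on a dummy value that forces a different base. This is not the BIC construction from the paper: the 2-bit-to-nucleotide mapping, the function names (`encode`, `decode`, `longest_run`, `gc_fraction`), the default `max_run=3`, and the rule for where the dummy bit goes are all assumptions made for illustration, and no GC balancing or error correction is attempted.

```python
# Hypothetical illustration only; not the BIC code from the paper.
NUCS = "ACGT"  # assumed mapping: index = 2*hi + lo, so 00->A, 01->C, 10->G, 11->T


def encode(bits, max_run=3):
    """Map bits to nucleotides, inserting a dummy bit when a run hits max_run."""
    out, run, i, n = [], 0, 0, len(bits)
    while i < n:
        if run == max_run:
            # Dummy high bit: complement of the previous base's high bit,
            # so the new base is guaranteed to differ from the previous one.
            hi = 1 - (NUCS.index(out[-1]) >> 1)
            lo = bits[i]
            i += 1
        else:
            hi = bits[i]
            i += 1
            lo = bits[i] if i < n else 0  # zero-pad an odd trailing bit
            i += 1
        nuc = NUCS[2 * hi + lo]
        run = run + 1 if out and out[-1] == nuc else 1
        out.append(nuc)
    return "".join(out)


def decode(dna, n_bits, max_run=3):
    """Invert encode(); n_bits is the original data length (drops padding)."""
    bits, run, prev = [], 0, None
    for nuc in dna:
        v = NUCS.index(nuc)
        hi, lo = v >> 1, v & 1
        if run == max_run:
            bits.append(lo)        # high bit was a dummy, discard it
        else:
            bits.extend((hi, lo))
        run = run + 1 if nuc == prev else 1
        prev = nuc
    return bits[:n_bits]


def longest_run(dna):
    """Length of the longest homopolymer run in dna."""
    best, cur, prev = 0, 0, None
    for nuc in dna:
        cur = cur + 1 if nuc == prev else 1
        best = max(best, cur)
        prev = nuc
    return best


def gc_fraction(dna):
    """Fraction of G/C bases; only measured here, not constrained."""
    return sum(c in "GC" for c in dna) / len(dna) if dna else 0.0
```

In this sketch the decoder needs no side information beyond the original bit length, because it can tell from the run length it has already seen which nucleotides carry a dummy bit. The cost is one lost data bit each time a run reaches `max_run`.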


Recommended

Article Computer Science, Information Systems

Optimal Power Allocation for Maximizing Energy Efficiency in DAS-Based IoT Network

Zaki Masood, Hosung Park, Han Seung Jang, Sunyong Yoo, Sokhee P. Jung, Yonghoon Choi

Summary: This article discusses the maximization of energy efficiency (EE) for distributed antenna systems based on simultaneous wireless information and power transfer (SWIPT), presenting a tradeoff between EE and spectral efficiency and proposing an algorithm to optimize power allocation. The proposed algorithm uses the Lagrangian multiplier method and the Karush-Kuhn-Tucker conditions to find the optimal solution without iterative computation, achieving maximum energy transfer through an optimal power splitting ratio.

IEEE SYSTEMS JOURNAL (2021)

Article Computer Science, Information Systems

Deep Learning-Based Detector for Dual Mode OFDM With Index Modulation

Junghyun Kim, Hyejin Ro, Hosung Park

Summary: In this letter, a deep learning-based detector DeepDM is proposed, which uses a concatenation of CNN and DNN to detect index bits and carrier bits separately. A loss function is used to approach the BER performance of the maximum likelihood detector. Through simulations, it is shown that DeepDM outperforms conventional detectors in terms of BER performance and computational complexity under the Rayleigh fading channel.

IEEE WIRELESS COMMUNICATIONS LETTERS (2021)

Article Computer Science, Information Systems

Distributed CRC scheme for low-complexity successive cancellation flip decoding of polar codes

Haseong Kim, Hyunjee Lee, Hosung Park

Summary: The paper proposes a novel decoding method that reduces computational complexity by using distributed CRC bits and decreases the number of estimations for information bits through early termination of decoding, while minimizing additional sorting operations. Simulation results demonstrate that this method significantly reduces computational complexity compared to conventional decoding.

ICT EXPRESS (2022)

Article Engineering, Electrical & Electronic

Optimization of SC-LDPC Codes for Window Decoding With Target Window Sizes

Hee-Youl Kwak, Jae-Won Kim, Hosung Park, Jong-Seon No

Summary: This paper proposes an optimization method for protograph-based spatially coupled LDPC codes under window decoding. By introducing a new performance metric, the window mean parameter, the finite-length performance of the codes is improved, and a code family optimized for different window sizes is provided. Further investigation of the structural characteristics of the codes explains the performance improvement and enhances the understanding of SC-LDPC codes for window decoding.

IEEE TRANSACTIONS ON COMMUNICATIONS (2022)

Article Computer Science, Information Systems

Design of Protograph LDPC Codes Using Resolvable Block Designs for Block Fading Channel

Jaewha Kim, Chanki Kim, Hosung Park, Jong-Seon No

Summary: This letter proposes new protograph low-density parity-check (LDPC) codes with high code rates for block fading (BF) channels using resolvable block designs (RBDs). To analyze the performance of the proposed LDPC codes, an upper bound on the bit error rate (BER) is derived using the novel method called gamma evolution. Finally, numerical results demonstrate that the frame error rate (FER) of the proposed LDPC codes has a slope approaching the channel outage probability.

IEEE WIRELESS COMMUNICATIONS LETTERS (2022)

Article Telecommunications

Neural Min-Sum Decoding for Generalized LDPC Codes

Hee-Youl Kwak, Jae-Won Kim, Yongjune Kim, Sang-Hyo Kim, Jong-Seon No

Summary: This letter investigates the min-sum (MS) and neural min-sum (NMS) decoding algorithms for generalized LDPC (GLDPC) codes. Employing the NMS decoding algorithm and optimizing the component parity-check matrix (PCM) improves the decoding performance.

IEEE COMMUNICATIONS LETTERS (2022)

Article Computer Science, Information Systems

Generative Adversarial Networks for DNA Storage Channel Simulator

Sanghoon Kang, Yunfei Gao, Jaeho Jeong, Seong-Joon Park, Jae-Won Kim, Jong-Seon No, Hahyeon Jeon, Jeong Wook Lee, Sunghwan Kim, Hosung Park, Albert No

Summary: DNA data storage systems have made rapid progress with the introduction of error-correcting techniques, random access algorithms, and query systems. However, designing algorithms for such systems is challenging due to the unpredictable nature of errors and the high cost of experiments. Therefore, the development of a simulator that can mimic the error statistics of a DNA storage system is of great interest.

IEEE ACCESS (2023)

Article Telecommunications

Toward Energy Efficient and Balanced User Associations and Power Allocations in Multiconnectivity-Enabled mmWave Networks

Kezhong Jin, Xuebing Cai, Jianing Du, Hosung Park, Zhenzhou Tang

Summary: This paper discusses the application of mmWave spectrum in 5G wireless communication systems and the challenges it faces. It proposes a multi-objective optimization problem and a novel algorithm to address the two key challenges in multiconnectivity enabled mmWave networks. Simulation results show that the proposed scheme achieves good performance in overall energy efficiency, fairness of user rate, and balance of mBSs traffic load.

IEEE TRANSACTIONS ON GREEN COMMUNICATIONS AND NETWORKING (2022)

Article Computer Science, Information Systems

The Cycle-Concentrating PEG Algorithm for Protograph Generalized LDPC Codes

Dae-Young Yun, Jae-Won Kim, Hee-Youl Kwak, Jong-Seon No

Summary: In this paper, the CC-PEG algorithm is proposed for lifting protograph GLDPC codes to improve decoding performance by concentrating local cycles at doped variable nodes (VNs) and avoiding them at vulnerable undoped VNs. The algorithm collects edges causing the maximum undoped girth and selects the edge with the lowest concentrated cycle metric. Simulation results show a frame error rate up to 20 times lower than that of conventional lifting algorithms.

IEEE ACCESS (2023)

Article Biochemical Research Methods

Reducing cost in DNA-based data storage by sequence analysis-aided soft information decoding of variable-length reads

Seong-Joon Park, Sunghwan Kim, Jaeho Jeong, Albert No, Jong-Seon No, Hosung Park

Summary: This study proposes encoding and decoding procedures for DNA storage. The encoding procedure utilizes a carefully designed low-density parity-check code to correct errors and dropouts efficiently. New clustering and alignment methods are applied to improve the decoding performance.

BIOINFORMATICS (2023)

Article Computer Science, Information Systems

Construction of Protograph-Based Partially Doped Generalized LDPC Codes

Jaewha Kim, Jae-Won Kim, Hee-Youl Kwak, Jong-Seon No

Summary: This paper proposes a new code design technique, called partial doping, for improving generalized low-density parity-check (GLDPC) codes. The proposed technique allows for finer tuning of the code structure, resulting in a higher degree of freedom for optimization. Experimental results show that the partially doped GLDPC codes constructed using this technique outperform conventional GLDPC codes and are competitive with state-of-the-art protograph-based LDPC codes over the binary erasure channel.

IEEE ACCESS (2022)

Article Computer Science, Information Systems

Optimizing Code Parameters of Finite-Length SC-LDPC Codes Using the Scaling Law

Hee-Youl Kwak, Jae-Won Kim, Jong-Seon No

Summary: This paper optimizes the code parameters of finite-length SC-LDPC codes by extending the scaling law without constraints, showing that the coupling width w directly affects the performance curve slope and exploring trade-offs inherent in the code parameters. The scaling law allows for the determination of the optimal code parameter set for best finite-length performance, which varies depending on the coupling and uncoupled code lengths.

IEEE ACCESS (2021)
