4.3 Article

Hardware Implementation of a Fixed-Point Decoder for Low-Density Lattice Codes

Publisher

SPRINGER
DOI: 10.1007/s11265-021-01735-2

Keywords

Low-density lattice codes; Gaussian mixture; Fixed-point arithmetic; Serial and parallel FPGA architecture; Hardware architecture; Pipelining

Funding

  1. Natural Sciences and Engineering Research Council of Canada (NSERC)

Ask authors/readers for more resources

This paper presents an FPGA implementation of a fixed-point LDLC decoder, which approximates Gaussian mixture messages to a single Gaussian for improved performance. A quantization study is conducted to determine the required number of bits, and efficient numerical methods are devised to approximate the non-linear functions. A novel pipelined LDLC decoder architecture is proposed, achieving high throughput by resource re-utilization and pipelining.
This paper describes a field-programmable gate array (FPGA) implementation of a fixed-point low-density lattice code (LDLC) decoder where the Gaussian mixture messages that are exchanged during the iterative decoding process are approximated to a single Gaussian. A detailed quantization study is first performed to find the minimum number of bits required for the fixed-point decoder to attain a frame error rate (FER) performance similar to floating-point. Then efficient numerical methods are devised to approximate the required non-linear functions. Finally, the paper presents a comparison of the performance of the different decoder architectures as well as a detailed analysis of the resource requirements and throughput trade-offs of the primary design blocks for the different architectures. A novel pipelined LDLC decoder architecture is proposed where resource re-utilization along with pipelining allows for a parallelism equivalent to 50 variable nodes on the target FPGA device. The pipelined architecture attains a throughput of 10.5 Msymbols/sec at a distance of 5 dB from capacity which is a 1.8x improvement in throughput compared to an implementation with 20 parallel variable nodes without pipelining. This implementation also achieves 24x improvement in throughput over a baseline serial decoder.

Authors

I am an author on this paper
Click your name to claim this paper and add it to your profile.

Reviews

Primary Rating

4.3
Not enough ratings

Secondary Ratings

Novelty
-
Significance
-
Scientific rigor
-
Rate this paper

Recommended

No Data Available
No Data Available