TY - JOUR
T1 - Hardware Design of Low-Power High-Throughput Sorting Unit
AU - Lin, Shih Hsiang
AU - Chen, Pei Yin
AU - Lin, Yu Ning
N1 - Publisher Copyright:
© 1968-2012 IEEE.
PY - 2017/8
Y1 - 2017/8
N2 - Sorting is one of the most fundamental topics in computer science. Since partial sorting with lower costs would be much more feasible than the complete sorting method for some applications, this paper focuses on the architecture sorting N values from M inputs. To meet the real-time processing requirement, hardware acceleration is commonly employed to enhance the performance. However, high-speed computation usually raises power consumption drastically.To overcome that problem, a low-power, high-throughput, and modular hardware design of partial sorting network is presented. By applying a pointer-like design, the comparing modules move the indexes of samples instead of moving the input data directly. Power dissipation is reduced by minimizing switching activities and signal transitions. To prevent unnecessary comparing of the large data set, an iterative architecture is proposed which uses both the low-power sorting module and a novel clipping mechanism. The proposed design was simulated with a 90-nm cell library and the results are compared to those of other works on hardware sorting. Experiment results show that power consumption is reduced by 64.9 percent and high-throughput performance can also be achieved.
AB - Sorting is one of the most fundamental topics in computer science. Since partial sorting with lower costs would be much more feasible than the complete sorting method for some applications, this paper focuses on the architecture sorting N values from M inputs. To meet the real-time processing requirement, hardware acceleration is commonly employed to enhance the performance. However, high-speed computation usually raises power consumption drastically.To overcome that problem, a low-power, high-throughput, and modular hardware design of partial sorting network is presented. By applying a pointer-like design, the comparing modules move the indexes of samples instead of moving the input data directly. Power dissipation is reduced by minimizing switching activities and signal transitions. To prevent unnecessary comparing of the large data set, an iterative architecture is proposed which uses both the low-power sorting module and a novel clipping mechanism. The proposed design was simulated with a 90-nm cell library and the results are compared to those of other works on hardware sorting. Experiment results show that power consumption is reduced by 64.9 percent and high-throughput performance can also be achieved.
UR - http://www.scopus.com/inward/record.url?scp=85028959276&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85028959276&partnerID=8YFLogxK
U2 - 10.1109/TC.2017.2672966
DO - 10.1109/TC.2017.2672966
M3 - Article
AN - SCOPUS:85028959276
SN - 0018-9340
VL - 66
SP - 1383
EP - 1395
JO - IEEE Transactions on Computers
JF - IEEE Transactions on Computers
IS - 8
M1 - 7862779
ER -