Pruning a DNN model leaves the weights highly sparse, and ReLU layers introduce sparsity in the activations. The SCNN accelerator exploits both forms of sparsity to reduce the number of computations performed. Separately, DNNs can operate at reduced bitwidth without degrading classification accuracy, and the BitFusion accelerator exploits this with a bitwidth-computation tradeoff that supports different inference performance points: the fewer bits per operand, the more computations each processing engine performs per cycle.

BitFuSCNN strives to combine the benefits of both sparsity and quantization in CNNs. The design is similar to that of SCNN, except that the operand bitwidths are variable: instead of SCNN's fixed 4x4 multiplier array operating on 16-bit values, BitFuSCNN uses a variable-bitwidth multiplier array that takes 16x16, 8x8, or 4x4 inputs at 2, 4, or 8 bits wide, respectively.
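As a rough sketch of this tradeoff (illustrative Python, not code from this repository; all names are hypothetical), one cycle of the multiplier array can be modeled as the outer (Cartesian) product of a weight vector and an activation vector whose length scales inversely with the operand bitwidth:

```python
import numpy as np

# Input vector length per side scales inversely with operand bitwidth,
# matching the 2-bit -> 16x16, 4-bit -> 8x8, 8-bit -> 4x4 configurations.
VECTOR_LENGTH = {2: 16, 4: 8, 8: 4}

def multiplier_array_cycle(weights, activations, bitwidth):
    """One cycle of the variable-bitwidth multiplier array: the full
    outer product of a weight vector and an activation vector, in the
    style of SCNN's Cartesian-product multiplier array."""
    n = VECTOR_LENGTH[bitwidth]
    assert len(weights) == len(activations) == n
    limit = 1 << bitwidth  # operands must fit in the selected bitwidth
    assert all(0 <= v < limit for v in list(weights) + list(activations))
    return np.outer(weights, activations)  # n*n partial products per cycle

# 8-bit mode: 4x4 inputs -> 16 products, like SCNN's fixed array.
print(multiplier_array_cycle([3, 5, 7, 9], [2, 4, 6, 8], bitwidth=8).shape)

# 2-bit mode: 16x16 inputs -> 256 products from the same hardware budget.
rng = np.random.default_rng(0)
w = rng.integers(0, 4, 16)
a = rng.integers(0, 4, 16)
print(multiplier_array_cycle(w, a, bitwidth=2).shape)
```

Note that the total operand bits per vector side stay constant across modes (16x2 = 8x4 = 4x8 = 32 bits), which is what lets the same hardware budget deliver more multiplies at lower precision.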