Baremetal-NN

Baremetal-NN is a tool for converting PyTorch models into raw C codes that can be executed standalone in a baremetal runtime on research chips.

Note: After a discussion with @iansseijelly, we decided to switch to the simpler way of assuming array will be contiguous, and therefore directly use shape to index into elements, instead of the more generic strided access. The previous strided implementation can be access on the "strided" branch.

Getting Started

Refer to the API Doc for an overview of the available datatypes and functions.

Run Test

Building for x86

first, we clean any previous builds

rm -rf ./build/

cmake -S ./ -B ./build/ -D CMAKE_BUILD_TYPE=Debug
cmake --build ./build/ --target tests
./build/tests/tests

Building for RISC-V

first, we clean any previous builds

rm -rf ./build/

# make sure $RISCV is set
cmake -S ./ -B ./build/ -D CMAKE_BUILD_TYPE=Debug -D CMAKE_TOOLCHAIN_FILE=./riscv-gcc.cmake
cmake --build ./build/ --target tests
spike ./build/tests/tests.elf

Building for RISC-V with Vector Support

first, we clean any previous builds

rm -rf ./build/

# make sure $RISCV is set
cmake -S ./ -B ./build/ -D CMAKE_BUILD_TYPE=Debug -D CMAKE_TOOLCHAIN_FILE=./riscv-gcc.cmake -D CONFIG_BACKEND_RISCV_V=ON
cmake --build ./build/ --target tests
spike --isa=rv64gcv_zicntr_zfh ./build/tests/tests.elf

Running with FP16 support

cmake -S ./ -B ./build/ -D CMAKE_BUILD_TYPE=Debug -D CMAKE_TOOLCHAIN_FILE=./riscv-gcc.cmake -D CONFIG_BACKEND_RISCV_V=ON -D CONFIG_BACKEND_RISCV_ZVFH=ON
cmake --build ./build/ --target tests
spike --isa=rv64gcv_zicntr_zfh_zvfh ./build/tests/tests.elf

Running with FP16 support with GCC<14.0

For GCC<14.0, it does not support the fp16 intrinsics, so we need to use the assembly implementation. (TO BE FIXED)

cmake -S ./ -B ./build/ -D CMAKE_BUILD_TYPE=Debug -D CMAKE_TOOLCHAIN_FILE=./riscv-gcc.cmake -D CONFIG_BACKEND_RISCV_V=ON -D RISCV_ZVFH=ON -D RISCV_V_ASM=ON
cmake --build ./build/ --target tests
spike --isa=rv64gcv_zicntr_zfh_zvfh ./build/tests/tests.elf

Building for RISC-V with Gemmini (Not working for now)

first, we clean any previous builds

rm -rf ./build/

cmake -S ./ -B ./build/ -D CMAKE_BUILD_TYPE=Debug -D CMAKE_TOOLCHAIN_FILE=./riscv-gcc.cmake -D GEMMINI=ON
cmake --build ./build/ --target all
spike --extension=gemmini ./build/tests/tests.elf

Building for K230 board

first, we clean any previous builds

rm -rf ./build/

cmake -S ./ -B ./build/ -G "Unix Makefiles" -D CMAKE_TOOLCHAIN_FILE=./k230-gcc.cmake -D CMAKE_BUILD_TYPE=Debug -D RISCV_V=ON -D RISCV_V_ASM=ON
cmake --build ./build/ --target all

Cleaning build files

cmake --build ./build/ --target clean

Cleaning CMake files

rm -rf ./build/

Supported config flags

CONFIG_DTYPE_ENABLE_F16: enable F16 support.

CONFIG_DTYPE_ENABLE_I32: enable I32 support.

CONFIG_BACKEND_RISCV_V: use RISC-V Vector backend.

CONFIG_BACKEND_RISCV_ZVFH: use RISC-V Vector Floating-Point Hardware for the FP16 operations.

CONFIG_DEBUG_RISCV_V_USE_REDOSUM: use REDOSUM for the reduction operation in RVV. By default, it uses REDUSUM.

Support matrix of backends and operators

Operator	Variants	Scalar CPU	RISC-V Vector
min	F16	✅	✅ (ZVFH)
	F32	✅	✅
max	F16	✅	✅ (ZVFH)
	F32	✅	✅
add	I32	✅	🔜
	F16	✅	✅ (ZVFH)
	F32	✅	✅
addscalar	F16	✅	✅ (ZVFH)
	F32	✅	✅
mul	F16	✅	❌ (ZVFH)
	F32	✅	❌
mulscalar	F16	✅	❌ (ZVFH)
	F32	✅	❌
matmul (mm)	F16	✅	✅ (ZVFH)
	F32	✅	✅
addmatmul (addmm)	I32	✅	🔜
	F16	✅	✅ (ZVFH)
	F32	✅	✅
linear	F16	✅	✅ (ZVFH)
	F32	✅	✅
elu	F16	✅
	F32	✅
relu	F16	✅	✅ (ZVFH)
	F32	✅	✅
tanh	F16	✅
	F32	✅
softmax	F16
	F32	✅
scaled_dot_product_attention	F16
	F32	⚠️

✅: supported

⚠️: partially supported, failing on some tests

❌: not supported

🔜: planned

Convert the model

python ./scripts/convert.py

the converter will dump out three files:

nn.h: stores the library definition.

operators.h: stores the operator definitions.

weights.h: stores the weights and biases of the network.

model.h: stores the code representation of the model forward pass.

Memory layout

Baremetal-NN uses the NHWC memory layout and supports up to 4-dimension tensor.

N: batch, H: height, W: width, C: channels

Code organization

The main header files are under nn/. The header files can be copied to your project and used as a library.

device/ provides a set of header files to use on different devices.

src/asm/ provides the assembly implementation for the RISC-V Vector backend for cases where the compiler does not support the fp16 intrinsics.

Function APIs

The function APIs are in general of the form:

void nn_operator<num>d_<dtype>(Tensor<num>D_<DTYPE> *out, const Tensor<num>D_<DTYPE> *in1, const Tensor<num>D_<DTYPE> *in2, ...<, additional arguments>);

num: the number of dimensions of the input tensors. The tensors are statically dimensioned and support up to 4 dimensions.

dtype: the datatype of the operands, such as i8, u16, f32.

out / in1 / in2 / ...: the pointer to the tensor.

additional arguments: additional arguments for the operator, such as scaling factors and the target dimension to operate on.

Stats

Star History

Acknowledgement

If you find this code useful, we would appreciate if you would cite it with the following:

@software{baremetal-nn,
  author = {Yufeng Chi},
  title = {{Baremetal-NN: A tool for running PyTorch models in resource-constrained embedded environments.}},
  url = {https://github.com/ucb-bar/Baremetal-NN},
  year = {2024},
  version = {0.2.0}
}

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

Baremetal-NN

Getting Started

Run Test

Building for x86

Building for RISC-V

Building for RISC-V with Vector Support

Building for RISC-V with Gemmini (Not working for now)

Building for K230 board

Cleaning build files

Cleaning CMake files

Supported config flags

Support matrix of backends and operators

Convert the model

Memory layout

Code organization

Function APIs

Stats

Star History

Acknowledgement

Files

README.md

Latest commit

History

README.md

File metadata and controls

Baremetal-NN

Getting Started

Run Test

Building for x86

Building for RISC-V

Building for RISC-V with Vector Support

Building for RISC-V with Gemmini (Not working for now)

Building for K230 board

Cleaning build files

Cleaning CMake files

Supported config flags

Support matrix of backends and operators

Convert the model

Memory layout

Code organization

Function APIs

Stats

Star History

Acknowledgement