From ae89d639751375500397ef37a4afb78319bfca70 Mon Sep 17 00:00:00 2001 From: Joseph Melber Date: Wed, 24 Apr 2024 16:50:39 -0600 Subject: [PATCH] Fixup section 5 (#1403) --- programming_guide/section-5/README.md | 7 ++++--- 1 file changed, 4 insertions(+), 3 deletions(-) diff --git a/programming_guide/section-5/README.md b/programming_guide/section-5/README.md index 8ecefa98c9..46786fe2a6 100644 --- a/programming_guide/section-5/README.md +++ b/programming_guide/section-5/README.md @@ -34,6 +34,10 @@ The [passthrough DMAs](../../programming_examples/basic/passthrough_dmas/) examp | [Vector Reduce Max](../../programming_examples/basic/vector_reduce_max/) | bfloat16 | Returns the maximum of all elements in a vector | | [Vector Reduce Min](../../programming_examples/basic/vector_reduce_min/) | bfloat16 | Returns the minimum of all elements in a vector | | [Vector Exp](../../programming_examples/basic/vector_exp/) | bfloat16 | Returns a vector representing $e^x$ of the inputs | +| [DMA Transpose](../../programming_examples/basic/dma_transpose/) | i32 | Transposes a matrix with the Shim DMA using `npu_dma_memcpy_nd` | +| [Single core GEMM](../../programming_examples/basic/matrix_multiplication/single_core) | bfloat16 | A single core matrix-matrix multiply | +| [Multi core GEMM](../../programming_examples/basic/matrix_multiplication/whole_array) | bfloat16 | A matrix-matrix multiply using 16 AIEs with operand broadcast. Uses a simple "accumulate in place" strategy | +| [GEMV](../../programming_examples/basic/matrix_multiplication/matrix_vector) | bfloat16 | A vector-matrix multiply returning a vector ## Machine Kearning Kernels @@ -43,9 +47,6 @@ The [passthrough DMAs](../../programming_examples/basic/passthrough_dmas/) examp | [Eltwise Mul](../../programming_examples/ml/eltwise_mul/) | i32 | An element by element multiplication of two vectors | | [ReLU](../../programming_examples/ml/relu/) | bfloat16 | Rectified linear unit (ReLU) activation function on a vector| | [Softmax](../../programming_examples/ml/softmax/) | bfloat16 | Softmax operation on a matrix | -| [Single core GEMM](../../programming_examples/basic/matrix_multiplication/single_core) | bfloat16 | A single core matrix-matrix multiply | -| [Multi core GEMM](../../programming_examples/basic/matrix_multiplication/whole_array) | bfloat16 | A matrix-matrix multiply using 16 AIEs with operand broadcast. Uses a simple "accumulate in place" strategy | -| [GEMV](../../programming_examples/basic/matrix_multiplication/matrix_vector) | bfloat16 | A vector-matrix multiply returning a vector | [Conv2D](../../programming_examples/ml/conv2d) | i8 | A single core 2D convolution for CNNs | | [Conv2D+ReLU](../../programming_examples/ml/conv2d_fused_relu) | i8 | A Conv2D with a ReLU fused at the vector register level |