Skip to content

Commit

Permalink
Merge pull request #164 from Burning1020/update-doc
Browse files Browse the repository at this point in the history
Signed-off-by: Kevin Wang <[email protected]>
  • Loading branch information
kevin-wangzefeng authored Jan 3, 2025
2 parents 4ad477e + 2a83121 commit 54d5d2e
Showing 1 changed file with 13 additions and 2 deletions.
15 changes: 13 additions & 2 deletions docs/wasm/How-to-run-Llama-3-8B-with-Kubernetes.md
Original file line number Diff line number Diff line change
Expand Up @@ -10,7 +10,7 @@ details in: https://www.secondstate.io/articles/wasm-runtime-agi/.

## How to run a llm inference application in Kuasar?

Since Kuasar v0.8.0, Kuasar wasm-sandboxer with `wasmedge` and `wasmedge_wasi_nn`
Since Kuasar v1.0.0, Kuasar wasm-sandboxer with `wasmedge` and `wasmedge_wasi_nn`
features allows your WasmEdge application use the ability of WASI API for
performing Machine Learning inference: https://github.com/WebAssembly/wasi-nn.

Expand All @@ -22,7 +22,6 @@ which introducing how to create an OpenAI-compatible API service for Llama-3-8B.
+ Install WasmEdge and plugins:
`curl -sSf https://raw.githubusercontent.com/WasmEdge/WasmEdge/master/utils/install.sh | bash -s -- -v 0.13.5 --plugins wasi_logging wasi_nn-ggml`


### 1. Build docker image

We already have an example docker image on dockerhub: `docker.io/kuasario/llama-api-server:v1`.
Expand Down Expand Up @@ -50,6 +49,18 @@ CMD ["llama-api-server.wasm", "--prompt-template", "llama-3-chat", "--ctx-size",
```
Build it with `docker build -t docker.io/kuasario/llama-api-server:v1 .`

Otherwise, you can use this DOCKERFILE if you are debugging while developing:
```dockerfile
FROM ubuntu:22.04
RUN apt-get update && apt-get install -y curl
RUN curl -LO https://huggingface.co/second-state/Llama-3-8B-Instruct-GGUF/resolve/main/Meta-Llama-3-8B-Instruct-Q5_K_M.gguf
RUN curl -LO https://github.com/second-state/chatbot-ui/releases/latest/download/chatbot-ui.tar.gz; tar xzf chatbot-ui.tar.gz; rm chatbot-ui.tar.gz
COPY *.wasm .
```

This would download all prerequisites to docker image cache, allowing you pay attention only on development.
`docker build -f Dockerfile -t docker.io/kuasario/llama-api-server:v2 $path_to_app`

### 2. Build and run Kuasar Wasm Sandboxer

```bash
Expand Down

0 comments on commit 54d5d2e

Please sign in to comment.