Add k8s with llama server example #39

Merged
merged 1 commit into second-state:main on Jan 8, 2024

Conversation

CaptainVincent
Member

I tried to start the server by applying a YAML file instead of using the kubectl.sh run command. However, I ran into an odd issue where the network connection was never established.

Therefore, I've submitted this working version as a pull request. Later, I'll push the apply-YAML approach to another branch as a backup, so the issue can be investigated after the OpenVINO task.

Member

juntao commented Jan 7, 2024

Hello, I am a code review bot on flows.network. Here are my reviews of code commits in this PR.


Commit 78d2f94261a6f45f39d694061c4fb2658374d994

The key change in this pull request is the addition of a Kubernetes example that demonstrates running a WasmEdge LLAMA chat server application with Containerd over Kubernetes. The pull request includes new files and modifications to existing files.

Potential problems that could be addressed include:

  1. The README.md file seems to be missing the output of the test command in the "Run LLAMA chat server app" section. This could provide more information to users who want to verify the success of the deployment.

  2. The .github/workflows/kubernetes-containerd-llama-server.yml file has a long shell script in the run step, which may make it difficult to read and maintain. Consider breaking it down into separate run steps or putting it in a separate shell script file.

  3. The llama_server_application.sh script seems to include hardcoded paths and environment variable assignments. It could be made more flexible and reusable by allowing users to configure these paths and variables.

  4. The use of sudo in the llama_server_application.sh script may cause problems if the script is run by a user without sudo privileges. Consider removing or rethinking the need for sudo in the script.

  5. The patch does not include any validation or checks after running the application in Kubernetes. It would be helpful to include some validation steps to ensure the application has been deployed successfully and is running as expected.
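Point 5 above could be covered by an extra step in the workflow after the deployment step. A minimal sketch of such a step follows; the deployment name, pod label, port, and endpoint path are all assumptions for illustration, not names taken from this PR:

```yaml
# Hypothetical verification step (resource names and port are assumptions):
- name: Verify llama server deployment
  run: |
    # Wait until the rollout completes or time out.
    kubectl rollout status deployment/llama-server --timeout=120s
    # Show the pods backing the deployment for the CI log.
    kubectl get pods -l app=llama-server
    # Probe the HTTP endpoint; -f makes curl fail on a non-2xx response.
    curl -sf http://localhost:8080/v1/models
```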

Overall, the changes seem to introduce a new Kubernetes example, but there are some areas for improvement to make the example more user-friendly and robust.
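Points 3 and 4 of the review could be addressed along these lines. This is a sketch only; the variable names and default paths below are assumptions for illustration, not the actual contents of llama_server_application.sh:

```shell
#!/usr/bin/env bash
set -euo pipefail

# Point 3: let callers override paths via environment variables
# instead of hardcoding them in the script.
MODEL_PATH="${MODEL_PATH:-/models/llama-2-7b-chat.gguf}"
WASMEDGE_PLUGIN_PATH="${WASMEDGE_PLUGIN_PATH:-$HOME/.wasmedge/plugin}"

# Point 4: only prefix privileged commands with sudo when the
# script is not already running as root.
SUDO=""
if [ "$(id -u)" -ne 0 ]; then
  SUDO="sudo"
fi

echo "model: $MODEL_PATH"
echo "plugins: $WASMEDGE_PLUGIN_PATH"
# The real script would continue with e.g.: $SUDO ctr run ...
```

With this pattern, `MODEL_PATH=/data/other.gguf ./llama_server_application.sh` reconfigures the script without editing it.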


- name: Install containerd, WasmEdge, and crun with support for plugins and nn-preload
  run: |
    sed 's|https://github.com/containers/crun|-b enable-wasmedge-plugin https://github.com/CaptainVincent/crun|g' containerd/install.sh | bash
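The sed invocation above rewrites containerd/install.sh on the fly before piping it to bash: every occurrence of the upstream crun URL is replaced with the fork's URL plus a `-b enable-wasmedge-plugin` branch flag. A minimal demonstration of the substitution (the `git clone` line is an assumed shape of the install script's contents):

```shell
# Feed a line resembling the install script through the same substitution.
echo 'git clone https://github.com/containers/crun' \
  | sed 's|https://github.com/containers/crun|-b enable-wasmedge-plugin https://github.com/CaptainVincent/crun|g'
# → git clone -b enable-wasmedge-plugin https://github.com/CaptainVincent/crun
```

Using `|` as the sed delimiter avoids having to escape the slashes in the URLs.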
Let's put the patch in https://github.com/second-state/crun instead of a personal repo.

@CaptainVincent CaptainVincent merged commit 3d876fc into second-state:main Jan 8, 2024
42 of 44 checks passed