-
TensorRT-LLM Public
Forked from NVIDIA/TensorRT-LLMTensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficie…
C++ Apache License 2.0 UpdatedDec 6, 2023 -
FasterTransformer Public
Forked from NVIDIA/FasterTransformerTransformer related optimization, including BERT, GPT
C++ Apache License 2.0 UpdatedSep 4, 2023 -
InferLLM Public
Forked from MegEngine/InferLLMa lightweight LLM model inference framework
C++ Apache License 2.0 UpdatedAug 25, 2023 -
onnx-mlir Public
Forked from onnx/onnx-mlirRepresentation and Reference Lowering of ONNX Models in MLIR Compiler Infrastructure
C++ Apache License 2.0 UpdatedJan 6, 2023 -
llvm-project Public
Forked from llvm/llvm-projectThe LLVM Project is a collection of modular and reusable compiler and toolchain technologies. Note: the repository does not accept github pull requests at this moment. Please submit your patches at…
Other UpdatedOct 11, 2022 -
Open standard for machine learning interoperability
C++ Apache License 2.0 UpdatedMay 11, 2022 -
psi Public
Forked from microsoft/psiPlatform for Situated Intelligence
C# Other UpdatedSep 14, 2021 -
-
code_search Public
Forked from hamelsmu/code_searchCode For Medium Article: "How To Create Natural Language Semantic Search for Arbitrary Objects With Deep Learning"
Jupyter Notebook MIT License UpdatedOct 22, 2019 -
search-1047 Public
Forked from ifuding/search-1047A simple search engine based on Nutch and Hadoop.
Java Apache License 2.0 UpdatedApr 21, 2015