-
Notifications
You must be signed in to change notification settings - Fork 97
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
feat(tests) : Update CI to use new workflow and silicon. (#145)
* Update CI to use new workflow and silicon. * remove third-party trtllm * Pin TensorRT-LLM version * Pin TensorRT-LLM to first 0.12 beta version * Ensure torch dependency constraints match TRTLLM * Remove quantization for now * Allow restoring from local dir * build all ranks * Improve checkpoints and engines discovery * use DirEntry.name in the lambda to retrieve filepath * quality * Update checkpoint detection regex * add logic to reload from local directory if model_id is a local transformers save_pretrained model path * quality * make sure to create the sharding config * move to latest trtllm version * remove bad_words for now * in fact its now called bad_token_ids * move on to GenerationExecutor, removing tokenizer param * specify constructor call order for models * specify local workspace for converting local model * fix wrong way of symlinking files and folder in save_pretrained * detect model_type from engine config if applicable * remove deprecated test_config.py * handled dtype from prebuilt engine * handled model_type from different config format * added test for model_type_from_known_config * quality * fix wrong ref name in test_hub * More robust way to load artifacts * fix save_pretrained * fix missing branch for raise in model type inference * quality
- Loading branch information
1 parent
011b5a9
commit fa8556c
Showing
17 changed files
with
314 additions
and
275 deletions.
There are no files selected for viewing
This file was deleted.
Oops, something went wrong.
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,3 +0,0 @@ | ||
[submodule "third-party/tensorrt-llm"] | ||
path = third-party/tensorrt-llm | ||
url = https://github.com/nvidia/tensorrt-llm | ||
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.