Accelerated LLM inference. I write fast kernels and work on kernel optimization on the side.
Pinned
- pytorch/pytorch: Tensors and Dynamic neural networks in Python with strong GPU acceleration
- apache/tvm: Open deep learning compiler stack for cpu, gpu and specialized accelerators
- pytorch/torchdynamo: A Python-level JIT compiler designed to make unmodified PyTorch programs faster.
- pytorch/benchmark: TorchBench is a collection of open source benchmarks used to evaluate PyTorch performance.