ExllamaV2 tensor parallelism to increase multi gpu inference speeds code help #6356

Merged: 11 commits, Sep 28, 2024

Commits on Jul 25, 2024

  1. Merge pull request oobabooga#6271 from oobabooga/dev

    Merge dev branch
    oobabooga authored Jul 25, 2024
    dd97a83
  2. UI: fix saving characters

    oobabooga committed Jul 25, 2024
    498fec2

Commits on Aug 1, 2024

  1. Merge pull request oobabooga#6300 from oobabooga/dev

    Merge dev branch
    oobabooga authored Aug 1, 2024
    d011040

Commits on Aug 20, 2024

  1. Merge pull request oobabooga#6336 from oobabooga/dev

    Merge dev branch
    oobabooga authored Aug 20, 2024
    073694b
  2. Merge pull request oobabooga#6337 from oobabooga/dev

    Merge dev branch
    oobabooga authored Aug 20, 2024
    1b62cd8
  3. Merge pull request oobabooga#6339 from oobabooga/dev

    Merge dev branch
    oobabooga authored Aug 20, 2024
    5522584

Commits on Aug 29, 2024

  1. Add files via upload

    Code to get exllamaV2 tensor parallelization working.
    RandomInternetPreson authored Aug 29, 2024
    377018e

Commits on Aug 31, 2024

  1. 3e44373

Commits on Sep 28, 2024

  1. dc06495
  2. Lint

    oobabooga committed Sep 28, 2024
    1a4c054
  3. Simplify

    oobabooga committed Sep 28, 2024
    725a463