Activity
Alternative implementation using ggml_flash_attn (~16% slower than fa…
Removed unnecessary reshapes when retrieving kv from cache
Merge remote-tracking branch 'origin/pr/224' into falcon40b
Merge branch 'master' into falcon40b
Fixed regression because of incorrect ctx_size calculation
Added Kerfuffle's magic context size fix
Added rearrange of qkv weight memory layout to convert-hf-to-ggml.py …
Fixed offset calculation bug during extraction of query vectors
Fixed quantized version not working due to wrong data type
Updated falcon-quantize to match 7B/40B format produced by convert-hf…
Version which exactly reproduces outputs of the Python implementation…
Experimental support for Falcon-40B (and Falcon-7B); breaks 7B GGML c…
Added mention of missing ALiBi to README.md
Fixed obvious typo in layer mapping
Fixed syntax error, added comment about ALiBi