feat(blog): Add new post on llama-cpp-python and instructor library usage #434
Conversation
Looks good to me! Reviewed entire PR up to commit bf826c3.

Reviewed 121 lines of code across 1 file in 1 minute(s) and 3 second(s).

See details

- Skipped files: 0 (please contact us to request support for these files)
- Confidence threshold: 85%
- Drafted 0 additional comments.
- Workflow ID: wflow_b0ZkUnb5GvB5ybuj

Something look wrong? You can customize Ellipsis by editing the ellipsis.yaml for this repository.

Generated with ❤️ by ellipsis.dev
docs/blog/posts/llama-cpp-python.md (Outdated)
> Recently llama-cpp-python has made support structured outputs via JSON schema available. This is a time-saving alternative to extensive prompt engineering and can be used to obtain structured outputs.
>
> In this example we'll cover a more advanced use case of by using `JSON_SCHEMA` mode to stream out partial models. To learn more partial streaming check out [partial streaming](../../concepts/partial.md).
In this example we'll cover a more advanced use case of `JSON_SCHEMA` mode to stream out partial models. To learn more about partial streaming, check out [partial streaming](../../concepts/partial.md).
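
For context, a minimal sketch of the pattern the comment describes: patching llama-cpp-python's OpenAI-compatible `create` call with instructor in `JSON_SCHEMA` mode, then streaming a partial Pydantic model. The model path and `UserDetail` schema are placeholders, not taken from the PR:

```python
import llama_cpp
import instructor
from pydantic import BaseModel


class UserDetail(BaseModel):
    name: str
    age: int


# Load a local GGUF model; the path is a placeholder.
llama = llama_cpp.Llama(
    model_path="path/to/model.gguf",
    chat_format="chatml",
    n_ctx=2048,
    verbose=False,
)

# Patch the OpenAI-compatible create call so instructor can enforce
# the Pydantic schema via llama.cpp's JSON-schema constrained sampling.
create = instructor.patch(
    create=llama.create_chat_completion_openai_v1,
    mode=instructor.Mode.JSON_SCHEMA,
)

# Stream a partial model: fields fill in incrementally as tokens arrive.
for partial_user in create(
    response_model=instructor.Partial[UserDetail],
    messages=[{"role": "user", "content": "Extract: Jason is 25 years old."}],
    stream=True,
):
    print(partial_user)
```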
docs/blog/posts/llama-cpp-python.md (Outdated)
> console.print(obj)
>
> 1. We use `LlamaPromptLookupDecoding` to obtain structured outputs using JSON schema via a mixture of constrained sampling and speculative decoding. 10 is good for GPU, 2 is good for CPU.
We use `LlamaPromptLookupDecoding` to speed up structured output generation using speculative decoding. The draft model generates candidate tokens during generation; 10 is good for GPU, 2 is good for CPU.
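
As a rough illustration of the passage under review, prompt lookup decoding is passed as a draft model when constructing the `Llama` instance. This is a sketch with a placeholder model path, not the PR's exact code:

```python
import llama_cpp
from llama_cpp.llama_speculative import LlamaPromptLookupDecoding

# Prompt lookup decoding drafts candidate tokens from the prompt itself,
# which the main model then verifies in a single pass; combined with
# JSON-schema constrained sampling this speeds up structured output.
llama = llama_cpp.Llama(
    model_path="path/to/model.gguf",  # placeholder path
    draft_model=LlamaPromptLookupDecoding(num_pred_tokens=10),  # 10 for GPU, 2 for CPU
    n_gpu_layers=-1,
    logits_all=True,
    chat_format="chatml",
    n_ctx=2048,
    verbose=False,
)
```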
Summary:
This PR adds a new blog post discussing the use of llama-cpp-python for structured outputs and the enhancement of `create` calls with the instructor library, including a Python code example.

Key points:
- Adds /docs/blog/posts/llama-cpp-python.md
- Demonstrates llama-cpp-python for structured outputs
- Enhances `create` calls with the instructor library

Generated with ❤️ by ellipsis.dev