feat(llm): Integrating OpenAI seed parameter (#708) (#742)

Co-authored-by: Gamliel Cohen <[email protected]>
sinaptik-ai · Nov 10, 2023 · 1c6598f · 1c6598f
1 parent bb1d7c7
commit 1c6598f
Show file tree

Hide file tree

Showing 5 changed files with 65 additions and 1 deletion.
diff --git a/docs/determinism.md b/docs/determinism.md
@@ -0,0 +1,58 @@
+# Determinism
+## Introduction
+In the realm of Language Model (LM) applications, determinism plays a crucial role, especially when consistent and predictable outcomes are desired. Our library, which integrates multiple LLMs including OpenAI's models, offers users the ability to control this aspect of model behavior through specific parameters like temperature and seed. This document aims to elucidate the importance of these parameters, their usage, and current limitations with different LLM instances such as AzureOpenAI.
+
+## Why Determinism Matters
+Determinism in language models refers to the ability to produce the same output consistently given the same input under identical conditions. This characteristic is vital for:
+
+- Reproducibility: Ensuring the same results can be obtained across different runs, which is crucial for debugging and iterative development.
+- Consistency: Maintaining uniformity in responses, particularly important in scenarios like automated customer support, where varied responses to the same query might be undesirable.
+- Testing: Facilitating the evaluation and comparison of models or algorithms by providing a stable ground for testing.
+
+## The Role of temperature=0
+The temperature parameter in language models controls the randomness of the output. A higher temperature increases diversity and creativity in responses, while a lower temperature makes the model more predictable and conservative. Setting `temperature=0` essentially turns off randomness, leading the model to choose the most likely next word at each step. This is critical for achieving determinism as it minimizes variance in the model's output.
+
+## Implications of temperature=0
+- Predictable Responses: The model will consistently choose the most probable path, leading to high predictability in outputs.
+- Creativity: The trade-off for predictability is reduced creativity and variation in responses, as the model won't explore less likely options.
+
+## Utilizing seed for Enhanced Control
+The seed parameter is another tool to enhance determinism. It sets the initial state for the random number generator used in the model, ensuring that the same sequence of "random" numbers is used for each run. This parameter, when combined with `temperature=0`, offers an even higher degree of predictability.
+
+## Example:
+```py
+import pandas as pd
+from pandasai import SmartDataframe
+from pandasai.llm import OpenAI
+
+# Sample DataFrame
+df = pd.DataFrame({
+    "country": ["United States", "United Kingdom", "France", "Germany", "Italy", "Spain", "Canada", "Australia", "Japan", "China"],
+    "gdp": [19294482071552, 2891615567872, 2411255037952, 3435817336832, 1745433788416, 1181205135360, 1607402389504, 1490967855104, 4380756541440, 14631844184064],
+    "happiness_index": [6.94, 7.16, 6.66, 7.07, 6.38, 6.4, 7.23, 7.22, 5.87, 5.12]
+})
+
+# Instantiate a LLM
+llm = OpenAI(
+    api_token="YOUR_API_TOKEN",
+    temperature=0,
+    seed=26
+)
+
+df = SmartDataframe(df, config={"llm": llm})
+df.chat('Which are the 5 happiest countries?') # answer should me (mostly) consistent across devices.
+```
+
+## Current Limitation:
+### AzureOpenAI Instance
+While the seed parameter is effective with the OpenAI instance in our library, it's important to note that this functionality is not yet available for AzureOpenAI. Users working with AzureOpenAI can still use `temperature=0` to reduce randomness but without the added predictability that seed offers.
+
+### System fingerprint
+As mentioned in the documentation ([OpenAI Seed](https://platform.openai.com/docs/guides/text-generation/reproducible-outputs)) :
+> Sometimes, determinism may be impacted due to necessary changes OpenAI makes to model configurations on our end. To help you keep track of these changes, we expose the system_fingerprint field. If this value is different, you may see different outputs due to changes we've made on our systems.
+
+## Workarounds and Future Updates
+For AzureOpenAI Users: Rely on `temperature=0` for reducing randomness. Stay tuned for future updates as we work towards integrating seed functionality with AzureOpenAI.
+For OpenAI Users: Utilize both `temperature=0` and seed for maximum determinism.
+
+
diff --git a/mkdocs.yml b/mkdocs.yml
@@ -21,6 +21,7 @@ nav:
       - shortcuts.md
       - custom-head.md
       - save-dataframes.md
+      - determinism.md
       - cache.md
       - middlewares.md
       - custom-response.md

diff --git a/pandasai/llm/base.py b/pandasai/llm/base.py
@@ -221,6 +221,7 @@ class BaseOpenAI(LLM, ABC):
     frequency_penalty: float = 0
     presence_penalty: float = 0.6
     stop: Optional[str] = None
+    seed: Optional[int] = None
     # support explicit proxy for OpenAI
     openai_proxy: Optional[str] = None
 
@@ -229,7 +230,7 @@ def _set_params(self, **kwargs):
         Set Parameters
         Args:
             **kwargs: ["model", "engine", "deployment_id", "temperature","max_tokens",
-            "top_p", "frequency_penalty", "presence_penalty", "stop", ]
+            "top_p", "frequency_penalty", "presence_penalty", "stop", "seed", ]
 
         Returns:
             None.
@@ -246,6 +247,7 @@ def _set_params(self, **kwargs):
             "frequency_penalty",
             "presence_penalty",
             "stop",
+            "seed",
         ]
         for key, value in kwargs.items():
             if key in valid_params:
@@ -267,6 +269,7 @@ def _default_params(self) -> Dict[str, Any]:
             "top_p": self.top_p,
             "frequency_penalty": self.frequency_penalty,
             "presence_penalty": self.presence_penalty,
+            "seed": self.seed,
         }
 
     def completion(self, prompt: str) -> str:

diff --git a/tests/llms/test_azure_openai.py b/tests/llms/test_azure_openai.py
@@ -105,6 +105,7 @@ def test_completion(self, mocker):
             top_p=openai.top_p,
             frequency_penalty=openai.frequency_penalty,
             presence_penalty=openai.presence_penalty,
+            seed=openai.seed
         )
 
         assert result == expected_text

diff --git a/tests/llms/test_openai.py b/tests/llms/test_openai.py
@@ -81,6 +81,7 @@ def test_completion(self, mocker):
             top_p=openai.top_p,
             frequency_penalty=openai.frequency_penalty,
             presence_penalty=openai.presence_penalty,
+            seed=openai.seed
         )
 
         assert result == expected_text