Fix max tokens for embedding models #18

Open
wants to merge 1 commit into
base: main

Conversation

nkgami
Contributor

@nkgami nkgami commented Sep 11, 2024

Based on other documentation, the max tokens for embedding models should be 8,192, not 8,191. Reference: https://learn.microsoft.com/en-us/azure/ai-services/openai/how-to/embeddings?tabs=console

Contributor

@nkgami : Thanks for your contribution! The author(s) have been notified to review your proposed change.


Learn Build status updates of commit 605ee39:

✅ Validation status: passed

| File | Status | Preview URL | Details |
| --- | --- | --- | --- |
| articles/ai-services/openai/concepts/models.md | ✅Succeeded | | |

For more details, please refer to the build report.


@Court72
Contributor

Court72 commented Sep 11, 2024

@mrbullwinkle

Can you review the proposed changes?

Important: When the changes are ready for publication, adding a #sign-off comment is the best way to signal that the PR is ready for the review team to merge.

#label:"aq-pr-triaged"
@MicrosoftDocs/public-repo-pr-review-team

prmerger-automator bot added the aq-pr-triaged (C+L Pull Request Review Team label) label Sep 11, 2024
learn-build-service-prod bot pushed a commit that referenced this pull request Sep 12, 2024
@mrbullwinkle
Contributor

@nkgami Let me get back to you on this. We previously had 8192 set everywhere and were then asked by the PG to update it to 8191, so the location that still says 8192 was an oversight in that round of changes.

8191 is also what OpenAI's docs currently state:

https://platform.openai.com/docs/guides/embeddings/embedding-models

With that said, I am aware, and have confirmed by testing, that if you exceed the token limit with either OAI or AOAI, the returned error message says the limit is 8192. The 8192 value also appears in various code comments in the OpenAI Python library.

It is possible that the issue is with the error message stating 8192, but I haven't been able to track down a definitive answer yet.
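
Until the 8191 vs. 8192 question is settled, a client-side guard that uses the lower value should be safe under either reading. Below is a minimal sketch of that idea, not anything taken from the docs or the OpenAI library: `CONSERVATIVE_MAX_TOKENS`, `whitespace_tokenize`, `fits_embedding_limit`, and `truncate_to_limit` are hypothetical names, and the whitespace tokenizer is only a stand-in, since a real check would need the model's actual tokenizer.

```python
from typing import Callable, List, Sequence

# Assumption: pick the lower of the two documented values (8191 vs. 8192),
# so the guard holds whichever number turns out to be the real limit.
CONSERVATIVE_MAX_TOKENS = 8191


def whitespace_tokenize(text: str) -> List[str]:
    """Hypothetical stand-in tokenizer that splits on whitespace.

    Real token counts depend on the embedding model's own tokenizer.
    """
    return text.split()


def fits_embedding_limit(
    text: str,
    tokenize: Callable[[str], Sequence] = whitespace_tokenize,
    max_tokens: int = CONSERVATIVE_MAX_TOKENS,
) -> bool:
    """Return True if the tokenized input is within the chosen limit."""
    return len(tokenize(text)) <= max_tokens


def truncate_to_limit(
    tokens: Sequence,
    max_tokens: int = CONSERVATIVE_MAX_TOKENS,
) -> list:
    """Keep only the first max_tokens tokens."""
    return list(tokens[:max_tokens])
```

Passing the model's real tokenizer as `tokenize` keeps the guard itself unchanged; only the counting step differs.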


This pull request has been inactive for at least 14 days. If you are finished with your changes, don't forget to sign off. See the contributor guide for instructions.

github-actions bot added the inactive (This PR is inactive for more than 14 days) label Sep 27, 2024
3 participants