Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[REQUEST] Add image captioning #618

Open
robomotic opened this issue Jan 12, 2025 · 0 comments
Open

[REQUEST] Add image captioning #618

robomotic opened this issue Jan 12, 2025 · 0 comments
Labels
enhancement New feature or request

Comments

@robomotic
Copy link

Reference Issues

No response

Summary

When a document is decomposed and image are extracted, you could also extract captions for each and store the text in the document, this will provide better results.

Basic Example

We could support BLIP and OpenaAI Vision API initially.

Drawbacks

Since the text doesn't exist in the original file, we should show that the text refers to an image.

Additional information

No response

@robomotic robomotic added the enhancement New feature or request label Jan 12, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

1 participant