[REQUEST] Add image captioning #618

robomotic · 2025-01-12T10:28:10Z

Reference Issues

No response

Summary

When a document is decomposed and image are extracted, you could also extract captions for each and store the text in the document, this will provide better results.

Basic Example

We could support BLIP and OpenaAI Vision API initially.

Drawbacks

Since the text doesn't exist in the original file, we should show that the text refers to an image.

Additional information

No response

robomotic added the enhancement New feature or request label Jan 12, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[REQUEST] Add image captioning #618

[REQUEST] Add image captioning #618

robomotic commented Jan 12, 2025

[REQUEST] Add image captioning #618

[REQUEST] Add image captioning #618

Comments

robomotic commented Jan 12, 2025

Reference Issues

Summary

Basic Example

Drawbacks

Additional information