Skip to content

Latest commit

 

History

History
4 lines (3 loc) · 558 Bytes

README.md

File metadata and controls

4 lines (3 loc) · 558 Bytes

Revisiting Structure OCR with multi-modal LLMs - detecting and resolving hallucinations

As Large Language Models (LLMs) evolve to process multiple modalities, we face challenges reminiscent of early text-based LLMs: basic inconsistency, inaccuracy, and hallucinations.

This example notebook demonstrates techniques to increase the accuracy of vision use cases using various techniques. Inlucded is a quick introduction to AWS Gen AI services landscape and how to get started with using Anthropic Claude and other LLMs on Amazon Bedrock.