Skip to content

Commit

Permalink
Updated with README fixes
Browse files Browse the repository at this point in the history
  • Loading branch information
chettiargautam committed May 14, 2024
1 parent b484803 commit 3765a4f
Showing 1 changed file with 1 addition and 3 deletions.
4 changes: 1 addition & 3 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -219,9 +219,7 @@ This command will read the `requirements.txt` file and install the specified pac
- Extracts OCR data from a PDF and augments it with HTML tag and style information.
- Supports extraction at word, line, or paragraph level (specified by `target`).
- Generates a JSON file containing the augmented OCR data if `output_json_path` is provided.
- Requires `pdf2image` and `pytesseract` libraries for PDF to image conversion and OCR
, respectively.
- Requires `pdf2image` and `pytesseract` libraries for PDF to image conversion and OCR, respectively.
## Sample Usage
Expand Down

0 comments on commit 3765a4f

Please sign in to comment.