You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
However, with use_llm: True and force_ocr: True, all text blocks just say Text and none of the actual text is captured.
Layout seems solid, image attached.
Hi, I ran python convert_single.py FILEPATH --page_range 19-21 --force_ocr --use_llm --output_format json as a test, and the output looked fine to me:
{
"id": "/page/19/Text/3",
"block_type": "Text",
"html": "<p block-type=\"Text\">One of the major applications of measurement is the development of guidelines which help to standardize product design and engineering practice.</p>",
"polygon": [
[
20.221757322175733,
274.2795180722892
],
[
340.169921875,
What command are you running when you get the issues?
Used this 270 page OCR needed PDF from another GitHub issue: https://nvlpubs.nist.gov/nistpubs/Legacy/SP/nbsspecialpublication340.pdf
The processing speed is fine (using M3 Pro)
However, with use_llm: True and force_ocr: True, all text blocks just say Text and none of the actual text is captured.
Layout seems solid, image attached.
"id": "/page/19/Text/3",
","block_type": "Text",
"html": "<p block-type="Text">
Not sure where this is getting dropped, since the OCR is taking 40 minutes and it is definitely doing something.
The text was updated successfully, but these errors were encountered: