OCR model example using two stage pipeline #563

aljazkonec1 · 2024-12-16T20:10:56Z

This PR adds an OCR model example using a two stage pipeline.

klemen1999

Generally LGTM, left some comments

klemen1999 · 2024-12-18T08:42:52Z

gen3/neural-networks/advanced-examples/ocr/ocr/README.md

-python3 main.py
+# Instalation
+Running this example requires a **Luxonis OAK4 device** connected to your computer. You can find more information about the supported devices and the set up instructions in our [Documentation](https://rvc4.docs.luxonis.com/hardware).
+Moreover, you need to prepare a **Python 3.10** environment with [DepthAI](https://pypi.org/project/depthai/) and [DepthAI Nodes](https://pypi.org/project/depthai-nodes/) packages installed. You can do this by running:


Where does py3.10 dependency come from since depthai and depthai-nodes should both work also with 3.8?

I left it the same as we have in the general README

gen3/neural-networks/advanced-examples/ocr/ocr/README.md

gen3/neural-networks/advanced-examples/ocr/ocr/main.py

klemen1999 · 2024-12-18T08:49:16Z

gen3/neural-networks/advanced-examples/ocr/ocr/main.py

+        replay_node.setLoop(True)
+
+        video_resize_node = pipeline.create(dai.node.ImageManipV2)
+        video_resize_node.initialConfig.setOutputSize(1728, 960)


Is the reason we request 3x the input size just for nicer visualization at the end?

The OCR model accepts 320x48, so cropping from a small 576x320 image we would have to upsample cropped detection by a lot and we would loose accuracy in second stage

gen3/neural-networks/advanced-examples/ocr/ocr/utils/host_process_detections.py

aljazkonec1 added 5 commits December 13, 2024 14:43

mre

216eb0b

mre

77d6c77

fixed

6fff528

updated host

c3acb94

added docs and replay option

a1e8785

klemen1999 reviewed Dec 18, 2024

View reviewed changes

implementing suggestions

0dd5baf

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

OCR model example using two stage pipeline #563

OCR model example using two stage pipeline #563

aljazkonec1 commented Dec 16, 2024

klemen1999 left a comment

klemen1999 Dec 18, 2024

aljazkonec1 Dec 18, 2024

klemen1999 Dec 18, 2024

aljazkonec1 Dec 18, 2024

OCR model example using two stage pipeline #563

Are you sure you want to change the base?

OCR model example using two stage pipeline #563

Conversation

aljazkonec1 commented Dec 16, 2024

klemen1999 left a comment

Choose a reason for hiding this comment

klemen1999 Dec 18, 2024

Choose a reason for hiding this comment

aljazkonec1 Dec 18, 2024

Choose a reason for hiding this comment

klemen1999 Dec 18, 2024

Choose a reason for hiding this comment

aljazkonec1 Dec 18, 2024

Choose a reason for hiding this comment