Note - Install the required libraries (opencv-python, pdf2image, img2pdf, numpy, etc.). pdf2image also needs Poppler; Windows builds can be downloaded from https://github.com/oschwartz10612/poppler-windows/releases/
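Before running the tool, it can help to confirm that the libraries and Poppler are actually visible to Python. The snippet below is a minimal sketch, not part of the project: the helper names missing_modules and poppler_on_path are hypothetical, and it assumes Poppler's pdftoppm executable (which pdf2image calls) should be on PATH.

```python
import importlib.util
import shutil

# Import names for the packages listed above (opencv-python is imported as cv2).
REQUIRED_MODULES = ["cv2", "pdf2image", "img2pdf", "numpy"]


def missing_modules(names):
    """Return the top-level module names that cannot be found."""
    return [name for name in names if importlib.util.find_spec(name) is None]


def poppler_on_path():
    """Return True if Poppler's pdftoppm executable is found on PATH."""
    return shutil.which("pdftoppm") is not None


if __name__ == "__main__":
    missing = missing_modules(REQUIRED_MODULES)
    if missing:
        print("Missing Python modules:", ", ".join(missing))
    if not poppler_on_path():
        print("pdftoppm not found on PATH; add Poppler's bin folder to PATH.")
```

If Poppler is not on PATH, pdf2image's convert_from_path also accepts a poppler_path argument pointing at the Poppler bin folder.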
This tool converts a PDF document into a segmented version of that PDF.
All intermediate files are saved in the input and output image folders.
-
Download the ZIP file of the project and open its source folder.
-
Navigate to layout.py and run it directly, or run it from the command line with:
python layout.py
-
To use your own document, add it to the source folder and change the path in layout.py:
pdf_file = '...\\(PDF_name).pdf'
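A wrong path is an easy mistake here, especially with Windows backslashes. The following is a small sketch of a check that could run before processing; check_pdf_path is a hypothetical helper, not something defined in layout.py.

```python
from pathlib import Path


def check_pdf_path(pdf_file):
    """Return pdf_file as a Path, or raise if it is not an existing .pdf file."""
    path = Path(pdf_file)
    if path.suffix.lower() != ".pdf":
        raise ValueError(f"Not a PDF file: {path}")
    if not path.is_file():
        raise FileNotFoundError(f"PDF not found: {path}")
    return path
```

On Windows, either double each backslash as in the line above or use a raw string such as r'...\(PDF_name).pdf', so that sequences like \t are not read as escape characters.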
-
Find the results in the Processed Files folder:
- input_img : the page images extracted from the PDF, in OpenCV format.
- output_img : the output images after resizing and object detection.
- output_pdf : the final PDFs, built by merging the output images.
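The flow through these three folders can be sketched roughly as follows. This is an illustration under assumptions, not the code in layout.py: the function names, the file naming scheme, and the detect callback (standing in for the resizing and object detection step) are all hypothetical. The library calls used are pdf2image.convert_from_path, cv2.cvtColor, cv2.imwrite and img2pdf.convert.

```python
from pathlib import Path

# Subfolder names taken from the list above.
FOLDERS = ("input_img", "output_img", "output_pdf")


def make_output_dirs(base="Processed Files"):
    """Create the three result folders under base and return them by name."""
    dirs = {name: Path(base) / name for name in FOLDERS}
    for folder in dirs.values():
        folder.mkdir(parents=True, exist_ok=True)
    return dirs


def pdf_to_segmented_pdf(pdf_file, base="Processed Files", detect=None):
    """Convert pdf_file page by page and merge the processed pages into a PDF.

    detect is a hypothetical callback (image in, image out) standing in for
    the project's own resizing and detection; if omitted, pages pass through.
    """
    # Imported here so the helper above works without the heavy dependencies.
    import cv2
    import img2pdf
    import numpy as np
    from pdf2image import convert_from_path

    dirs = make_output_dirs(base)
    stem = Path(pdf_file).stem
    output_paths = []
    for number, page in enumerate(convert_from_path(str(pdf_file)), start=1):
        # pdf2image yields PIL images (RGB); OpenCV works with BGR arrays.
        image = cv2.cvtColor(np.array(page), cv2.COLOR_RGB2BGR)
        name = f"{stem}_{number:03d}.png"  # assumed naming scheme
        cv2.imwrite(str(dirs["input_img"] / name), image)

        result = detect(image) if detect is not None else image
        cv2.imwrite(str(dirs["output_img"] / name), result)
        output_paths.append(str(dirs["output_img"] / name))

    # img2pdf merges the processed page images into a single PDF.
    merged = dirs["output_pdf"] / f"{stem}_segmented.pdf"
    merged.write_bytes(img2pdf.convert(output_paths))
    return merged
```

Keeping the per-page images on disk is what makes the intermediate work inspectable: a bad page in the final PDF can be traced back to its file in input_img or output_img.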