From a55db446ebb907239b452cfd94c769e9fd4f06e0 Mon Sep 17 00:00:00 2001
From: Justin Hsu <141555665+JustinHsu1019@users.noreply.github.com>
Date: Thu, 14 Nov 2024 10:40:33 +0800
Subject: [PATCH] docs: tab (test for main branch protection)

---
 README.md | 20 ++++++++++----------
 1 file changed, 10 insertions(+), 10 deletions(-)

diff --git a/README.md b/README.md
index 2a8c3fc..d2b1352 100644
--- a/README.md
+++ b/README.md
@@ -104,21 +104,21 @@ To set up the development environment, follow these steps:
 
 Configure the Tesseract and Poppler paths in the code:
 
-```python
-# Configure Tesseract path if necessary (update this path as needed)
-pytesseract.pytesseract.tesseract_cmd = r'C:\Program Files\Tesseract-OCR\tesseract.exe'
-
-# Specify the path to the Poppler binaries
-poppler_path = r"C:\Program Files\poppler-24.08.0\Library\bin"
-```
+ ```python
+ # Configure Tesseract path if necessary (update this path as needed)
+ pytesseract.pytesseract.tesseract_cmd = r'C:\Program Files\Tesseract-OCR\tesseract.exe'
+
+ # Specify the path to the Poppler binaries
+ poppler_path = r"C:\Program Files\poppler-24.08.0\Library\bin"
+ ```
 
 Make sure to replace the paths above with your actual local installation paths.
 
 After downloading the official dataset, make sure your ZIP file contains the following folders and files:
 
-- `競賽資料集/reference/faq/pid_map_content.json`
-- `競賽資料集/reference/finance/*.pdf`
-- `競賽資料集/reference/insurance/*.pdf`
+ - `競賽資料集/reference/faq/pid_map_content.json`
+ - `競賽資料集/reference/finance/*.pdf`
+ - `競賽資料集/reference/insurance/*.pdf`
 
 Run the data preprocessing scripts: