diff --git a/README.md b/README.md index e015ac2..5497169 100644 --- a/README.md +++ b/README.md @@ -121,21 +121,21 @@ To set up the development environment, follow these steps: 在程式碼中配置 Tesseract 和 Poppler 的路徑: - ```python - # Configure Tesseract path if necessary (update this path as needed) - pytesseract.pytesseract.tesseract_cmd = r'C:\Program Files\Tesseract-OCR\tesseract.exe' +```python +# Configure Tesseract path if necessary (update this path as needed) +pytesseract.pytesseract.tesseract_cmd = r'C:\Program Files\Tesseract-OCR\tesseract.exe' - # Specify the path to the Poppler binaries - poppler_path = r"C:\Program Files\poppler-24.08.0\Library\bin" - ``` +# Specify the path to the Poppler binaries +poppler_path = r"C:\Program Files\poppler-24.08.0\Library\bin" +``` 確保將上述路徑替換為本地實際安裝的路徑。 確保您的 ZIP 文件包含以下資料夾和文件 (下載官方 dataset 後): - - `競賽資料集/reference/faq/pid_map_content.json` - - `競賽資料集/reference/finance/*.pdf` - - `競賽資料集/reference/insurance/*.pdf` + • `競賽資料集/reference/faq/pid_map_content.json` + • `競賽資料集/reference/finance/*.pdf` + • `競賽資料集/reference/insurance/*.pdf` 運行 data preprocess scripts: