Releases: py-pdf/pypdf_table_extraction
Releases · py-pdf/pypdf_table_extraction
v1.0.1
Changes
🪲 Fixes
📦 Dependencies
- Bump safety from 3.2.10 to 3.2.11 (#281) @dependabot
- Bump tomli from 2.0.2 to 2.1.0 (#280) @dependabot
- Bump packaging from 24.1 to 24.2 (#278) @dependabot
- Bump pydantic from 2.5.3 to 2.9.2 (#279) @dependabot
- Bump dparse from 0.6.4b0 to 0.6.4 (#276) @dependabot
v1.0.0
Changes
- Update version manually for bumpversion plugin to work (#267) @snanda85
- Make plot dependencies optional (#275) @bosd
- Handlers.py Fixup leftover typing (#272) @bosd
- Various documentation fixes (#227) @bosd
- Network parser Fix B903 (#217) @bosd
- Fix test_image_warning test (#196) @bosd
- [REF] flag fontsize (#193) @bosd
- Fix S310 Audit url open for permitted schemes. (#194) @bosd
- Rebrand Image based error message (#168) @bosd
- Silence S311 Error (#177) @bosd
- [REF] Image processing, Simplyify, Cleanup, Optimize (#152) @bosd
- Fix variable naming and qa checks (#164) @bosd
💥 Breaking Changes
🚀 Features
- 🧑🎓 add parser comparison notebook ✨ (#263) @bosd
- 🧑🎓 [ADD] Hybrid Parser notebook (#262) @bosd
- 🧑🎓 Quickstart notebook (#231) @bosd
- add pdfium backend and set it as default image conversion backend ✨ (#230) @bosd
- [Fix] Image conversion backend Fallback, More verbose backend exception feedback (#183) @bosd
- Add typing for camelot/backends (#12) @foarsitter
- Release the new Network and Hybrid parser ❇️ 🚀 (#163) @bosd
🔥 Removals and Deprecations
🪲 Fixes
- [FIX] hybrid Keyerror (#251) @bosd
- [FIX] compute_parse_error, Index out of range (#249) @bosd
- [REF]: core set_border: Improve performance, Fix index out of Range (#247) @bosd
- [FIX] Network parser running infinitly (#246) @bosd
- [REF] netw gen bbox (#244) @bosd
- boundaries to split lines Fix index out of range (#233) @bosd
- [REF]: Core Table set edges to reduce complexity (#223) @bosd
- Fix custom backend functionality (#225) @snanda85
- [Fix] Image conversion backend Fallback, More verbose backend exception feedback (#183) @bosd
- Various fixes (#154) @bosd
🐎 Performance
- [IMP] reduce pdf object loop (#253) @bosd
- Eliminated duplicate processes. (#255) @bosd
- [REF] Compute_plausible_gaps, Efficiency, Stability (#243) @bosd
- [REF] remove_unconnected_edges (#242) @bosd
- [REF]: core set_border: Improve performance, Fix index out of Range (#247) @bosd
- [REF] netw gen bbox (#244) @bosd
- [REF] slow np.isclose to math.isclose (#166) @bosd
🚨 Testing
- Activate pre-commit on gh actions (#228) @bosd
- [FIX] Warning on test cli quiet (#189) @bosd
- [REF] silence download_url S310,add typing (#187) @bosd
- New test for matplotlib importerror (#167) @bosd
👷 Continuous Integration
📚 Documentation
- Fixup leftover renaming in documentation (#273) @bosd
- Update Documentation Reflect new namespace (#271) @bosd
- 🧑🎓 add parser comparison notebook ✨ (#263) @bosd
- 🧑🎓 [ADD] Hybrid Parser notebook (#262) @bosd
- 🧑🎓 Quickstart notebook (#231) @bosd
- Update docstrings, add backends and fallback (#229) @bosd
- [IMP] CLI Documentation (#182) @bosd
- Release the new Network and Hybrid parser ❇️ 🚀 (#163) @bosd
🔨 Refactoring
- Clean code (#261) @bosd
- [REF] Compute_plausible_gaps, Efficiency, Stability (#243) @bosd
- [REF] remove_unconnected_edges (#242) @bosd
- [REF] netw gen bbox (#244) @bosd
- Various Flake8 fixes (#224) @bosd
- [REF] Simplify and fix Table.set_span (#226) @bosd
- [REF]: Core Table set edges to reduce complexity (#223) @bosd
- Fix custom backend functionality (#225) @snanda85
- [REF]: network parser generate_table_bbox -> split into mark_processe… (#211) @bosd
- [REF] Network parser search header split into sub methods to reduce complexity (#216) @bosd
- [REF]: find lines (#204) @bosd
- Flake8 fixes base parser (#205) @bosd
- [REF] search_table_body (#203) @bosd
- [REF]: get_table_areas (#199) @bosd
- [REF]: Find_closest_tls (#202) @bosd
- [REF]: get_table_index (#200) @bosd
- [REF] copy spanning text (#198) @bosd
- [REF] lattice -reduce_index (#197) @bosd
- [REF] Fix B028 (#201) @bosd
- [REF] Scale Image (#195) @bosd
- [REF] get_index_closest_point (#175) @bosd
- [REF] split_textline (#178) @bosd
- Activate mypy type checking (#190) @bosd
- [REF] silence download_url S310,add typing (#187) @bosd
- [REF] Compute accuracy, score_val to lowercase (#176) @bosd
- [REF] slow np.isclose to math.isclose (#166) @bosd
- Fix Flake8 warning on test_invalid_url (#184) @bosd
- Release the new Network and Hybrid parser ❇️ 🚀 (#163) @bosd
💄 Style
- [IMP] add typing to handlers, update docstings and pdfminer url (#254) @bosd
- Pre-commit fixes (#185) @bosd
- Various fixes (#154) @bosd
📦 Dependencies
- 💣 Make Ghostscript an optional dependency ✨ (#258) @bosd
- Bump typer from 0.12.5 to 0.13.0 (#270) @dependabot
- Bump pypa/gh-action-pypi-publish from 1.12.0 to 1.12.2 (#269) @dependabot
- Bump pypa/gh-action-pypi-publish from 1.11.0 to 1.12.0 (#268) @dependabot
- Bump rich from 13.9.3 to 13.9.4 (#266) @dependabot
- Bump setuptools from 75.2.0 to 75.3.0 (#265) @dependabot
- Bump typeguard from 4.3.0 to 4.4.0 (#237) @dependabot
- Update pandas and lockfile to fix #250 (#256) @bosd
- Bump et-xmlfile from 1.1.0 to 2.0.0 (#236) @dependabot
- Bump safety from 3.2.9 to 3.2.10 (#239) @dependabot
- Bump virtualenv from 20.27.0 to 20.27.1 (#240) @dependabot
- Bump safety-schemas from 0.0.5 to 0.0.8 (#238) @dependabot
- add pdfium backend and set it as default image conversion backend ✨ (#230) @bosd
- Bump crazy-max/ghaction-github-labeler from 5.0.0 to 5.1.0 (#235) @dependabot
- Bump pip from 24.2 to 24.3.1 in /.github/workflows (#234) @dependabot
- Bump virtualenv from 20.27.0 to 20.27.1 in /.github/workflows (#241) @dependabot
- Bump pypa/gh-action-pypi-publish from 1.10.3 to 1.11.0 (#248) @dependabot
- Bump pydantic from 2.5.3 to 2.9.2 (#219) @dependabot
- Bump safety from 3.2.8 to 3.2.9 (#220) @dependabot
- Bump safety-schemas from 0.0.7 to 0.0.8 (#218) @dependabot
- Updated pypdf dependency to include v5 (#221) @snanda85
- Bump safety-schemas from 0.0.5 to 0.0.7 (#215) @dependabot
- Bump rich from 13.9.2 to 13.9.3 (#213) @dependabot
- Bump pydata-sphinx-theme from 0.15.4 to 0.16.0 in /docs (#212) @dependabot
- Bump mypy from 1.12.1 to 1.13.0 (#214) @dependabot
- Bump virtualenv from 20.26.6 to 20.27.0 (#206) @dependabot
- Bump mypy from 1.12.0 to 1.12.1 (#207) @dependabot
- Bump virtualenv from 20.26.6 to 20.27.0 in /.github/workflows (#209) @dependabot
- Bump cryptography from 43.0.1 to 43.0.3 (#208) @dependabot
- Activate mypy type checking (#190) @bosd
- Bump setuptools from 75.1.0 to 75.2.0 (#188) @dependabot
- Bump mypy from 1.11.2 to 1.12.0 (#186) @dependabot
- Bump sphinx from 8.1.0 to 8.1.3 in /docs (#180) @dependabot
- Bump sphinx from 8.0.2 to 8.1.0 in /docs (#173) @dependabot
- Bump distlib from 0.3.8 to 0.3.9 (#171) @dependabot
- Bump nox from 2024.4.15 to 2024.10.9 in /.github/workflows (#170) @dependabot
- Bump charset-normalizer from 3.3.2 to 3.4.0 (#172) @dependabot
- Bump pypa/gh-action-pypi-publish from 1.10.2 to 1.10.3 (#160) @dependabot
- Bump rich from 13.9.1 to 13.9.2 (#161) @dependabot
- Bump pre-commit-hooks from 4.6.0 to 5.0.0 (#162) @dependabot
v0.0.2
Changes
- Version bump to 0.0.2 release (#159) @bosd
- Remove Ghostscript deprecation warning (#155) @bosd
- [MRG] added test to validate when plot_type is None (#41) @bosd
- [MRG] IMP Coverage: test for invalid url (#40) @bosd
- Added supported for CLI margins option. (#111) @Niremizov
- Bump sphinx-prompt from 1.8.0 to 1.9.0 in /docs (#95) @bosd
- Update .Gitignore (#92) @bosd
- [DEV] Run test workflow against main (#50) @MasterOdin
- Run actions only against pull requests. (#26) @foarsitter
🚀 Features
- Rebrand CLI to pypdf_table_extraction (#158) @bosd
- Reflect camelot in pypdf_table_extraction namespace (#11) @foarsitter
- [MRG] Python 3.12 (#44) @bosd
- [IMP] Update readthedocs url (#59) @bosd
- [UPD] Update repo link in toml file (#64) @bosd
- Rebrand: New Logo (#38) @bosd
- add iter() for TableList to support enumerate() (#13) @stonyw
- Add support for parsing PDF pages in parallel (multiprocessing) (#17) @phoewass
🪲 Fixes
- Bugfix for Stream._group_rows (#19) @ollynowell
- [IMP] Improve poppler subprocess security (#153) @bosd
- [FIX]/[IMP] Windows Support (#149) @bosd
- Fixes: IndexError while using split_text (#21) @snanda85
- Poppler backend: search for pdftopng in current environment (#4) @orent
🐎 Performance
- Speedup as in camelot-dev#161 (#94) @bosd
- [IMP] Opencv headless (#139) @bosd
- [MRG] Utils: optimise get_page_layout (#5) @karlowich
🚨 Testing
👷 Continuous Integration
- Improve Quiet cli test (#156) @bosd
- [FIX]/[IMP] Windows Support (#149) @bosd
- [FIX]: Fix Github action: Coverage combine (#116) @bosd
- Use Up/down artifactv4 (#114) @bosd
- [IMP]: Add pytest-mpl (#60) @bosd
- [FIX]: codecov action (#61) @bosd
- Fix CI: 🟢 mac-os specific ghostscript install (#36) @bosd
- Use the absolute path to constraints.txt (#25) @foarsitter
📚 Documentation
- Rebrand CLI to pypdf_table_extraction (#158) @bosd
- [IMP] Update readthedocs url (#59) @bosd
- [IMP][UPD] Documentation Big Bang (#88) @bosd
- [UPD] Update version, name to new repo (#65) @bosd
- [MRG] Delete FUNDING.yml (#34) @bosd
- Update Documentation Syntax (#51) @bosd
- [MRG] Update CODE_OF_CONDUCT.md: Repo link (#33) @bosd
🔨 Refactoring
- Refactor method
Table.set_edges
for readability (#52) @tomprogrammer - Reflect camelot in pypdf_table_extraction namespace (#11) @foarsitter
- [IMP] Versioning, centralize metadata, Flake8 fixes (#157) @bosd
- Improve Quiet cli test (#156) @bosd
💄 Style
- Refactor method
Table.set_edges
for readability (#52) @tomprogrammer - Pre commit Fixes (#148) @bosd
- [IMP][UPD] Documentation Big Bang (#88) @bosd
- Optimised and cleaned the code. (#10) @python3-dev
📦 Dependencies
- [IMP] Opencv headless (#139) @bosd
- [IMP] Versioning, centralize metadata, Flake8 fixes (#157) @bosd
- Bump tomli from 2.0.1 to 2.0.2 (#151) @dependabot
- Bump rich from 13.8.1 to 13.9.1 (#150) @dependabot
- Bump virtualenv from 20.26.5 to 20.26.6 (#147) @dependabot
- Bump virtualenv from 20.26.5 to 20.26.6 in /.github/workflows (#145) @dependabot
- Bump safety from 3.2.7 to 3.2.8 (#146) @dependabot
- Bump fonttools from 4.54.0 to 4.54.1 (#144) @dependabot
- Bump bandit from 1.7.9 to 1.7.10 (#142) @dependabot
- Bump tzdata from 2024.1 to 2024.2 (#143) @dependabot
- Bump fonttools from 4.53.1 to 4.54.0 (#141) @dependabot
- Bump pypa/gh-action-pypi-publish from 1.10.1 to 1.10.2 (#140) @dependabot
- Bump platformdirs from 4.3.3 to 4.3.6 (#136) @dependabot
- Bump pydantic from 2.9.1 to 2.9.2 (#137) @dependabot
- Bump virtualenv from 20.26.4 to 20.26.5 in /.github/workflows (#138) @dependabot
- Bump virtualenv from 20.26.4 to 20.26.5 (#135) @dependabot
- Bump setuptools from 75.0.0 to 75.1.0 (#134) @dependabot
- Bump setuptools from 74.1.2 to 75.0.0 (#131) @dependabot
- Bump platformdirs from 4.3.2 to 4.3.3 (#130) @dependabot
- Bump zipp from 3.20.1 to 3.20.2 (#132) @dependabot
- Bump identify from 2.6.0 to 2.6.1 (#133) @dependabot
- Bump idna from 3.8 to 3.10 (#129) @dependabot
- Bump urllib3 from 2.2.2 to 2.2.3 (#128) @dependabot
- Bump pytz from 2024.1 to 2024.2 (#127) @dependabot
- Bump importlib-metadata from 8.4.0 to 8.5.0 (#126) @dependabot
- Bump rich from 13.8.0 to 13.8.1 (#125) @dependabot
- Bump pytest from 8.3.2 to 8.3.3 (#124) @dependabot
- Bump importlib-resources from 6.4.4 to 6.4.5 (#123) @dependabot
- Bump pydantic from 2.9.0 to 2.9.1 (#122) @dependabot
- Bump mdit-py-plugins from 0.4.1 to 0.4.2 (#121) @dependabot
- Bump platformdirs from 4.2.2 to 4.3.2 (#118) @dependabot
- Bump virtualenv from 20.26.3 to 20.26.4 (#119) @dependabot
- Bump virtualenv from 20.26.3 to 20.26.4 in /.github/workflows (#120) @dependabot
- Bump pydantic from 2.8.2 to 2.9.0 (#117) @dependabot
- Bump setuptools from 73.0.1 to 74.1.2 (#115) @dependabot
- Bump cffi from 1.17.0 to 1.17.1 (#112) @dependabot
- Bump cryptography from 43.0.0 to 43.0.1 (#110) @dependabot
- Bump kiwisolver from 1.4.5 to 1.4.7 (#113) @dependabot
- Bump pypa/gh-action-pypi-publish from 1.10.0 to 1.10.1 (#108) @dependabot
- [MRG] Python 3.12 (#44) @bosd
- Bump mypy from 1.11.1 to 1.11.2 (#86) @dependabot
- Bump pypa/gh-action-pypi-publish from 1.9.0 to 1.10.0 (#105) @dependabot
- Bump certifi from 2024.7.4 to 2024.8.30 (#103) @dependabot
- Bump pyparsing from 3.1.2 to 3.1.4 (#104) @dependabot
- Bump pbr from 6.0.0 to 6.1.0 (#102) @dependabot
- Bump importlib-resources from 6.4.3 to 6.4.4 (#101) @dependabot
- Bump safety-schemas from 0.0.4 to 0.0.5 (#100) @dependabot
- Bump idna from 3.7 to 3.8 (#98) @dependabot
- Bump zipp from 3.20.0 to 3.20.1 (#99) @dependabot
- Bump safety from 3.2.5 to 3.2.7 (#97) @dependabot
- [MRG] bump pypdf version (#46) @bosd
- [MRG] Bump pdfminer Version (#45) @bosd
- Bump stevedore from 5.2.0 to 5.3.0 (#83) @dependabot
- Bump typer from 0.12.4 to 0.12.5 (#84) @dependabot
- Bump rich from 13.7.1 to 13.8.0 (#82) @dependabot
- Bump opencv-python from 4.8.1.78 to 4.10.0.84 in /docs (#79) @dependabot
- Bump nox from 2023.4.22 to 2024.4.15 in /.github/workflows (#77) @dependabot
- Bump actions/setup-python from 4 to 5 (#87) @dependabot
- Bump sphinx-click from 5.0.1 to 6.0.0 in /docs (#78) @dependabot
- Bump matplotlib from 3.8.0 to 3.9.2 in /docs (#76) @dependabot
- Bump sphinx from 7.2.6 to 8.0.2 in /docs (#80) @dependabot
- [IMP][UPD] Documentation Big Bang (#88) @bosd
- Bump authlib from 1.3.1 to 1.3.2 (#81) @dependabot
- Bump pypa/gh-action-pypi-publish from 1.8.10 to 1.9.0 (#72) @dependabot
- Bump virtualenv from 20.24.5 to 20.26.3 in /.github/workflows (#69) @dependabot
- Bump salsify/action-detect-and-tag-new-version from 2.0.1 to 2.0.3 (#74) @dependabot
- Bump pip from 23.2.1 to 24.2 in /.github/workflows (#71) @dependabot
- Bump actions/cache from 3 to 4 (#68) @dependabot
- Bump release-drafter/release-drafter from 5.24.0 to 6.0.0 (#70) @dependabot
- Poetry update (#35) @bosd
- Fix safety issues by update lockfile (#7) @foarsitter