GitHub - garabik/pdfshapeminer: Extract text from pdf using pdfminer and shapely

LGPL-3.0 license

Repository files: .gitignore, LICENSE, README.md, pdf2txt.py

This is a rough attempt at text extraction from PDF documents.

requirements:

Python 2.7
pdfminer: https://github.com/euske/pdfminer (tested with version 20140328)
shapely: http://toblerity.org/shapely/project.html (tested with 1.3, from Debian)
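
A quick way to confirm that the requirements above are installed is to try importing them. The following is a minimal sketch and not part of this repository: the helper name `missing_modules` is hypothetical, and the import names `pdfminer` and `shapely` are assumed to match the packages listed above. It only checks that the modules can be imported, not which versions are present.

```python
import importlib

# Import names assumed from the requirement list above.
REQUIRED_MODULES = ("pdfminer", "shapely")


def missing_modules(names=REQUIRED_MODULES):
    """Return the names from `names` that cannot be imported."""
    missing = []
    for name in names:
        try:
            importlib.import_module(name)
        except ImportError:
            # The module is not installed (or not on the import path).
            missing.append(name)
    return missing


if __name__ == "__main__":
    missing = missing_modules()
    if missing:
        print("Missing requirements: " + ", ".join(missing))
    else:
        print("All requirements are importable.")
```

Run as a script, it prints either the names that failed to import or a confirmation that both libraries were found.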
