gituser/production/: pdfminer-six-20191110 metadata and description

PDF parser and analyzer

author Yusuke Shinyama + Philippe Guglielmetti
author_email pdfminer@goulu.net
classifiers
  • Programming Language :: Python
  • Programming Language :: Python :: 2.6
  • Programming Language :: Python :: 2.7
  • Programming Language :: Python :: 3.4
  • Programming Language :: Python :: 3.5
  • Programming Language :: Python :: 3.6
  • Development Status :: 5 - Production/Stable
  • Environment :: Console
  • Intended Audience :: Developers
  • Intended Audience :: Science/Research
  • License :: OSI Approved :: MIT License
  • Topic :: Text Processing
keywords pdf parser,pdf converter,layout analysis,text mining
license MIT/X
provides_extras docs
requires_dist
  • pycryptodome
  • six
  • sortedcontainers
  • chardet ; python_version > "3.0"
  • nose ; extra == 'dev'
  • tox ; extra == 'dev'
  • sphinx ; extra == 'docs'
  • sphinx-argparse ; extra == 'docs'
File pdfminer.six-20191110-py3-none-any.whl
Size 5 MB
Type Python Wheel
Python 3

Fork of PDFMiner using six for Python 2+3 compatibility

PDFMiner is a tool for extracting information from PDF documents. Unlike other PDF-related tools, it focuses entirely on getting and analyzing text data. PDFMiner lets you obtain the exact location of text on a page, as well as other information such as fonts or lines. It includes a PDF converter that can transform PDF files into other text formats (such as HTML). It has an extensible PDF parser that can be used for purposes other than text analysis.
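
As a rough illustration of obtaining text locations, the sketch below walks each page's layout and reports the bounding box of every text box. It assumes pdfminer.six is installed and uses its low-level classes (PDFResourceManager, PDFPageAggregator, PDFPageInterpreter, PDFPage, LAParams, LTTextBox). The names iter_text_boxes and describe_box are hypothetical helpers made up for this example, not part of the package.

```python
import sys


def describe_box(bbox, text):
    """Format one text box as 'x0,y0,x1,y1: first line of its text'."""
    x0, y0, x1, y1 = bbox
    stripped = text.strip()
    first_line = stripped.splitlines()[0] if stripped else ""
    return "%.1f,%.1f,%.1f,%.1f: %s" % (x0, y0, x1, y1, first_line)


def iter_text_boxes(path):
    """Yield (page_number, bbox, text) for each text box in the PDF at path."""
    # Imported lazily so describe_box stays usable without pdfminer installed.
    from pdfminer.converter import PDFPageAggregator
    from pdfminer.layout import LAParams, LTTextBox
    from pdfminer.pdfinterp import PDFPageInterpreter, PDFResourceManager
    from pdfminer.pdfpage import PDFPage

    rsrcmgr = PDFResourceManager()
    # The aggregator collects the analyzed layout of each processed page.
    device = PDFPageAggregator(rsrcmgr, laparams=LAParams())
    interpreter = PDFPageInterpreter(rsrcmgr, device)
    with open(path, "rb") as fp:
        for page_number, page in enumerate(PDFPage.get_pages(fp), start=1):
            interpreter.process_page(page)
            layout = device.get_result()
            for element in layout:
                if isinstance(element, LTTextBox):
                    # bbox is (x0, y0, x1, y1) in PDF points, origin at bottom-left.
                    yield page_number, element.bbox, element.get_text()


if __name__ == "__main__" and len(sys.argv) > 1:
    for page_number, bbox, text in iter_text_boxes(sys.argv[1]):
        print(page_number, describe_box(bbox, text))
```

Run as a script with a PDF path as the only argument, this prints one line per text box. Figures, lines and other non-text layout elements are skipped by the isinstance check.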