# xpdf
Here are 14 public repositories matching this topic...
Updated Mar 24, 2022 - TypeScript
This is a highly efficient python wrapper for tesseract-ocr.
Updated Apr 30, 2021 - Python
Static library built from the source of www.xpdfreader.com, with most dependencies built in
Updated Jan 27, 2022 - C++
"Documents Search Engine" based on Lucene for indexing and searching many types of documents
Updated Feb 2, 2017 - PHP
From using xpdf, rvest, and quanteda on United Nations Digital Library search results to applying dictionaries to speeches in United Nations meeting records
Updated Apr 16, 2019 - R
Batch-convert PDF to text and extract data from PDFs in Python
pdf-converter
pandas
data-extraction
pdf-to-text
regular-expressions
pdf-reader
data-cleaning
pdf-to-excel
pypdf2
pdftotext
batch-conversion
pdf-parser
pdf-data-extraction
xpdf
pdf-tools
pypdf
python-automation
python-pdf
batch-converter
indirectobject
Updated Sep 29, 2021 - Python
`Config.load_file()` removes the xpdfrc settings that come from pyxpdf_data. pyxpdf_data introduces new encodings through an automatically generated xpdfrc, and loading another xpdfrc discards them.
This can be solved by appending the user-provided xpdfrc to pyxpdf_data's xpdfrc.
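The proposed fix could be sketched roughly as follows, using only the Python standard library. This is an illustration, not pyxpdf's actual implementation: `merge_xpdfrc` is a hypothetical helper, and the location of the generated xpdfrc inside pyxpdf_data is assumed to be known to the caller.

```python
import tempfile
from pathlib import Path


def merge_xpdfrc(base_path, user_path):
    """Write the base xpdfrc followed by the user xpdfrc to a new file.

    Hypothetical helper: `base_path` stands for the xpdfrc generated by
    pyxpdf_data, `user_path` for the file the user wants to load.
    Returns the path of the merged file.
    """
    base_text = Path(base_path).read_text(encoding="utf-8")
    user_text = Path(user_path).read_text(encoding="utf-8")

    # Make sure the last base line is terminated, so the first user
    # line is not glued onto it.
    if base_text and not base_text.endswith("\n"):
        base_text += "\n"

    # delete=False keeps the file on disk after the handle is closed,
    # so it can be passed on to a config loader by path.
    with tempfile.NamedTemporaryFile(
        "w", suffix=".xpdfrc", delete=False, encoding="utf-8"
    ) as merged:
        merged.write(base_text)
        merged.write(user_text)
    return merged.name
```

The merged path would then be handed to `Config.load_file()` in place of the user's file. Because the user's lines come last, the settings from pyxpdf_data stay in the file and the user's settings are added after them. Whether a later line overrides an earlier one depends on how xpdf handles repeated settings, which this sketch does not address.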