#
documents
Here are 420 public repositories matching this topic...
MM-Wiki 一个轻量级的企业知识分享与团队协同软件,可用于快速构建企业 Wiki 和团队知识分享平台。部署方便,使用简单,帮助团队构建一个信息共享、文档管理的协作环境。
-
Updated
Jul 13, 2020 - Go
iText 7 for Java represents the next level of SDKs for developers that want to take advantage of the benefits PDF can bring. Equipped with a better document engine, high and low-level programming capabilities and the ability to create, edit and enhance PDF documents, iText 7 can be a boon to nearly every workflow.
svg
pdf
security
library
sdk
encryption
accessibility
archiving
digital-signature
documents
pades
acroform
pdf-generation
itext
pdfa
gdpr
xfdf
pdfua
pades-standard
ccpa
-
Updated
Jul 17, 2020 - Java
Emacs document annotator, using Org-mode
-
Updated
Jul 1, 2020 - Emacs Lisp
iText 7 for .NET is the .NET version of the iText 7 library, formerly known as iTextSharp, which it replaces. iText 7 represents the next level of SDKs for developers that want to take advantage of the benefits PDF can bring. Equipped with a better document engine, high and low-level programming capabilities and the ability to create, edit and enhance PDF documents, iText 7 can be a boon to nearly every workflow.
svg
pdf
security
library
sdk
encryption
accessibility
archiving
digital-signature
documents
itextsharp
pades
acroform
pdf-generation
itext
pdfa
gdpr
xfdf
pdfua
ccpa
-
Updated
Jul 17, 2020 - C#
Document-oriented, embedded SQL database, works with Bolt, Badger and memory
-
Updated
Jul 17, 2020 - Go
Kotlin 语言中文站
-
Updated
Jul 16, 2020 - JavaScript
The Learning Hub for UoL's Online CS Students
slack
computer-science
students
youtube
books
university
modules
curriculum
calendar
notes
resources
podcasts
websites
courses
software
documents
bsc
bugs
professors
london
uol
graduate
degree
goldsmiths
-
Updated
Jul 16, 2020 - Python
Python based Open Source ETL tools for file crawling, document processing (text extraction, OCR), content analysis (Entity Extraction & Named Entity Recognition) & data enrichment (annotation) pipelines & ingestor to Solr or Elastic search index & linked data graph database
python
pdf
elasticsearch
enrichment
ocr
annotation
etl
solr
rdf
extractor
extract
extract-information
named-entity-recognition
documents
ingest
extract-text
enrichment-analysis
solr-dataimporter
ingests-documents
ingestion-pipeline
-
Updated
Apr 12, 2020 - Python
Cryptee's web client source code for all platforms.
-
Updated
Jun 13, 2020 - JavaScript
LexPredict ContraxSuite
-
Updated
Jul 3, 2020 - Python
search document dumps: ingest and explore in one extensible framework
-
Updated
Jun 22, 2020 - JavaScript
Uwazi is a web-based, open-source solution for building and sharing document collections
-
Updated
Jul 17, 2020 - JavaScript
Read SVG files and convert them to other formats.
-
Updated
Mar 30, 2020 - Python
Implementation of my paper "Real-time Document Localization in Natural Images by Recursive Application of a CNN."
machine-learning
real-time
computer-vision
tensorflow
paper
cnn
pytorch
dataset
documents
convolutional-neural-networks
-
Updated
Feb 1, 2019 - Python
Coleção de validadores e geradores para documentos oficiais brasileiros, como: Inscrição Estadual, CPF, CNPJ.
-
Updated
Feb 5, 2020 - Ruby
CGo bindings to LibreOfficeKit
-
Updated
Jan 24, 2018 - Go
Improve this page
Add a description, image, and links to the documents topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the documents topic, visit your repo's landing page and select "manage topics."
Right now, there's a "languages" feature implemented, which allows the user to define, in what kind of languages the documents he usually uploads are written in. Under "Settings", each Paperwork user can select the languages he'd like Paperwork to support for his account.
This was being implemented, so that tesseract can be called with the according language option, which helps OCR.
Now, thi