#pdfbox
Here are 139 public repositories matching this topic...
Converts a PDF file into a text file while preserving the layout of the original PDF. Useful, for instance, for extracting the content of a table in a PDF file. This is a subclass of the PDFTextStripper class (from the Apache PDFBox library).
Updated Sep 7, 2021 - Java
An HTML to PDF library for the JVM. Based on Flying Saucer and Apache PDFBox 2. With SVG image support. Now also with accessible PDF support (WCAG, Section 508, PDF/UA)!
Updated May 26, 2022 - Java
Boxable is a library that can be used to easily create tables in PDF documents.
Updated Jun 5, 2022 - Java
(Java) A method to extract tabular content from PDF files
Updated Jun 15, 2021 - HTML
Java utility for parsing PDF tabular data using Apache PDFBox and OpenCV
Updated Apr 3, 2022 - Java
Updated Jan 15, 2019 - Java
Checks the PDFs submitted to a conference, e.g., for formatting violations and double anonymous violations
Updated Dec 11, 2021 - Java
Java library for creating fluid page layouts with Apache PDFBox. Supporting multi-page tables, different page layouts etc.
Updated May 30, 2022 - Java
A Java tool/Maven plugin/library to generate HTML and PDF from Markdown text intended for project documentation. Supports a JSON-based "stylesheet" for PDFs.
Updated Jun 10, 2022 - Groovy
Extracts the text content of Word (doc, docx), Excel, PDF, PPT, CSV, and TXT files, and can also extract the table of contents of Word and PDF files.
Updated Jun 17, 2022 - Java
howawong commented on Mar 18, 2018
The workaround is poorly written. It would be better to refactor it into a configuration file and a function.
QRScan: recognition of QR codes in PDF files of scanned documents
Updated Jun 13, 2022 - Java
A Struts2 plugin for creating PDFs from HTML, JSPs, FreeMarker templates and Apache Tiles definitions.
Updated Apr 22, 2022 - Java
A desktop tool to mix, reorder and select PDF pages
Updated Nov 26, 2020 - Java
It looks like the function below returns bytes with value 1 instead of 255, which produces a near-black PNG. For all other types of filters it works fine.
Filter: FlateDecode
ColorSpace: DeviceGray
BitsPerComponent: 1
public static byte[] Convert(ColorSpaceDetails details, IReadOnlyList<byte> decoded, int bitsPerComponent, int imageWidth, int imageHeight);
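To illustrate the expected behaviour, here is a minimal Java sketch of expanding packed 1-bit DeviceGray samples into 8-bit gray values. It is not the code of the function above: the class name OneBitGrayExpander and the method expand are hypothetical, and the sketch assumes the default Decode array [0 1] (a set bit means white), rows padded to a byte boundary, and the most significant bit first. The key step is scaling each bit to 0 or 255 rather than writing it out as 0 or 1.

```java
/** Hypothetical helper, for illustration only; not part of any library. */
final class OneBitGrayExpander {
    private OneBitGrayExpander() {}

    /**
     * Expands packed 1-bit gray samples into one byte per pixel.
     * Assumes each row starts on a byte boundary and bits are stored
     * most significant bit first, with 0 = black and 1 = white.
     */
    static byte[] expand(byte[] packed, int width, int height) {
        int bytesPerRow = (width + 7) / 8; // rows are padded to whole bytes
        if (packed.length < bytesPerRow * height) {
            throw new IllegalArgumentException("not enough data for the given image size");
        }
        byte[] out = new byte[width * height];
        for (int y = 0; y < height; y++) {
            for (int x = 0; x < width; x++) {
                int b = packed[y * bytesPerRow + x / 8] & 0xFF;
                int bit = (b >> (7 - (x % 8))) & 1;
                // Scale to the full 8-bit range: 0 stays 0, 1 becomes 255.
                out[y * width + x] = (byte) (bit * 255);
            }
        }
        return out;
    }
}
```

If the scaling step is skipped, every white pixel is written as gray level 1 out of 255, which would explain the near-black output described above.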