Home
Java
CSharp
Python
Javascript
Go
Rust
Machine Learning
Contact Us
Main Menu
Home
Open Source
Articles
Tech Stack
News
Contact Us
Connect:
Search
Suggested keywords:
Java
Docker
Git
React
NextJs
Spring boot
Laravel
Projects
text-extraction
Pandoc - General Markup Converter
mail-parser
talon - Mailgun library to extract message quotations and signatures
text-analysis
Apache Tika - A content analysis toolkit
pdf
borb - Library for reading, creating and manipulating PDF files in Python
ocr
JavaOCR
text-extraction
TCPDF - PHP class for generating PDF
document-pipeline
UIMA - Unstructured information management architecture
ocr
Tessnet2
text-extraction
PDF Library - PDF manipulation in .NET
pdf
jPod - PDF manipulating and rendering framework
document-conversion
JODConverter - Automates document conversions using OpenOffice
pdf
PDFJet - PDF library for Java and .NET
document-processing
documents4j - Java library for converting documents into another document format
TechStack
java (16)
python (3)
c (2)
csharp (2)
haskell (2)
php (1)
ruby (1)
vbnet (1)
Tagcloud
pdf (11)
pdf-library (10)
document-conversion (7)
pdf-library-dotnet (6)
document-processing (5)
pdf-library-java (5)
content-connector (4)
connector (3)
document-pipeline (3)
microsoft-documents (3)
ocr (3)
optical-character-recognition (3)
text-analysis (3)
crawler (2)
document (2)
library (2)
markup (2)
office (2)
pdf-generation (2)
License
apache (11)
agpl (5)
bsd (4)
lgpl (3)
gplv2 (2)
gpl (1)
lgplv3 (1)
mit (1)
public (1)
1
2