Bitextor Team
Translation memories generator
Grow your team on GitHub
GitHub is home to over 40 million developers working together. Join them to grow your own development teams, manage permissions, and collaborate on projects.
Sign up-
bicleaner
Parallel corpus classifier/cleaner
-
bitextor
Bitextor generates translation memories from multilingual websites.
-
bifixer
Tool to fix bitexts and tag near-duplicates for removal
-
pdf-extract
PDF parser and converter to HTML
-
python-pdfextract
Forked from misja/python-boilerpipePython interface to pdf-extract, HTML extraction from PDF
-
bitextor-data
Repository for data models, dictionaries and more resources for Bitextor
-
binonymizer
Anonymizer module for Bicleaner's pipeline (WIP)
-
hunalign
The hunalign sentence aligner. Forked from version 1.2