Internet Archive

The Internet Archive is "the library of the Internet", and a big supporter of Free Software.

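iaux-modal-manager (Public)
A Modal Manager WebComponent
Language: TypeScript | Stars: 0 | License: AGPL-3.0 | Forks: 1 | Issues: 0 | Pull requests: 5 | Updated Jun 22, 2022

wcdimportbot (Public)
Import workflows for the Wikipedia Citations Database
Language: Python | Stars: 2 | Forks: 3 | Issues: 0 | Pull requests: 1 | Updated Jun 22, 2022

infogami (Public)
Language: Python | Stars: 33 | License: AGPL-3.0 | Forks: 40 | Issues: 9 | Pull requests: 5 | Updated Jun 22, 2022

openlibrary (Public)
One webpage for every book ever published!
Language: Python | Stars: 3,605 | License: AGPL-3.0 | Forks: 847 | Issues: 674 (30 issues need help) | Pull requests: 77 | Updated Jun 22, 2022

iaux-collection-browser (Public)
Language: TypeScript | Stars: 0 | License: AGPL-3.0 | Forks: 1 | Issues: 0 | Pull requests: 8 | Updated Jun 22, 2022

heritrix3 (Public)
Heritrix is the Internet Archive's open-source, extensible, web-scale, archival-quality web crawler project.
Language: Java | Stars: 2,225 | Forks: 711 | Issues: 49 | Pull requests: 7 | Updated Jun 22, 2022

trough (Public)
Trough: Big data, small databases.
Language: Python | Stars: 31 | License: BSD-2-Clause | Forks: 7 | Issues: 1 | Pull requests: 6 | Updated Jun 21, 2022

iaux (Public)
Monorepo for Archive.org UX development and prototyping.
Language: JavaScript | Stars: 57 | License: AGPL-3.0 | Forks: 86 | Issues: 80 (5 issues need help) | Pull requests: 59 | Updated Jun 20, 2022

brozzler (Public)
brozzler - distributed browser-based web crawler
Language: Python | Stars: 533 | License: Apache-2.0 | Forks: 86 | Issues: 26 | Pull requests: 7 | Updated Jun 20, 2022

bookreader (Public)
The Internet Archive BookReader
Language: JavaScript | Stars: 722 | License: AGPL-3.0 | Forks: 355 | Issues: 99 (5 issues need help) | Pull requests: 73 | Updated Jun 20, 2022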
