Navigator for Web Archive
-
Updated
Apr 1, 2023 - JavaScript
Navigator for Web Archive
An Apache Spark framework for easy data processing, extraction as well as derivation for web archives and archival collections, developed at Internet Archive.
Parse And Create Web ARChive (WARC) files with node.js
Extract web archive data using Wayback Machine and Common Crawl
Create WebKit/Safari .webarchive files on any platform
Simple python OSINT tool for urls recon thanks to the waybackmachine.
A robust web archive analytics toolkit
Quick Cache and Archive search buttons
A utility for simultaneously creating full-page PDF snapshots and web archives of web pages in DEVONthink Pro.
Seeder - Czech webarchive curating tool and public site
Shepherding our web archives from crawl to access.
A Splitable Hadoop InputFormat for Concatenated GZIP Files and *.(w)arc.gz
This command line converts .webarchive file to resources embed .html file
Parser for WARC (aka WebArchive) files
📑 Rust utilities for working with Apple's Web Archive file format
A .NET Standard 2.0 library to extract a Safari web archive to a folder
Bookmarked archived links
Add a description, image, and links to the webarchive topic page so that developers can more easily learn about it.
To associate your repository with the webarchive topic, visit your repo's landing page and select "manage topics."