#
webarchives
Here are 15 public repositories matching this topic...
The Archives Unleashed Toolkit is an open-source toolkit for analyzing web archives.
java
scala
spark
apache-spark
hadoop
analysis
python3
pyspark
digital-humanities
dataframe
big-data-analytics
webarchives
-
Updated
Jun 26, 2020 - Scala
A Rails engine supporting the discovery of web archives.
-
Updated
Jun 27, 2020 - Ruby
A dockerized, queued high fidelity web archiver based on Squidwarc
-
Updated
Dec 6, 2018 - Python
Links on the web break all the time, robustify them!
-
Updated
Jun 23, 2020 - JavaScript
Docker image for the Archives Unleashed Toolkit
-
Updated
Jun 18, 2020 - Dockerfile
Rails application for the Archives Unleashed Cloud.
-
Updated
Jun 18, 2020 - HTML
Seeder - Czech webarchive curating tool and public site
-
Updated
Jun 19, 2020 - Python
anjackson
commented
Mar 20, 2019
Our curators have suggested that a function that returns the most recent crawl date (rather than the crawl status) would be useful. e.g.
=WEBARCHIVE_LAST_MEMENTO_DATE_UKWA({url})
This repository contains source code for interacting with Archive-It.
-
Updated
Mar 3, 2020 - Python
shawnmjones
commented
Jul 10, 2019
Raintale already supports the link and text story element types in JSON input. An image story element type would allow users to handle images in a special way with their templates. It should not be handled that differently from text. T
A framework of algorithms for sampling mementos from a web archive collection.
-
Updated
Jun 25, 2020 - Python
-
Updated
Sep 20, 2017 - JavaScript
-
Updated
Nov 28, 2017 - TypeScript
Improve this page
Add a description, image, and links to the webarchives topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the webarchives topic, visit your repo's landing page and select "manage topics."
Are you submitting a bug report or a feature request?
Feature request/documentation enhancement
What is the current behavior?
The requirements for a user to get up and running are insufficient with regard to the requirements and dependencies. I encountered this experience when trying to resolve #31 on a fresh Win