Skip to content
Avatar
Block or Report

Block or report helgeho

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Popular repositories

  1. An Apache Spark framework for easy data processing, extraction as well as derivation for web archives and archival collections, developed at Internet Archive.

    Scala 125 19

  2. Web2Warc Public

    An easy-to-use and highly customizable crawler that enables you to create your own little Web archives (WARC/CDX)

    Scala 22 4

  3. Scripts to transfer archive.org collections, using https://github.com/jjjake/internetarchive

    Python 9 2

  4. A Splitable Hadoop InputFormat for Concatenated GZIP Files and *.(w)arc.gz

    Java 9 4

  5. A Hadoop input format to use gaphs in WebGraph's BV format with Hadoop and Spark.

    Java 7 3

  6. Exspec Public

    Don't write specs anymore, just save 'em while testing your code interactively. Specs will become a byproduct.

    Ruby 5

31 contributions in the last year

Dec Jan Feb Mar Apr May Jun Jul Aug Sep Oct Nov Dec Mon Wed Fri

Contribution activity

December 2022

Created 1 commit in 1 repository
3 contributions in private repositories Dec 7

Seeing something unexpected? Take a look at the GitHub profile guide.