Pinned repositories
Repositories
-
scio
A Scala API for Apache Beam and Google Cloud Dataflow.
-
-
heroic Archived
The Heroic Time Series Database
-
dockerfile-mode
An emacs mode for handling Dockerfiles
-
styx
"The path to execution", Styx is a service that schedules batch data processing jobs in Docker containers on Kubernetes.
-
-
-
web-scripts
A collection of base configs and CLI wrappers used to speed up development @ Spotify.
-
-
XCLogParser
Tool to parse Xcode and xcodebuild logs stored in the xcactivitylog format
-
cassandra-medusa
Apache Cassandra backup and restore tool
-
luigi
Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization etc. It also comes with Hadoop support built in.
-
scanctl
A tool to facilitate managing Whitesource data
-
docker_interface
🐳 Declarative interface for building images and running commands in containers using Docker. -
ramlfications
Python parser for RAML
-
flink-on-k8s-operator
Forked from GoogleCloudPlatform/flink-on-k8s-operatorKubernetes operator for managing the lifecycle of Apache Flink and Beam applications.
-
klio
Smarter data pipelines for audio.
-
magnolify
A collection of Magnolia add-on modules
-
ratatool
A tool for data sampling, data generation, and data diffing
-
flyte-flink-plugin
Flyte Flink k8s plugin.
-
zoltar
Common library for serving TensorFlow, XGBoost and scikit-learn models in production.
-
dbeam
DBeam exports SQL tables into Avro files using JDBC and Apache Beam
-
NFDriver
A cross platform C++ audio driver with low latency.
-
gordon-introspection
Introspection Server Plugin for Gordon: Event-Driven Cloud DNS registration
-
-
-
featran
A Scala feature transformation library for data science and machine learning
-
gcs-tools
GCS support for avro-tools, parquet-tools and protobuf
-
big-data-rosetta-code
Code snippets for solving common big data problems in various platforms. Inspired by Rosetta Code