Grow your team on GitHub
GitHub is home to over 50 million developers working together. Join them to grow your own development teams, manage permissions, and collaborate on projects.
Sign upPinned repositories
Repositories
-
-
ratatool
A tool for data sampling, data generation, and data diffing
-
magnolify
A collection of Magnolia add-on modules
-
featran
A Scala feature transformation library for data science and machine learning
-
web-scripts
A collection of base configs and CLI wrappers used to speed up development @ Spotify.
-
klio
Smarter data pipelines for audio.
-
-
scio
A Scala API for Apache Beam and Google Cloud Dataflow.
-
-
missinglink
Build time tool for detecting link problems in java projects
-
SPTDataLoader
The HTTP library used by the Spotify iOS client
-
-
annoy
Approximate Nearest Neighbors in C++/Python optimized for memory usage and loading/saving to disk
-
-
luigi
Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization etc. It also comes with Hadoop support built in.
-
-
styx
"The path to execution", Styx is a service that schedules batch data processing jobs in Docker containers on Kubernetes.
-
zoltar
Common library for serving TensorFlow, XGBoost and scikit-learn models in production.
-
reactochart
📈 React chart component library📉 -
JniHelpers
Tools for writing great JNI code
-
NFPlayerJS
A JavaScript/TypeScript audio engine for the Web and Server capable of multitrack time stretching, pitch shifting, declarative effects, faster than realtime processing, and more!
-
completable-futures
Utilities for working with futures in Java 8
-
-
-
big-data-rosetta-code
Code snippets for solving common big data problems in various platforms. Inspired by Rosetta Code
-
-
-
dbeam
DBeam exports SQL tables into Avro files using JDBC and Apache Beam