Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spar…
#
spark
Repositories 3,561
Learn and understand Docker technologies, with real DevOps practice!
Deeplearning4j, ND4J, DataVec and more - deep learning & linear algebra for Java/Scala with GPUs + Spark
汇总java生态圈常用技术框架、开源中间件,系统架构、数据库、大公司架构案例、常用三方类库、项目管理、线上问题排查、个人成长、思考等知识
Updated Apr 20, 2019
Kubernetes中文指南/云原生应用架构实践手册 - https://jimmysong.io/kubernetes-handbook
macOS development environment setup: Easy-to-understand instructions with automated setup scripts for developer tools…
List of Data Science Cheatsheets to rule the world
Updated Apr 11, 2019
A Flexible and Powerful Parameter Server for large-scale machine learning
Alluxio, formerly Tachyon, Unify Data at Memory Speed
alluxio
distributed-storage
big-data
memory-speed
hadoop
spark
virtual-file-system
presto
tensorflow
Java
Updated May 1, 2019
Open Source Fast Scalable Machine Learning Platform For Smarter Applications: Deep Learning, Gradient Boosting & XGBo…
h2o
machine-learning
data-science
deep-learning
big-data
ensemble-learning
gbm
random-forest
naive-bayes
pca
opensource
distributed
multi-threading
java
python
r
hadoop
spark
gpu
automatic
Java
Updated May 1, 2019
PipelineAI: Real-Time Enterprise AI Platform
machine-learning
artificial-intelligence
tensorflow
kubernetes
elasticsearch
cassandra
spark
kafka
netflixoss
presto
airflow
pipeline
docker
redis
neural-network
gpu
microservices
nifi
scikit
prediction
Java
Updated Apr 17, 2019
TensorFlowOnSpark brings TensorFlow programs to Apache Spark clusters.
Python
Updated Apr 18, 2019
Open-source IoT Platform - Device management, data collection, processing and visualization.
BigDL: Distributed Deep Learning Library for Apache Spark
Scala
Updated Apr 30, 2019
Interactive and Reactive Data Science using Scala and Spark.
Python clone of Spark, a MapReduce alike framework in Python
Python
Updated Jan 23, 2019
酷玩 Spark: Spark 源代码解析、Spark 类库等
Scala
Updated Feb 6, 2019
REST job server for Apache Spark
A minimal benchmark for scalability, speed and accuracy of commonly used open source implementations (R packages, Pyt…
machine-learning
data-science
r
python
gradient-boosting-machine
random-forest
deep-learning
xgboost
h2o
spark
R
Updated Sep 15, 2018
deeplearning4j / nd4j Archived
1.6k
Fast, Scientific and Numerical Computing for the JVM (NDArrays)
java
gpu
scientific
nd4j
jvm
dl4j
backend
scala-notebook
spark
artificial-intelligence
scientific-computing
numerical-calculations
Java
Updated Jun 16, 2018
DataStax Spark Cassandra Connector
Scala
Updated Apr 17, 2019
A large-scale entity and relation database supporting aggregation of properties
TransmogrifAI (pronounced trăns-mŏgˈrə-fī) is an AutoML library for building modular, reusable, strongly typed machin…
Microsoft Machine Learning for Apache Spark
spark ml 算法原理剖析以及具体的源码实现分析
Updated Mar 25, 2019
Compile-time Language Integrated Queries for Scala
Scala
Updated Apr 30, 2019
The Hunting ELK
Machine Learning Platform and Recommendation Engine built on Kubernetes
machine-learning
deep-learning
deployment
kubernetes
docker
microservices
spark
kafka
kafka-streams
tensorflow
python
java
cloud
aws
gcp
azure
seldon
recommender-system
recommendation-engine
prediction
Java
Updated Jul 28, 2018
A better compressed bitset in Java
Distributed Deep learning with Keras & Spark
Python
Updated Apr 10, 2019