-
Updated
Nov 4, 2021 - Python
#
hadoop
Here are 2,652 public repositories matching this topic...
Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.
python
aws
data-science
machine-learning
caffe
theano
big-data
spark
deep-learning
hadoop
tensorflow
numpy
scikit-learn
keras
pandas
kaggle
scipy
matplotlib
mapreduce
Luigi is a Python module that helps you build complex pipelines of batch jobs. It handles dependency resolution, workflow management, visualization etc. It also comes with Hadoop support built in.
-
Updated
Nov 9, 2021 - Python
编程电子书,电子书,编程书籍,包括C,C#,Docker,Elasticsearch,Git,Hadoop,HeadFirst,Java,Javascript,jvm,Kafka,Linux,Maven,MongoDB,MyBatis,MySQL,Netty,Nginx,Python,RabbitMQ,Redis,Scala,Solr,Spark,Spring,SpringBoot,SpringCloud,TCPIP,Tomcat,Zookeeper,人工智能,大数据类,并发编程,数据库类,数据挖掘,新面试题,架构设计,算法系列,计算机类,设计模式,软件测试,重构优化,等更多分类
mysql
python
linux
docker
redis
elasticsearch
spark
spring
hadoop
rabbitmq
solr
jvm
netty
springboot
mybatis
springcloud
-
Updated
Oct 8, 2021
H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.
python
java
data-science
machine-learning
multi-threading
opensource
r
big-data
spark
deep-learning
hadoop
random-forest
gpu
naive-bayes
h2o
distributed
pca
gbm
ensemble-learning
automl
-
Updated
Nov 14, 2021 - Jupyter Notebook
Alluxio, data orchestration for analytics and machine learning in the cloud
spark
presto
hadoop
tensorflow
data-analysis
alluxio
memory-speed
data-orchestration
virtual-distributed-filesystem
-
Updated
Nov 14, 2021 - Java
At LinkedIn, we are using this curriculum for onboarding our entry-level talents into the SRE role.
-
Updated
Nov 5, 2021 - HTML
findepi
commented
Sep 6, 2021
With Hive connector
trino:default> CREATE TABLE one (a varchar);
-> CREATE VIEW two AS SELECT * FROM one;
CREATE TABLE
CREATE VIEW
DROP TABLE is rejected on a view:
trino:default> DROP TABLE two;
Query 20210906_150832_00015_id3y3 failed: line 1:1: Table 'hive.default.two' does not exist, but a view with that name exists. Did you mean DROP VIEW hive.default.t
Apache Ignite
iot
cloud
sql
database
big-data
hadoop
cache
osgi
ignite
network-server
in-memory-database
data-management-platform
network-client
distributed-sql-database
in-memory-computing
-
Updated
Nov 13, 2021 - Java
基于Spark的电影推荐系统,包含爬虫项目、web网站、后台管理系统以及spark推荐系统
-
Updated
Apr 1, 2019 - Java
生产环境的海量数据计算产品,文档地址:
-
Updated
Nov 11, 2021 - Java
AI on Hadoop
-
Updated
Jul 22, 2021 - Java
zhanghaicheng1
commented
Dec 9, 2019
问题描述:关于工程中某个工作流节点单独执行不报错,保存工作流时提示:NotClassDefFoundError:Cloud not initialize class dispatch.Http$
问题原因:linkis-publish微服务关于netty-3.6.2.Final.jar升级包缺失
问题解决:将升级包上传重启linkis-publish微服务即可。
Open
Refactor existing hashCode() to include class for methods using the Apache Commons Lang 3 style
3
GCHQDev404
commented
Nov 3, 2020
Append class to all HashCodeBuilders in Gaffer for the below issue to minimise hash collisions.
@Test
void name() {
Foo foo = new Foo();
Bar bar = new Bar();
assertFalse(foo.equals(bar));
assertNotEquals(foo.hashCode(), bar.hashCode()); //fails
}
class Bar {
int a = 3;
@Override
public int hashCode() {
emilyreff7
commented
Nov 3, 2021
See #3097 which introduces support for Pandas, Dask, and Pyspark.
Apache Hadoop docker image
-
Updated
Sep 15, 2021 - Shell
深圳地铁大数据客流分析系统🚇 🚄 🌟
mysql
docker
redis
elasticsearch
phoenix
kibana
scala
kafka
spark
mongodb
hive
hadoop
clickhouse
hbase
zookeeper
springboot
flink
kylin
cdh6
szt-bigdata
-
Updated
May 9, 2021 - Scala
High performance data store solution
-
Updated
Nov 14, 2021 - Scala
MooseFS – Open Source, Petabyte, Fault-Tolerant, Highly Performing, Scalable Network Distributed File System (Software-Defined Storage)
fuse
big-data
hadoop
storage
clustering
filesystem
scalability
distributed-computing
snapshot
posix
file-system
distributed-storage
high-availability
software-defined-storage
distributed-file-system
storage-tiering
moosefs
posix-compliant
petabyte
ditributed-systems
-
Updated
Oct 20, 2021 - C
Improve this page
Add a description, image, and links to the hadoop topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the hadoop topic, visit your repo's landing page and select "manage topics."
At this moment relu_layer op doesn't allow threshold configuration, and legacy RELU op allows that.
We should add configuration option to relu_layer.