Grow your team on GitHub
GitHub is home to over 50 million developers working together. Join them to grow your own development teams, manage permissions, and collaborate on projects.
Sign upRepositories
-
-
incubator-superset
Apache Superset (incubating) is a modern, enterprise-ready business intelligence web application
-
spark
Apache Spark - A unified analytics engine for large-scale data processing
-
-
shardingsphere
Distributed database middleware
-
zeppelin
Web-based notebook that enables data-driven, interactive data analytics and collaborative documents with SQL, Scala and more.
-
incubator-dolphinscheduler
Dolphin Scheduler is a distributed and easy-to-extend visual workflow scheduling platform, dedicated to solving the complex dependencies in data processing, making the scheduling system out of the box for data processing.(分布式易扩展的可视化工作流任务调度)
-
-
beam
Apache Beam is a unified programming model for Batch and Streaming
-
incubator-gobblin
Gobblin is a distributed big data integration framework (ingestion, replication, compliance, retention) for batch and streaming systems. Gobblin features integrations with Apache Hadoop, Apache Kafka, Salesforce, S3, MySQL, Google etc.
-
skywalking
APM, Application Performance Monitoring System
-
-
airflow
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
-
incubator-mxnet
Lightweight, Portable, Flexible Distributed/Mobile Deep Learning with Dynamic, Mutation-aware Dataflow Dep Scheduler; for Python, R, Julia, Scala, Go, Javascript and more
-
lucene-solr
Apache Lucene and Solr open-source search software
-
submarine
Submarine is Cloud Native Machine Learning Platform.
-
-
-
spamassassin
Read-only mirror of Apache SpamAssassin. Submit patches to https://bz.apache.org/SpamAssassin/. Do not send pull requests
-
infrastructure-blocky-client
Blocky client app for ASF Infra
-
arrow
Apache Arrow is a cross-language development platform for in-memory data. It specifies a standardized language-independent columnar memory format for flat and hierarchical data, organized for efficient analytic operations on modern hardware. It also provides computational libraries and zero-copy streaming messaging and interprocess communication…
-
incubator-ratis
Open source Java implementation for Raft consensus protocol.