Repositories
-
openwhisk
Apache OpenWhisk is an open source serverless cloud platform
-
arrow
Apache Arrow is a cross-language development platform for in-memory data. It specifies a standardized language-independent columnar memory format for flat and hierarchical data, organized for efficient analytic operations on modern hardware. It also provides computational libraries and zero-copy streaming messaging and interprocess communication…
-
incubator-gobblin
A distributed data integration framework that simplifies common aspects of big data integration such as data ingestion, replication, organization and lifecycle management for both streaming and batch data ecosystems.
-
-
spark
Apache Spark - A unified analytics engine for large-scale data processing
-
ozone
Scalable, redundant, and distributed object store for Apache Hadoop
-
-
incubator-teaclave
Apache Teaclave (incubating) is an open source universal secure computing platform
-
tvm
Open deep learning compiler stack for cpu, gpu and specialized accelerators
-
incubator-ratis
Open source Java implementation for Raft consensus protocol.
-
beam
Apache Beam is a unified programming model for Batch and Streaming
-
skywalking
APM, Application Performance Monitoring System
-
lucene-solr
Apache Lucene and Solr open-source search software
-
pulsar
Apache Pulsar - distributed pub-sub messaging system
-
incubator-dolphinscheduler
Dolphin Scheduler is a distributed and easy-to-extend visual workflow scheduling platform, dedicated to solving the complex dependencies in data processing, making the scheduling system out of the box for data processing.(分布式易扩展的可视化工作流任务调度)
-
superset
Apache Superset is a Data Visualization and Data Exploration Platform
-
nutch
Apache Nutch is an extensible and scalable web crawler
-