Grow your team on GitHub
GitHub is home to over 50 million developers working together. Join them to grow your own development teams, manage permissions, and collaborate on projects.
Sign upRepositories
-
-
spark
Apache Spark - A unified analytics engine for large-scale data processing
-
arrow
Apache Arrow is a cross-language development platform for in-memory data. It specifies a standardized language-independent columnar memory format for flat and hierarchical data, organized for efficient analytic operations on modern hardware. It also provides computational libraries and zero-copy streaming messaging and interprocess communication…
-
apisix-ingress-controller
ingress controller for K8s
-
airflow
Apache Airflow - A platform to programmatically author, schedule, and monitor workflows
-
zeppelin
Web-based notebook that enables data-driven, interactive data analytics and collaborative documents with SQL, Scala and more.
-
camel
Apache Camel is an open source integration framework that empowers you to quickly and easily integrate various systems consuming or producing data.
-
shardingsphere
Distributed database middleware
-
-
-
-
hudi
Upserts, Deletes And Incremental Processing on Big Data.
-
comdev-site
Website sources for the Apache Community Development Website
-
servicecomb-service-center
A standalone service center to allow services to register their instance information and to discover providers of a given service
-
-
-
-
incubator-superset
Apache Superset is a Data Visualization and Data Exploration Platform
-
beam
Apache Beam is a unified programming model for Batch and Streaming
-
ignite-3
Apache Ignite 3.x
-
cassandra-in-jvm-dtest-api
Apache Cassandra in-JVM DTest API
-
camel-k-runtime
Apache Camel K runtime
-