oap-project

native-sql-engine

Native SQL Engine plugin for Spark SQL with vectorized SIMD optimizations.

spark arrow native-sql-engine vectorized-simd-optimizations native-kernels

Scala Apache-2.0 15 19 44 14 Updated Apr 28, 2021

oap-mllib

Optimized Spark package to accelerate machine learning algorithms in Apache Spark MLlib.

Scala Apache-2.0 3 2 10 3 Updated Apr 28, 2021

raydp

RayDP: Distributed data processing library that provides simple APIs for running Spark on Ray and integrating Spark with distributed deep learning and machine learning frameworks.

spark ray

Python Apache-2.0 17 73 19 4 Updated Apr 28, 2021

sql-ds-cache

Spark* plug-in for accelerating Spark* SQL performance by using cache and index at SQL data source layer.

Scala Apache-2.0 8 9 7 1 Updated Apr 27, 2021

solution-navigator

Example solutions or code for using OAP features.

Jupyter Notebook Apache-2.0 1 0 0 0 Updated Apr 27, 2021

Apache Arrow is a cross-language development platform for in-memory data. It specifies a standardized language-independent columnar memory format for flat and hierarchical data, organized for efficient analytic operations on modern hardware. It also provides computational libraries and zero-copy streaming messaging and interprocess communication…

C++ Apache-2.0 1,885 0 0 2 Updated Apr 27, 2021

C++ Apache-2.0 51 0 0 0 Updated Mar 16, 2021

arrow-data-source

Spark DataSouce plugin for reading files from various formats like Parquet into Arrow compatible columnar vectors.

Scala Apache-2.0 7 3 3 0 Updated Mar 12, 2021

oap-project

Repositories

oap-tools

native-sql-engine

oap-mllib

raydp

sql-ds-cache

solution-navigator

arrow

oap-project.github.io

pmem-shuffle

remote-shuffle

pmem-spill

pmem-common

libhdfs3-downstream

arrow-data-source

Top languages

Most used topics

People