oap-project

Repositories

  • C 4 0 2 2 Updated Apr 28, 2021
  • native-sql-engine

    Native SQL Engine plugin for Spark SQL with vectorized SIMD optimizations.

    Scala Apache-2.0 15 19 44 14 Updated Apr 28, 2021
  • oap-mllib

    Optimized Spark package to accelerate machine learning algorithms in Apache Spark MLlib.

    Scala Apache-2.0 3 2 10 3 Updated Apr 28, 2021
  • raydp

    RayDP: Distributed data processing library that provides simple APIs for running Spark on Ray and integrating Spark with distributed deep learning and machine learning frameworks.

    Python Apache-2.0 17 73 19 4 Updated Apr 28, 2021
  • sql-ds-cache

    Spark* plug-in for accelerating Spark* SQL performance by using caching and indexing at the SQL data source layer.

    Scala Apache-2.0 8 9 7 1 Updated Apr 27, 2021
  • solution-navigator

    Example solutions or code for using OAP features.

    Jupyter Notebook Apache-2.0 1 0 0 0 Updated Apr 27, 2021
  • arrow

    Forked from apache/arrow

    Apache Arrow is a cross-language development platform for in-memory data. It specifies a standardized language-independent columnar memory format for flat and hierarchical data, organized for efficient analytic operations on modern hardware. It also provides computational libraries and zero-copy streaming messaging and interprocess communication…

    C++ Apache-2.0 1,885 0 0 2 Updated Apr 27, 2021
  • oap-project.github.io

    The OAP project web site

    JavaScript Apache-2.0 0 0 0 0 Updated Apr 27, 2021
  • pmem-shuffle

    Spark* shuffle plugin that supports shuffling through remote persistent memory over fabrics. It leverages the RDMA network and remote persistent memory (for reads) to provide high-performance, low-latency shuffle for Spark*.

    C++ Apache-2.0 6 4 2 0 Updated Apr 25, 2021
  • remote-shuffle

    Spark* shuffle plugin that supports shuffling data through a remote Hadoop-compatible file system, as opposed to vanilla Spark's local disks.

    Scala Apache-2.0 4 4 1 1 Updated Apr 22, 2021
  • pmem-spill

    Spark plug-in package for accelerating Spark runtime spill functions using PMem, such as the RDD cache PMem extension.

    Scala Apache-2.0 3 2 8 2 Updated Apr 21, 2021
  • pmem-common

    Common library for accessing PMem native library functions, including memkind, vmemcache, and so on.

    Java Apache-2.0 4 2 2 1 Updated Apr 21, 2021
  • libhdfs3-downstream

    Forked from martindurant/libhdfs3-downstream

    A native C/C++ HDFS client (downstream fork from apache-hawq).

    C++ Apache-2.0 51 0 0 0 Updated Mar 16, 2021
  • arrow-data-source

    Spark DataSource plugin for reading files of various formats, such as Parquet, into Arrow-compatible columnar vectors.

    Scala Apache-2.0 7 3 3 0 Updated Mar 12, 2021
