Skip to content
#

arrow

Here are 193 public repositories matching this topic...

Apache Arrow is a cross-language development platform for in-memory data. It specifies a standardized language-independent columnar memory format for flat and hierarchical data, organized for efficient analytic operations on modern hardware. It also provides computational libraries and zero-copy streaming messaging and interprocess communication. Languages currently supported include C, C++, Java, JavaScript, Python, and Ruby.

  • Updated Feb 27, 2021
  • C++
ttnghia
ttnghia commented Feb 26, 2021

Currently, aggregation APIs (groupby, reductions, rolling, etc.) are scattered around in multiple files and there are inconsistencies between the directory structures in cpp/include/, cpp/src/, cpp/tests/, and cpp/benchmarks/. For example:

cpp/include/:

  • include/cudf/aggregation.hpp
  • include/cudf/groupby.hpp
  • include/cudf/rolling.hpp
  • ....

cpp/src/:

  • src/aggregati
blazingsql
beckernick
beckernick commented Nov 18, 2020

I'd like to be able to call LCASE on a string column to convert it to lowercase, like in MySQL. This is an alias for LOWER, which is noted in #1135 . This is listed as a supported operation on string columns in the Calcite reference, but it may need not currently be available without some changes based on the following:

from pyspark.sql import SparkSession
from blazingsql import Bla
lr4d
lr4d commented Oct 8, 2020

Problem description

Our dask update graphs are not properly optimized.

We ussually use dask.dataframe optimization and set ave_width=repartition_ratio for kartothek.io.dask.dataframe.update_dataset_from_ddf graphs. We should return an optimized graph from update_dataset_from_ddf to make our users' life simple.

We already have code that does this, whoever picks this up can ping me

Improve this page

Add a description, image, and links to the arrow topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the arrow topic, visit your repo's landing page and select "manage topics."

Learn more