Skip to content
#

arrow

Here are 204 public repositories matching this topic...

Apache Arrow is a cross-language development platform for in-memory data. It specifies a standardized language-independent columnar memory format for flat and hierarchical data, organized for efficient analytic operations on modern hardware. It also provides computational libraries and zero-copy streaming messaging and interprocess communication. Languages currently supported include C, C++, Java, JavaScript, Python, and Ruby.

  • Updated Apr 17, 2021
  • C++
arrow
tlitetrasci
tlitetrasci commented Apr 15, 2021

Issue Description

Arrow does not seem to perform validation on timestamps for unusual formats where the information conflicts.

The following code snippet runs just fine:

>>> arrow.get("2021-01-30 14:00:00 AM", "YYYY-MM-DD hh:mm:ss A")
<Arrow [2021-01-30T14:00:00+00:00]>

First of all, since hh is documented to go to a maximum value of 12, I expect arrow to raise an err

williamBlazing
williamBlazing commented Apr 15, 2021

Is your feature request related to a problem? Please describe.
I would be useful to have the ability to make a reverse copy of a column and/or table.
I am thinking of std::reverse (https://en.cppreference.com/w/cpp/algorithm/reverse) . But not necessarily using iterators, since that might make things more complicated.

Describe the solution you'd like
The following APIs could be

ritchie46
ritchie46 commented Apr 10, 2021

TurboDBC is probably the fastest method to communicate to a db with arrow data. We can implement utility functions for this in Python.

If turbodbc isn't installed we raise an exception. It probably has to be installed with conda, so we cannot add it as optional dependency.

All required arrow interop functions are already available to make this work.

NeroCorleone
NeroCorleone commented Aug 11, 2020

Problem description

Reading a dataset with eager's read functionality raises a ValueError when providing columns.

Example code (ideally copy-pastable)

import pandas as pd

from tempfile import TemporaryDirectory
from functools import partial
from storefact import get_store_from_url

from kartothek.io.eager import store_dataframes_as_dataset, read_dataset_as_data

Improve this page

Add a description, image, and links to the arrow topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the arrow topic, visit your repo's landing page and select "manage topics."

Learn more