python-api

Here are 159 public repositories matching this topic...

vuule commented Nov 4, 2020

The current default value of the CSV writer's rows_per_chunk parameter is 8, which means that by default the input table is broken into many small slices that are written out sequentially. This reduces performance by an order of magnitude in some cases.

In the Python layer, the default is the number of rows (i.e. the table is written out in a single pass). We can follow this by setting rows_per_chunk
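
To illustrate how the chunk size determines the number of sequential write passes, here is a minimal sketch using only Python's standard csv module. It is not the writer discussed above: the function write_csv_in_chunks and its signature are hypothetical, and only the parameter name rows_per_chunk is borrowed from the comment.

```python
import csv
import io


def write_csv_in_chunks(rows, out, rows_per_chunk):
    """Write `rows` to `out` in slices of `rows_per_chunk` rows.

    Hypothetical helper for illustration only. Returns the number of
    slices (sequential write passes) that were needed.
    """
    if rows_per_chunk <= 0:
        raise ValueError("rows_per_chunk must be positive")
    writer = csv.writer(out)
    n_chunks = 0
    for start in range(0, len(rows), rows_per_chunk):
        # Each iteration is one sequential write of a small slice.
        writer.writerows(rows[start:start + rows_per_chunk])
        n_chunks += 1
    return n_chunks


rows = [[i, i * i] for i in range(1000)]

# A chunk size of 8 splits the table into many small slices.
small_buf = io.StringIO()
small_chunks = write_csv_in_chunks(rows, small_buf, rows_per_chunk=8)

# A chunk size equal to the row count writes the table in a single pass.
single_buf = io.StringIO()
single_chunks = write_csv_in_chunks(rows, single_buf, rows_per_chunk=len(rows))
```

Both calls produce the same CSV text; only the number of passes differs, and that per-pass overhead is what a larger default would avoid.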

hzeller commented May 15, 2020

When running the regression, the resulting logs seem to end up in third_party/tests/$TEST/.... This of course does not go unnoticed by git, so a git status shows a ton of new untracked files.

To reproduce:

make
make regression
git status   # observe all the files

Build or test artifacts should never clutter the rest of the code-base (we should regard them as read-only in
