#

pydata

Here are 82 public repositories matching this topic...

jsignell commented Nov 5, 2020

I just ran into an issue when trying to use to_csv with distributed workers that don't share a file system. I shouldn't have been surprised that writing to a local file system from a distributed worker doesn't work. It shouldn't work. But the error I got was just a File Not Found error. That brought me to dask/dask#2656 (comment), which was the answer.
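
One way to surface this earlier than a File Not Found error is a pre-flight check on the output target. The sketch below is only an illustration: `is_worker_local` and `check_target` are hypothetical helpers, not part of dask, and the check assumes that any target without a remote protocol (such as `s3://`) resolves to a file system private to each worker.

```python
from urllib.parse import urlparse

# Hypothetical helpers for illustration; these are not dask APIs.
LOCAL_SCHEMES = {"", "file"}


def is_worker_local(target: str) -> bool:
    """Return True when the target has no remote protocol (e.g. s3://)."""
    return urlparse(target).scheme in LOCAL_SCHEMES


def check_target(target: str, distributed: bool) -> None:
    """Raise a descriptive error before any worker tries to write."""
    if distributed and is_worker_local(target):
        raise ValueError(
            f"{target!r} points at a local file system; with distributed "
            "workers each worker would write to its own disk. Use storage "
            "that every worker can reach."
        )


check_target("s3://bucket/export-*.csv", distributed=True)  # passes silently
```

The check is deliberately conservative: a path on a shared network mount would also be flagged, so a warning may be a better fit than an exception in that setting.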

jcrist commented Sep 15, 2020

Clients created with worker_client or get_client don't respect the timeout settings (e.g. distributed.comm.timeouts.connect). The timeout is available to set programmatically, but defaults to 3 rather than falling back to the config file. I think this should be as simple as replacing timeout=3 with timeout=None throughout that code path.
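
The fallback pattern described here can be sketched in isolation. This is a minimal illustration, not the distributed code path: `CONFIG` is a stand-in dictionary for the config file, `resolve_timeout` is a hypothetical helper, and the value 30.0 is an arbitrary placeholder. Only the key name comes from the comment above.

```python
# Stand-in for the config file; the value 30.0 is an arbitrary placeholder.
CONFIG = {"distributed.comm.timeouts.connect": 30.0}


def resolve_timeout(timeout=None, config=CONFIG):
    """Return the explicit timeout, or the configured one when it is None."""
    # Compare against None rather than truthiness so that an explicit
    # timeout of 0 is still honoured.
    if timeout is None:
        return config["distributed.comm.timeouts.connect"]
    return timeout
```

With a default of `timeout=None`, callers that pass nothing pick up the configured value, while callers that pass a number keep full control.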

lr4d commented Oct 8, 2020

Problem description

Our dask update graphs are not properly optimized.

We usually use dask.dataframe optimization and set ave_width=repartition_ratio for kartothek.io.dask.dataframe.update_dataset_from_ddf graphs. We should return an optimized graph from update_dataset_from_ddf to make our users' lives simpler.

We already have code that does this; whoever picks this up can ping me.

randyzwitch commented Mar 28, 2019

In trying to write tests for #189, I'm finding it very difficult to add columns to existing tests, as in some cases, like the all_types table, the table is defined in a different file from the tests and multiple tests try to write to the same table.

Additionally, our test suite doesn't prove that the data that are uploaded are the same as the data downloaded for all types.

We should consider m
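
A round-trip check of the kind described above can be sketched as follows. This is only an illustration under stated assumptions: the standard-library sqlite3 module stands in for the real database backend, and the table name `round_trip_types` and the helper `round_trip` are hypothetical. Each call builds its own table, so tests do not share state.

```python
import sqlite3


def round_trip(rows):
    """Write rows to a fresh in-memory table and read them back in order."""
    # sqlite3 is a stand-in backend; each call gets a private database,
    # so no other test can write to the same table.
    conn = sqlite3.connect(":memory:")
    try:
        conn.execute(
            "CREATE TABLE round_trip_types (i INTEGER, f REAL, s TEXT, b BLOB)"
        )
        conn.executemany(
            "INSERT INTO round_trip_types VALUES (?, ?, ?, ?)", rows
        )
        cursor = conn.execute(
            "SELECT i, f, s, b FROM round_trip_types ORDER BY rowid"
        )
        return cursor.fetchall()
    finally:
        conn.close()


# One row per edge case: typical values, empty values, and NULLs.
rows = [
    (1, 1.5, "a", b"\x00\x01"),
    (-2, 0.0, "", b""),
    (None, None, None, None),
]
assert round_trip(rows) == rows
```

Adding a column to such a test means touching only the table definition and the sample rows inside the test itself.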
