dask
Here are 209 public repositories matching this topic...
Add MPdist Tutorial
-
Updated
Dec 19, 2020 - Python
What happened:
When creating a LocalCluster object the comm is started on a random high port, even if there are no other clusters running.
What you expected to happen:
Should use port 8786.
Minimal Complete Verifiable Example:
$ conda create -n dask-lc-test -c conda-forge -y python=3.8 ipython dask distributed
$ conda activate dask-lc-testThe `d
-
Updated
Mar 10, 2021 - Jupyter Notebook
-
Updated
Dec 15, 2020 - Python
Describe the bug
According to the multiscene documentation, the property all_same_area does:
Determine if all contained Scenes have the same ‘area’.
However, I have created a multiscene where all scenes have the same area (they just differ between datasets), yet the property returns Fa
-
Updated
Feb 27, 2021 - Python
-
Updated
Feb 3, 2021 - Python
Code Sample, a minimal, complete, and verifiable piece of code
from pyresample.boundary import Boundary
b = Boundary(my_lons, my_lats)
print(b.contour_poly.area())Problem description
The above code doesn't fail if the provided lons/lats are 2D (not sure on 3D+), but the class and all functions/utilities underneath it assume 1D arrays. The end results are incor
from dask_jobqueue import SLURMCluster
cluster = SLURMCluster(cores=1, memory='1GB')
print(cluster.job_script()) #!/usr/bin/env bash
#SBATCH -J dask-worker
#SBATCH -n 1
#SBATCH --cpus-per-task=1
#SBATCH --mem=954M
#SBATCH -t 00:30:00
/home/lesteve/miniconda3/bin/python -m distributed.cli.dask_worker tcp://192.168.0.11:44065 --nthreads 1 --memory-limit 1000.00MB -
Problem description
Our dask update graphs are not properly optimized.
We ussually use dask.dataframe optimization and set ave_width=repartition_ratio for kartothek.io.dask.dataframe.update_dataset_from_ddf graphs. We should return an optimized graph from update_dataset_from_ddf to make our users' life simple.
We already have code that does this, whoever picks this up can ping me
Currently all of the metrics computed are independent of a target variable or column, but if lens.summarise took the name of a column as the target variable, the output of some metrics could be more interpretable even if the target variable is not used in any kind of predictive modelling.
A good example of this could be PCA (see #14), which could plot the different categories of the target va
-
Updated
Mar 10, 2021 - Vue
._datasets is dict.
Implement:
- data.vars
- nbytes
- coords
- dims
- xr.Dataset methods
- +-*/
- tab completion
- contains
- iter
-
Updated
Apr 25, 2018 - Python
-
Updated
Jul 3, 2018 - Python
-
Updated
Mar 9, 2021 - JavaScript
-
Updated
Jun 17, 2020 - Python
-
Updated
Feb 21, 2021 - Python
-
Updated
Mar 2, 2021 - Python
-
Updated
Mar 8, 2021 - Python
Improve this page
Add a description, image, and links to the dask topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the dask topic, visit your repo's landing page and select "manage topics."
I just ran into an issue when trying to use
to_csvwith distributed workers that don't share a file system. I shouldn't have been surprised that writing to a local file system from a distributed worker doesn't work. It shouldn't work. But the error I got was just aFile Not Founderror. That brought me to:dask/dask#2656 (comment) - which was the answer.