dask
Here are 193 public repositories matching this topic...
Is your feature request related to a problem? Please describe.
I am trying to drop particular indices from a dimension that doesn't have coordinates.
Following: [drop_sel() documentation](http://xarray.pydata.org/en/stable/generated/xarray.Dataset.drop_sel.html#xarray.Dataset.drop_se
Add MPdist Tutorial
-
Updated
Oct 11, 2020 - Python
In our API docs we currently use
.. autosummary::
Client
Client.call_stack
Client.cancel
...
To generate a table of Client methods at the top of the page. Later on we use
.. autoclass:: Client
:members:
to display the docstrings for all the public methods on Client (here an example for
-
Updated
Dec 16, 2020 - Jupyter Notebook
-
Updated
Dec 15, 2020 - Python
Describe the bug
According to the multiscene documentation, the property all_same_area does:
Determine if all contained Scenes have the same ‘area’.
However, I have created a multiscene where all scenes have the same area (they just differ between datasets), yet the property returns Fa
-
Updated
Nov 3, 2020 - Python
-
Updated
Nov 27, 2020 - Python
There are various examples of how to define an AreaDefinition throughout the documentation and in test files like https://github.com/pytroll/pyresample/blob/master/pyresample/test/test_files/areas.yaml. However, some of these have projection definitions with only a defined (equatorial radius). This is technically incomplete and although it works in PROJ, rasterio/GDAL are a little more strict an
from dask_jobqueue import SLURMCluster
cluster = SLURMCluster(cores=1, memory='1GB')
print(cluster.job_script()) #!/usr/bin/env bash
#SBATCH -J dask-worker
#SBATCH -n 1
#SBATCH --cpus-per-task=1
#SBATCH --mem=954M
#SBATCH -t 00:30:00
/home/lesteve/miniconda3/bin/python -m distributed.cli.dask_worker tcp://192.168.0.11:44065 --nthreads 1 --memory-limit 1000.00MB -
Problem description
Our dask update graphs are not properly optimized.
We ussually use dask.dataframe optimization and set ave_width=repartition_ratio for kartothek.io.dask.dataframe.update_dataset_from_ddf graphs. We should return an optimized graph from update_dataset_from_ddf to make our users' life simple.
We already have code that does this, whoever picks this up can ping me
Currently all of the metrics computed are independent of a target variable or column, but if lens.summarise took the name of a column as the target variable, the output of some metrics could be more interpretable even if the target variable is not used in any kind of predictive modelling.
A good example of this could be PCA (see #14), which could plot the different categories of the target va
-
Updated
Dec 14, 2020 - Vue
I've been using nb_black lately and it's wonderful. https://github.com/dnanhkhoa/nb_black
We can add this with %load_ext lab_black for our notebook examples and it will auto-format every cell to fit black to make it more clean and readable without us having to worry about it.
As the server speaks the presto protocol, which is understood by most of the BI tools, it would be nice to test and showcase it with some of them.
For example I have done a very quick test with both hue and metabase, which look promising - but also have shown some additional quirks in the server implementation.
-
Updated
Apr 25, 2018 - Python
-
Updated
Jul 3, 2018 - Python
-
Updated
Jun 17, 2020 - Python
-
Updated
Dec 16, 2020 - JavaScript
-
Updated
Dec 3, 2020 - Python
-
Updated
Nov 6, 2017 - Python
Improve this page
Add a description, image, and links to the dask topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the dask topic, visit your repo's landing page and select "manage topics."