#
dask
Here are 326 public repositories matching this topic...
Mars is a tensor-based unified framework for large-scale data computation which scales numpy, pandas, scikit-learn and Python functions.
python
machine-learning
tensorflow
numpy
scikit-learn
pandas
pytorch
xgboost
lightgbm
tensor
dask
ray
dataframe
statsmodels
joblib
-
Updated
Nov 30, 2022 - Python
STUMPY is a powerful and scalable Python library for modern time series analysis
python
data-science
pattern-matching
pydata
dask
numba
motif-discovery
time-series-analysis
anomaly-detection
time-series-data-mining
matrix-profile
time-series-segmentation
-
Updated
Dec 2, 2022 - Python
Expressive analytics in Python at any scale.
mysql
python
bigquery
sqlalchemy
sql
database
spark
arrow
clickhouse
sqlite
impala
postgresql
pandas
pyspark
mssql
dask
pyarrow
datafusion
duckdb
polars
-
Updated
Dec 2, 2022 - Python
A package which efficiently applies any function to a pandas dataframe or series in the fastest available manner
-
Updated
Aug 16, 2022 - Python
A distributed task scheduler for Dask
-
Updated
Dec 2, 2022 - Python
data-science
machine-learning
spark
bigdata
data-transformation
pyspark
data-extraction
data-analysis
data-wrangling
dask
data-exploration
data-preparation
data-cleaning
data-profiling
data-cleansing
big-data-cleaning
data-cleaner
cudf
dask-cudf
-
Updated
Nov 21, 2022 - Python
Eliot: the logging system that tells you *why* it happened
python
elasticsearch
numpy
logging
twisted
tracing
scientific-computing
asyncio
logging-library
journald
dask
causality
causation
causality-analysis
-
Updated
Nov 28, 2022 - Python
A unified interface for distributed computing. Fugue executes SQL, Python, and Pandas code on Spark, Dask and Ray without any rewrites.
distributed-systems
machine-learning
sql
spark
distributed-computing
pandas
distributed
dask
data-practitioners
-
Updated
Nov 18, 2022 - Python
A library for managing, validating, summarizing, and visualizing data
data-science
statistics
spark
plotly
pandas
data-visualization
dataops
data-analysis
matplotlib
dask
data-exploration
pandas-summary
dataframes
data-summary
data-quality-checks
data-quality
data-profiling
mlops
data-quality-monitoring
data-reporting
-
Updated
Nov 9, 2022 - Python
Geospatial image resampling in Python
-
Updated
Nov 27, 2022 - Python
Distributed SQL Engine in Python using Dask
-
Updated
Dec 2, 2022 - Python
Universal Regridder for Geospatial Data
-
Updated
Jan 11, 2022 - Python
A web frontend for scheduling Jupyter notebook reports
docker
kubernetes
airflow
jupyter
notebook
jupyter-notebook
nteract
luigi
celery
jupyterlab
dask
jupyter-notebooks
phosphorjs
apache-airflow
papermill
scheduling-notebooks
-
Updated
Feb 9, 2022 - Python
A full pipeline AutoML tool for tabular data
sklearn
tabular-data
xgboost
semi-supervised-learning
gpu-acceleration
gbm
lightgbm
ensemble-learning
dask
preprocessing
automl
distributed-training
datacleaning
catboost
pseudo-labeling
dask-distributed
rapidsai
fullpipeline
adversarial-validation
-
Updated
Nov 26, 2022 - Python
Deploy Dask on job schedulers like PBS, SLURM, and SGE
-
Updated
Nov 23, 2022 - Python
Improve this page
Add a description, image, and links to the dask topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the dask topic, visit your repo's landing page and select "manage topics."