Skip to content
#

Data Science

Data science is an inter-disciplinary field that uses scientific methods, processes, algorithms, and systems to extract knowledge from structured and unstructured data. Data scientists perform data analysis and preparation, and their findings inform high-level decisions in many organizations.

Here are 17,059 public repositories matching this topic...

ogrisel
ogrisel commented Nov 13, 2020

Most functions in scipy.linalg functions (e.g. svd, qr, eig, eigh, pinv, pinv2 ...) have a default kwarg check_finite=True that we typically leave to the default value in scikit-learn.

As we already validate the input data for most estimators in scikit-learn, this check is redundant and can cause significant overhead, especially at predict / transform time. We should probably a

superset
junlincc
junlincc commented Jan 28, 2021

Currently, after user editing a metric in Edit dataset modal in Explore, the edited metric jump to the bottom of the metric list.
it create a few issues:

  1. when the metric list is long, by dropping to the bottom, user might take a while to find it or think the most recent edited metic is lost.
  2. every time users edit a single metric, they see a different order of metric list showing in the

Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.

  • Updated Jan 28, 2021
  • Python
YarShev
YarShev commented Jan 29, 2021

Describe your feature request

Hi guys,

It would be awesome to add API that has same output as ray memory command.
Also, it would be good to add some additional output info for ray.objects(). For example, node IP, IDs of objects which are created in in-process stores, IDs of objects from remote calls (when remote calls are still being executed).

Thanks in advance!

dash
wjaskowski
wjaskowski commented Dec 22, 2020

Summary

When a function has print('sth', file=sys.stderr) in the body I get:

InternalHashError: [Errno 2] No such file or directory: '<stderr>'

While caching the body of eval_models_on_all_data(), Streamlit encountered an object of type _io.TextIOWrapper, which it does not know how to hash.

Steps to reproduce

Code snippet:

@st.cache
def f():
   prin
pytorch-lightning
gensim
nni