Skip to content
#

Deep learning

Deep learning is an AI function and subset of machine learning, used for processing large amounts of complex data.

Here are 41,964 public repositories matching this topic...

transformers
kumpera
kumpera commented Jan 19, 2022

🐛 Describe the bug

Usage of RRefContext::handleException in torch/csrc/distributed/rpc/rref_context.cpp is wrong when the future has an error.

RRefContext::handleException uses TORCH_CHECK which throws.

Callers of RRefContext::handleException don't expect that and run code after it without any guarding.

Versions

master

cc @pietern @mrshenli @pritamdamania87

Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.

  • Updated Nov 4, 2021
  • Python
matthewdeng
matthewdeng commented Jan 6, 2022

Problem: Currently JsonLoggerCallback.handle_result will load in the entirety of the existing results, append the new result, and then rewrite the entire file. This may not scale when running long-running jobs or jobs with large results.

https://github.com/ray-project/ray/blob/4e8f90aca20aa7bb87a4e84039889444824382ca/python/ray/train/callbacks/logging.py#L138-L142

Potential Fix:

Wikipedia
Wikipedia