Skip to content
#

Data Science

Data science is an inter-disciplinary field that uses scientific methods, processes, algorithms, and systems to extract knowledge from structured and unstructured data. Data scientists perform data analysis and preparation, and their findings inform high-level decisions in many organizations.

Here are 21,026 public repositories matching this topic...

reshamas
reshamas commented Aug 6, 2021

Describe the issue linked to the documentation

The "20 newsgroups text" dataset can be accessed within scikit-learn using defined functions. The dataset contains some text which is considered culturally insensitive.

Suggest a potential alternative/fix

Add a section in the dataset documentation, possibly above the "Recommendation" section called "Data Considerations".
https://

superset
nguyenluongky
nguyenluongky commented Aug 26, 2021

Currently, we use Native filter on Superset version 1.2, but looks like The actual time range does not show correctly with SIP-15 (in the SIP-15 the time range must is [inclusive, exclusive) ). So that mean the actual time range and the tool tip must show label as: from_date <= col < to_date.

Expected results

![image](https://user-images.githubusercontent.com/37523968/130939207-7ff847a

Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.

  • Updated May 13, 2021
  • Python
pytorch-lightning
awaelchli
awaelchli commented Aug 21, 2021

📚 Documentation

There are a few undocumented public properties in the logger wrappers.
Example:
https://github.com/PyTorchLightning/pytorch-lightning/blob/92e49795e1e044ee7f1ca450babf0c50c1b5d81f/pytorch_lightning/loggers/mlflow.py#L174

Let's document these so they show up in our HTML docs and so users can discover them.

**This is a good issue for new contributors!! If you are int

dash
MajorMajor807
MajorMajor807 commented Aug 16, 2021

Bug summary

I am using contourf to plot filled in contours, but some of the contours are not being filled in despite how values exist for those regions. I am including an example. The code behind the generation of R_mesh, Z_mesh, and total_mesh has been exempted for simplicity, but the problem remains the same.

Code for reproduction



R_mesh = [231.86725132, 220
gensim
c4n
c4n commented Jul 30, 2021

Is your feature request related to a problem? Please describe.
I want to evaluate multiple datasets (same formatting, they can share the same dataset reader). The "evaluate" command takes much longer to load the model than to evaluate.

Describe the solution you'd like
support passing multiple input files and output files to the "evaluate" command

**Describe alternatives you've cons

nni