Data Science

Data science is an inter-disciplinary field that uses scientific methods, processes, algorithms, and systems to extract knowledge from structured and unstructured data. Data scientists perform data analysis and preparation, and their findings inform high-level decisions in many organizations.

Describe the issue linked to the documentation

The "20 newsgroups text" dataset can be accessed within scikit-learn using defined functions. The dataset contains some text which is considered culturally insensitive.

Suggest a potential alternative/fix

Add a section in the dataset documentation, possibly above the "Recommendation" section called "Data Considerations".
https://

Currently, we use Native filter on Superset version 1.2, but looks like The actual time range does not show correctly with SIP-15 (in the SIP-15 the time range must is [inclusive, exclusive) ). So that mean the actual time range and the tool tip must show label as: from_date <= col < to_date.

Expected results

![image](https://user-images.githubusercontent.com/37523968/130939207-7ff847a

Apache Arrow has a first-class tabular file format, Feather, that the Ray Datasets IO layer should support. Combined with Ray Datasets' existing .from_arrow() and .to_arrow() APIs, this would round out our "all-Arrow" experience, which should be as nice as possible given our "distributed Arrow dataset" positioning.

Implementation Note

We currently print a warning as shown below when a user sets both a widget default value in the function defining the widget as well as a widget value via the widget's key in st.session_state

While we certainly want to do this by default since doing both is not recommended, we should provide a

📚 Documentation

There are a few undocumented public properties in the logger wrappers.
Example:
https://github.com/PyTorchLightning/pytorch-lightning/blob/92e49795e1e044ee7f1ca450babf0c50c1b5d81f/pytorch_lightning/loggers/mlflow.py#L174

Let's document these so they show up in our HTML docs and so users can discover them.

**This is a good issue for new contributors!! If you are int

In recent versions (can't say from exactly when), there seems to be an off-by-one error in dcc.DatePickerRange. I set max_date_allowed = datetime.today().date(), but in the calendar, yesterday is the maximum date allowed. I see it in my apps, and it is also present in the first example on the DatePickerRange documentation page.

E

The docs for IPython.core.interactiveshell.InteractiveShell.set_custom_exc have horribly mangled a warning message into a list of arguments. I can't work out at a glance why this is happening; it might be a sphinx.ext.napoleon bug, or a sphi

Bug summary

I am using contourf to plot filled in contours, but some of the contours are not being filled in despite how values exist for those regions. I am including an example. The code behind the generation of R_mesh, Z_mesh, and total_mesh has been exempted for simplicity, but the problem remains the same.

Code for reproduction



R_mesh = [231.86725132, 220

In gensim/models/fasttext.py:

    model = FastText(
        vector_size=m.dim,
        vector_size=m.dim,
        window=m.ws,
        window=m.ws,
        epochs=m.epoch,
        epochs=m.epoch,
        negative=m.neg,
        negative=m.neg,
        # FIXME: these next 2 lines read in unsupported FB FT modes (loss=3 softmax or loss=4 onevsall,
        # or model=3 supervi

Is your feature request related to a problem? Please describe.
I want to evaluate multiple datasets (same formatting, they can share the same dataset reader). The "evaluate" command takes much longer to load the model than to evaluate.

Describe the solution you'd like
support passing multiple input files and output files to the "evaluate" command

**Describe alternatives you've cons

Data Science

Here are 21,026 public repositories matching this topic...

keras-team / keras

scikit-learn / scikit-learn

Describe the issue linked to the documentation

Suggest a potential alternative/fix

apache / superset

Expected results

GokuMohandas / MadeWithML

CamDavidsonPilon / Probabilistic-Programming-and-Bayesian-Methods-for-Hackers

donnemartin / data-science-ipython-notebooks

explosion / spaCy

eriklindernoren / ML-From-Scratch

microsoft / ML-For-Beginners

ray-project / ray

Implementation Note

academic / awesome-datascience

streamlit / streamlit

PyTorchLightning / pytorch-lightning

📚 Documentation

plotly / dash

ipython / ipython

matplotlib / matplotlib

Bug summary

Code for reproduction

AMAI-GmbH / AI-Expert-Roadmap

fastai / fastbook

virgili0 / Virgilio

eugeneyan / applied-ml

RaRe-Technologies / gensim

afshinea / stanford-cs-229-machine-learning

bharathgs / Awesome-pytorch-list

rasbt / python-machine-learning-book

microsoft / recommenders

d2l-ai / d2l-en

hangtwenty / dive-into-machine-learning

allenai / allennlp

0xnr / awesome-bigdata

microsoft / nni

Related Topics