data-science
Here are 10,348 public repositories matching this topic...
In the IterativeImputer, min_value and max_value are defaulted to None. Internally, if they are None min and max value will be affected to -np.inf and np.inf, respectively.
We should change this behaviour and make that the default of min_value=-np.inf and max_value=np.inf directly.
The alert-success box that begins "Tensorflow Probability is part of the colab default runtime"---and created by the div tag---is not supported by nbconvert, when converting the chapter to PDF. The result is difficult to read:
The underlying issue is de
"Bokeh is a Python interactive visualization library that targets modern web browsers for presentation. Its goal is to provide elegant, concise construction of novel graphics in the style of D3.js, but also deliver this capability with high-performance interactivity over very large or streaming datasets. Bokeh can help anyone who would like to quickly and easi
I was going though the existing enhancement issues again and though it'd be nice to collect ideas for spaCy plugins and related projects. There are always people in the community who are looking for new things to build, so here's some inspiration
If you have questions about the projects I suggested,
-
Updated
Feb 21, 2020 - Python
The function get_closure seem to be unused;
we should : check if it is used, if not remove the function .
-
Updated
Feb 20, 2020 - Jupyter Notebook
-
Updated
Feb 21, 2020
I write tutorials and do lots of live teaching and gravitate mostly to using Jupyter Notebooks, which allow for text and code to be integrated together in one. Using dash on a local instance of Jupyter is essentially no different than using it from a normal .py file.
But, when teaching to people that have no local jupyter instance or don't even have python installed, I've turned to [Binder](myb
For example, in ray-project/ray#6983, we ran both.
-
Updated
Feb 21, 2020 - Jupyter Notebook
Tutorial: Similarity Queries
https://radimrehurek.com/gensim/auto_examples/core/run_similarity_queries.html#sphx-glr-auto-examples-core-run-similarity-queries-py
Notice the document order in the tutorial:
documents = [
"Human machine interface for lab abc computer applications",
"A survey of user opinion of computer system response time",
"The EPS user interface management
-
Updated
Feb 21, 2020
-
Updated
Feb 21, 2020
load_csv line can be since there's columns_to_ignore param's supported
data, labels = load_csv('titanic_dataset.csv', target_column=0, columns_to_ignore=[2, 7], categorical_labels=True, n_classes=2)
and we don't need to do that in preprocess()
def preprocess(passengers):
for i in range(len(passengers)):
passengers[i][1] = 1. if passengers[i][1] == 'female' else 0.
-
Updated
Feb 21, 2020
-
Updated
Feb 21, 2020
Describe the bug
Calling Predictor.get_gradients() returns an empty dictionary
To Reproduce
I am replicating the binary sentiment classification tasked described in the paper 'Attention is not Explanation ' (Jain and Wallace 2019 - https://arxiv.org/pdf/1902.10186.pdf).
My first experiment is on the Stanford Sentiment TreeBank Dataset. I need to measure the correlation between th
i'm a newbie in programming. I try to use this library. it's very useful for me.
I want to show centroid in K-means clustering. how to show it? thank u so much..
-
Updated
Feb 21, 2020 - Jupyter Notebook
Description
@wutaomsft suggestion:
it would be a good discussion point what is preferred way to make references in notebooks. I prefer not to have a"reference" section where references are separate from where they are referred. Instead, link the reference "in place". And then add a paragraph for "additional reading", which is more descr
Describe the bug
You can't select a different sheet in a gSheet (tabs at the bottom) in a gSheet importation with OR. All the names of the sheets of spreadsheets are there and good, but choosing one of them just make the default one reload.
To Reproduce
Steps to reproduce the behavior:
- Go to 'Create project' -> 'Google Data'
- Sign in a Google Account
- Choose a gSheet' spread
Hi,
I'm new to tpot but I got this error. I understand that score function can take strings, but I got the following error when using TPOTClassifier.
ValueError Traceback (most recent call last)
in
----> 1 tpot.score(X_test, y_test)~/miniconda3/envs/ml
-
Updated
Feb 21, 2020 - Python
On page 501 for the first case example the text says "n-10, m=5, p=2, s=1 -> ...". I assume the first entry of the line should be "n=10" instead of "n-10".
is it Grid Search can solve CASH problems with NNI , it seems that it is usually used for hyper-parameters optimization, have you guys have finished some revision for Grid Search for solving CASH problems.
about Cash problems can refer to :microsoft/nni#1178
The documentation of the boltons.timeutils.daterange function only mention date objects as input, but I discovered that it works with datetime objects as well. Explicitly stating so would be great. Thanks!
-
Updated
Feb 21, 2020 - JavaScript
Improve this page
Add a description, image, and links to the data-science topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the data-science topic, visit your repo's landing page and select "manage topics."

Some lines in the code block of the keras docs is too long, the result of which is, there will be a horizonal scroll bar at the bottom of the code block. That is hard to read. The long lines should be rearranged to multiple short lines to improve readibility.
Example:
The docs for the SimpleRNN class (https://keras.io/layers/recurrent/#simplernn). The initializer of SimpleRNN has m