Data Science

Data science is an interdisciplinary field that uses scientific methods, processes, algorithms, and systems to extract knowledge from structured and unstructured data. Data scientists analyze and prepare data, and their findings inform high-level decisions in many organizations.

Here are 28,619 public repositories matching this topic...

swight-prc commented Jul 28, 2022

Describe the bug

The acceptable values for max_samples are (0, 1]. One possible option for class_weight is balanced_subsample. However, for values of max_samples near zero and class_weight='balanced_subsample', the model fails with an unhelpful error related to having an empty array as an indexer.
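A minimal sketch of how a value of max_samples near zero could leave a tree with no rows to fit on. It assumes the per-tree sample count comes from rounding n_samples * max_samples; that rule is an assumption for illustration and is not confirmed by the report. Both function names are hypothetical and are not part of any library.

```python
def bootstrap_size(n_samples: int, max_samples: float) -> int:
    """Hypothetical helper: rows drawn per tree for a float max_samples."""
    if not 0.0 < max_samples <= 1.0:
        raise ValueError("max_samples must be in the interval (0, 1]")
    # Assumed rule: scale the dataset size and round to the nearest integer.
    return round(n_samples * max_samples)


def checked_bootstrap_size(n_samples: int, max_samples: float) -> int:
    """Same rule, but fail early with a readable message instead of sampling nothing."""
    size = bootstrap_size(n_samples, max_samples)
    if size < 1:
        raise ValueError(
            f"max_samples={max_samples} selects 0 of {n_samples} rows; "
            "increase max_samples so each tree sees at least one sample"
        )
    return size
```

Under this assumption, 100 rows with max_samples=0.001 round down to an empty sample, which is the kind of input a grid search over small fractions could generate. An explicit check like the second function would surface the cause directly.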

This is only likely to come up in a grid search, as it seems unlikely that some

Bug good first issue help wanted module:ensemble
superset
ktmud commented Jul 26, 2022

Time-series Bar Chart v2 does not update total values for stacked bar chart when toggling legends.

How to reproduce the bug

  1. Create a "Time-series Bar Chart v2"
  2. Go to "Customize" and select "Show value", "Stack series" and "Only total"
  3. Toggle series in legends
  4. The total value should update but it doesn't

The legacy Time-series Bar Chart does not have this issue.
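The expected behavior can be sketched as recomputing each stack total from only the series that are still visible. This is an illustrative sketch in Python, not the chart plugin's actual code; the function name and the dictionary-of-lists data shape are assumptions.

```python
from typing import Dict, List, Set


def stacked_totals(series: Dict[str, List[float]], visible: Set[str]) -> List[float]:
    """Sum each time bucket over the series that are currently visible."""
    shown = [values for name, values in series.items() if name in visible]
    if not shown:
        return []
    # zip(*shown) walks the buckets; each tuple holds one value per visible series.
    return [sum(bucket) for bucket in zip(*shown)]


data = {"a": [1.0, 2.0], "b": [3.0, 4.0]}
all_on = stacked_totals(data, {"a", "b"})  # both series shown
only_a = stacked_totals(data, {"a"})       # series "b" toggled off in the legend
```

Calling the function again whenever the legend selection changes would keep the "Only total" labels in step with the bars actually drawn.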

good first issue viz:charts:echarts viz:charts:bar #bug

Data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe, Keras), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.

  • Updated Apr 3, 2022
  • Python
asaini commented Oct 1, 2021

Problem

See #3856. Developers would like to be able to configure whether the developer menu or the viewer menu is displayed while developing in cloud IDEs such as Gitpod or GitHub Codespaces.

Solution

Create a config option

showDeveloperMenu: true | false | auto

where

  • true: always shows the developer menu locally and while deployed
  • false: always sho
enhancement good first issue
lightning
otaj commented Jun 29, 2022

🚀 Typing coverage

Let's improve typing coverage of PyTorch Lightning together!

I'm creating a new issue in order to increase visibility. There are three older issues (#7037, #5023, #4698) which became stale over time.

Plan

Currently, there are 55 files which are excluded from mypy checks so that our CI does not fail. These files vastly differ in difficulty in order to make the t

help wanted good first issue let's do it! code quality
dash
myUsernameIsNotMyPassword commented Aug 7, 2022

Bug summary

pyplot raises an error when a list of dates starts with None.

Code for reproduction

from datetime import date, timedelta

from matplotlib import pyplot as plt


y = [6, 2, 8, 3, 1, 8, 5, 3, 0, 7]

x = [date.today() + timedelta(days=i) for i in range(10)]
x[5] = None
x[-1] = None

plt.plot(x, y) # works
plt.show()

x[0] = None

plt.plot(x, y) # TypeError
p
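One possible workaround, sketched under the assumption that dropping the missing points is acceptable for the plot, is to filter out the pairs whose date is None before plotting. The helper name drop_missing_dates is hypothetical and is not part of matplotlib.

```python
from datetime import date, timedelta
from typing import List, Optional, Sequence, Tuple


def drop_missing_dates(
    x: Sequence[Optional[date]], y: Sequence[float]
) -> Tuple[List[date], List[float]]:
    """Keep only the (x, y) pairs whose date is not None."""
    pairs = [(xi, yi) for xi, yi in zip(x, y) if xi is not None]
    xs = [xi for xi, _ in pairs]
    ys = [yi for _, yi in pairs]
    return xs, ys


y = [6, 2, 8, 3, 1, 8, 5, 3, 0, 7]
x: List[Optional[date]] = [date.today() + timedelta(days=i) for i in range(10)]
x[0] = None
x[5] = None
x[-1] = None

clean_x, clean_y = drop_missing_dates(x, y)
```

The cleaned lists can then be passed to the plotting call in place of the originals. Note that filtering removes the gaps entirely, so the line is drawn straight across the dates that were missing rather than being broken there.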
topic: date handling Good first issue status: has patch
Data-Science-For-Beginners
soubhikmandal2000 commented Oct 31, 2021
  • Base README.md
  • Quizzes
  • Introduction base README
    • Defining Data Science README
    • Defining Data Science assignment
    • Ethics README
    • Ethics assignment
    • Defining Data README
    • Defining Data assignment
    • Stats and Probability README
    • Stats and Probability assignment
  • Working with Data base README
    • Rel
good first issue help wanted translations
gensim
mpenkov commented Jun 22, 2021

In gensim/models/fasttext.py:

    model = FastText(
        vector_size=m.dim,
        window=m.ws,
        epochs=m.epoch,
        negative=m.neg,
        # FIXME: these next 2 lines read in unsupported FB FT modes (loss=3 softmax or loss=4 onevsall,
        # or model=3 supervi
bug difficulty easy good first issue fasttext
nni