-
Updated
May 13, 2020 - Jupyter Notebook
data-exploration
Here are 109 public repositories matching this topic...
-
Updated
May 12, 2020 - JavaScript
-
Updated
May 5, 2020 - Jupyter Notebook
It would be great if there was an option to preserve the original order of variables in plot_histogram(). Currently, variables within each page of the output seem to be ordered alphabetically but the pages themselves follow the original order.
-
Updated
Nov 20, 2019 - Python
-
Updated
May 12, 2020 - TypeScript
Adding a description for the parameters will help the users understand how to specify values for each parameter. For example, the format of the longitude in Yelp.businesses table; the maximum limit of the results that a user can expect (if we incorporate limit parameter in the future).
For large datasets where computing the summary may be expensive, it would be useful to compute only part of it, be able to explore it, and then compute other parts of it without recomputing the initial report.
The selection of which parts to compute could be by:
- columns in the dataset,
- metrics, or
- row ranges.
This can be useful to get an overview of string structure of a columns
def patterns(self, input_cols, output_cols=None, mode=0):
See https://github.com/ironmussa/Optimus/blob/develop-3.0/optimus/engines/base/columns.py#L153 For more info about the param
-
Updated
Apr 24, 2020
Need someone to go through the documentation we do have and determine what is missing and what is needed; develop a roadmap to help organize efforts.
-
Updated
Mar 29, 2020 - Jupyter Notebook
-
Updated
Apr 5, 2020 - R
-
Updated
May 11, 2020 - R
The original model of querying, doc by doc, whether doc ID exists to then insert/update the corresponding doc does not scale
-
Updated
Apr 24, 2020 - JavaScript
-
Updated
May 15, 2020 - Jupyter Notebook
-
Updated
Jun 16, 2017 - Python
-
Updated
Mar 22, 2019 - R
-
Updated
Dec 21, 2017 - Jupyter Notebook
-
Updated
May 13, 2020 - Python
-
Updated
Jan 19, 2018 - Jupyter Notebook
-
Updated
Apr 2, 2020 - Python
-
Updated
Mar 9, 2020 - Jupyter Notebook
-
Updated
Sep 4, 2017 - TypeScript
-
Updated
Apr 9, 2020 - Python
-
Updated
May 21, 2019 - Jupyter Notebook
-
Updated
Mar 9, 2018 - HTML
-
Updated
Jan 19, 2020 - R
Improve this page
Add a description, image, and links to the data-exploration topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the data-exploration topic, visit your repo's landing page and select "manage topics."
Because some user has had problems configuring these services could be helpful to make some examples or videos about how to properly setup Optimus in this services.