-
Updated
Apr 14, 2022 - Jupyter Notebook
pandas
Here are 15,882 public repositories matching this topic...
-
Updated
Apr 3, 2022 - Python
Describe the bug
Streaming Datasets can't be pickled, so any interaction between them and multiprocessing results in a crash.
Steps to reproduce the bug
import transformers
from transformers import Trainer, AutoModelForCausalLM, TrainingArguments
import datasets
ds = datasets.load_dataset('oscar', "unshuffled_deduplicated_en", split='train', streaming=True).with_format("-
Updated
Mar 28, 2022 - Python
- Base README.md
- Quizzes
- Introduction base README
- Defining Data Science README
- Defining Data Science assignment
- Ethics README
- Ethics assignment
- Defining Data README
- Defining Data assignment
- Stats and Probability README
- Stats and Probability assignment
- Working with Data base README
- Rel
I naively tried to do dd.merge(a, b, on="column_with_ten_values"), where a and b were both large DataFrames with thousands of partitions each.
Eventually the compute failed with:
[File /opt/conda/envs/coiled/lib/python3.9/site-packages/dask/dataframe/multi.py:275, in merge_chunk()
File /opt/conda/envs/coiled/lib/python3.9/site-packages/pandas/core/frame.py:9329, i-
Updated
Apr 20, 2022 - Python
-
Updated
Dec 23, 2020 - Python
-
Updated
Apr 6, 2022 - Jupyter Notebook
-
Updated
Apr 1, 2022 - Python
-
Updated
Feb 13, 2022 - Jupyter Notebook
-
Updated
Apr 21, 2022 - Python
-
Updated
Feb 19, 2022 - Python
-
Updated
Apr 21, 2022 - Python
-
Updated
Feb 6, 2020
-
Updated
Apr 16, 2022 - Jupyter Notebook
We have two similar methods with different names:
structs_column_view::get_sliced_childstructs_column_device_view::sliced_child
We should rename structs_column_view::get_sliced_child to structs_column_view::sliced_child to align with the other method and avoid unnecessary get_ prefixes as is the normal practice in libcudf.
_Originally posted by @bdice in https://github.com/r
-
Updated
May 8, 2018 - Jupyter Notebook
-
Updated
Mar 30, 2022 - Python
Reading currencies, alphavantage returns a greeting note ("welcome") and this note raises an error in alphavantage.py line 363.
elif "Note" in json_response and self.treat_info_as_error:
raise ValueError(json_response["Note"])
For this reason, alphavantage does not work in home assistant.
-
Updated
Apr 21, 2022 - Python
to_dict() equivalent
I would like to convert a DataFrame to a JSON object the same way that Pandas does with to_dict().
toJSON() treats rows as elements in an array, and ignores the index labels. But to_dict() uses the index as keys.
Here is an example of what I have in mind:
function to_dict(df) {
const rows = df.toJSON();
const entries = df.index.map((e, i) => ({ [e]: rows[i] }));
-
Updated
Apr 9, 2022 - TypeScript
-
Updated
Apr 21, 2022 - C++
-
Updated
Feb 27, 2022 - Python
-
Updated
Apr 6, 2022 - Python
Improve this page
Add a description, image, and links to the pandas topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with the pandas topic, visit your repo's landing page and select "manage topics."
We've got this page in our website, that is helping users set up a working environment to use pandas. With Anaconda moving to libmamba is probably fine to leave it as it is, but I'd personally use mambaforge in the page, which I think is probably a better choice of tool (smaller, only conda-forge channel, faster, a bit nicer UI). Does anyone think