datasets
Here are 1,454 public repositories matching this topic...
Describe the bug
I am trying to label Hebrew text (a right-to-left language). When labels are attached to the text, the words are scrambled and no longer shown in their original order.
To Reproduce
Steps to reproduce the behavior:
- Create a project with attached dataset.json dataset.txt
- Choose NER template
- Start
🚨 🚨 Feature Request
- A new implementation (Improvement, Extension)
Is your feature request related to a problem?
Currently, if a user tries to access an index that is larger than the dataset length or tensor length, an internal error is thrown that is hard to understand.
Description of the possible solution
We can catch the error and throw a more descriptive e
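The idea of catching the low-level error and re-raising something readable can be sketched in plain Python. This is only an illustration under assumptions: `DescriptiveIndexError` and `checked_get` are hypothetical names made up for this sketch, and a plain list stands in for the dataset or tensor; none of this is the project's actual API.

```python
class DescriptiveIndexError(IndexError):
    """Hypothetical error type used only in this sketch."""


def checked_get(items, index, name="dataset"):
    """Return items[index]; on failure, re-raise with a readable message."""
    try:
        return items[index]
    except IndexError as err:
        length = len(items)
        # Keep the original exception chained so the internal cause is not lost.
        raise DescriptiveIndexError(
            f"Index {index} is out of range for {name} of length {length}."
        ) from err


samples = [10, 20, 30]  # stand-in for a dataset or tensor
value = checked_get(samples, 1)
```

Subclassing `IndexError` keeps existing `except IndexError` handlers working while giving the user a message that names the index and the length.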
Issue to track tutorial requests:
- Deep Learning with PyTorch: A 60 Minute Blitz - #69
- Sentence Classification - #79
Not sure if this would be of interest, but:
When registering a table:
    addr: 0.0.0.0:8084
    tables:
      - name: "example"
        uri: "/data/"
        option:
          format: "parquet"
          use_memory_table: false
add in options:
glob
    pattern: "file_typev1*.parquet"
or regexp
    pattern: "\wfile_type\wv1\w*.parquet"
It would allow selecting in uri's with different exte
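To make the request concrete, here is a rough Python sketch of what glob and regexp selection over file names could look like. It does not describe how the project implements or would implement this: `select_files` and its `kind` parameter are hypothetical, the file names are invented, and the regular expression in the usage lines is an illustrative one written for this sketch rather than the pattern quoted above.

```python
import fnmatch
import re


def select_files(names, pattern, kind="glob"):
    """Return the names matching pattern, treated as a glob or a regexp."""
    if kind == "glob":
        # fnmatchcase is case-sensitive on every platform.
        return [n for n in names if fnmatch.fnmatchcase(n, pattern)]
    if kind == "regexp":
        compiled = re.compile(pattern)
        # fullmatch requires the whole file name to match, like a glob does.
        return [n for n in names if compiled.fullmatch(n)]
    raise ValueError(f"unknown pattern kind: {kind!r}")


# Invented directory listing for illustration.
names = [
    "file_typev1_a.parquet",
    "file_typev1_b.parquet",
    "file_typev2_a.parquet",
    "file_typev1.csv",
]
by_glob = select_files(names, "file_typev1*.parquet")
by_regexp = select_files(names, r"file_typev1\w*\.parquet", kind="regexp")
```

Both calls select the two `file_typev1` parquet files and skip the v2 file and the CSV. Note the escaped dot in the regexp: an unescaped `.` matches any character.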
What?
Currently, the API raises its own hand-written messages and errors. We should move them to Werkzeug exceptions.

Describe the bug
Streaming Datasets can't be pickled, so any interaction between them and multiprocessing results in a crash.
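The same class of failure can be illustrated with the standard library alone. This is a sketch by analogy, not the library's code and not the reporter's reproduction: a plain generator stands in for a streaming dataset, and `ReplayableStream` is a hypothetical wrapper showing one common workaround, which is to pickle the recipe for building the stream instead of the live iterator. Objects sent to `multiprocessing` workers are pickled, which is why an unpicklable stream breaks there.

```python
import pickle


def count_up(limit):
    """Module-level generator function; stands in for a streaming source."""
    for i in range(limit):
        yield i


class ReplayableStream:
    """Hypothetical wrapper that stores how to build the stream, not the stream."""

    def __init__(self, factory, *args):
        self.factory = factory  # must be picklable, e.g. a module-level function
        self.args = args

    def __iter__(self):
        # A fresh iterator is created on each pass, in whichever process iterates.
        return self.factory(*self.args)


def is_picklable(obj):
    """Return True if pickle can serialize obj."""
    try:
        pickle.dumps(obj)
        return True
    except (TypeError, AttributeError, pickle.PicklingError):
        return False


live = count_up(3)                      # a live generator cannot be pickled
stream = ReplayableStream(count_up, 3)  # the recipe can
clone = pickle.loads(pickle.dumps(stream))
```

The trade-off is that the copy restarts from the beginning; any position reached in the original iterator is not carried across the process boundary.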
Steps to reproduce the bug