Pinned
1,103 contributions in the last year
Less
More
Activity overview
Contributed to
huggingface/datasets,
albertvillanova/albertvillanova.github.io,
albertvillanova/molai
and 5 other
repositories
Contribution activity
June 2021
Created 29 commits in 4 repositories
Created 5 repositories
- albertvillanova/examples Shell
- albertvillanova/xla C++
- albertvillanova/s3prl Python
- albertvillanova/manim Python
- albertvillanova/tmp
Created a pull request in huggingface/datasets that received 7 comments
Keep original features order
When loading a Dataset from a JSON file whose column names are not sorted alphabetically, we should get the same column name order, whether we pass…
+52
−7
•
7
comments
Opened 19 other pull requests in 2 repositories
huggingface/datasets
1
open
17
merged
- Minor fix in loading metrics docs
- Fix DuplicatedKeysError in drop dataset
- Fix logging levels
- Improve Features docs
- Sync with transformers disabling NOTSET
- Improve performance of pandas arrow extractor
- Fix typo in MatthewsCorrelation class name
- Rearrange JSON field names to match passed features schema field names
- Add Zenodo metadata file with license
- Use default cast for sliced list arrays if pyarrow >= 4
- Allow latest pyarrow version
- Set configurable downloaded datasets path
- Set configurable extracted datasets path
- Fix docs custom stable version
- Implement ClassLabel encoding in JSON loader
- Revert default in-memory for small datasets
- Fix cross-reference typos in documentation
- Rename config and environment variable for in memory max size
huggingface/transformers
1
open
Reviewed 6 pull requests in 1 repository
Created an issue in huggingface/datasets that received 4 comments
Fix automatic generation of Zenodo DOI
After the last release of Datasets (1.8.0), the automatic generation of the Zenodo DOI failed: it appears in yellow as "Received", instead of in gr…
•
4
comments
Opened 11 other issues in 2 repositories
huggingface/datasets
7
open
3
closed
- Improve docs on Enhancing performance
- Allow latest pyarrow version once segfault bug is fixed
- Implement layered building
- Implement loading a dataset builder
- Delete extracted files to save disk space
- Set download/extracted paths configurable
- Create release script
- Fix PermissionError on Windows when using tqdm >=4.50.0
- Merge DatasetDict and Dataset
- Revert default in-memory for small datasets
megagonlabs/SubjQA
1
open
Joined the BigScience Workshop organization
BigScience Workshop
Research workshop on large language models - The Summer of Language Models 21
37
contributions
in private repositories
Jun 3 – Jun 4