Create your own GitHub profile
Sign up for your own profile on GitHub, the best place to host code, manage projects, and build software alongside 50 million developers.
Sign upPinned
1,038 contributions in the last year
Activity overview
Contribution activity
November 2020
Created 27 commits in 2 repositories
Created a pull request in huggingface/transformers that received 20 comments
Fix rag finetuning + add finetuning test
Following #7715 we need more test coverage of the RAG example scripts. In this PR I'm adding a test for the finetuning script. The test includes a …
Opened 24 other pull requests in 2 repositories
huggingface/datasets
17
merged
3
open
1
closed
- Fix url with backslash in windows for blimp and pg19
- Disallow backslash in urls
- Follow cache_dir parameter to gcs downloader
- Allow arrow based builder in auto dummy data generation
- Better messages regarding split naming
- Allow several tags sets
- Add a few datasets of reference in the documentation
- Fix wikipedia custom config
- Auto generate dummy data
- Use GCP download url instead of tensorflow custom download for boolq
- Update ner datasets infos
- Consistent metric outputs
- Fix some metrics feature types
- Add clear_cache parameter in the test command
- Update head requests
- Integrate file_lock inside the lib for better logging control
- Use pandas reader in csv
- Fix kor nli csv reader
- Add writer_batch_size attribute to GeneratorBasedBuilder
- Make save function use deterministic global vars order
- Fix type hints pickling in python 3.6
huggingface/transformers
2
merged
1
closed
Reviewed 28 pull requests in 2 repositories
huggingface/datasets 23 pull requests
- Fix URL with backslash in Windows
- Optional per-dataset default config name
- Addition of Nl2Bash Dataset
- add metrec: arabic poetry dataset
- Add template and documentation for dataset card
- Add LER
- Add SemEval-2010 task 8
- gitignore .python-version
- Add IndicGLUE dataset and Metrics
- Auto generate dummy data
- Add open book corpus
- Create ClassLabel for labelling tasks datasets
- CNN/Dailymail Dataset Card
- AlloCiné dataset card
- Update loading_metrics.rst
- Update squad_v2.py
- Adding nli_tr dataset
- Add accuracy, precision, recall and F1 metrics
- dataset(ncslgr): add initial loading script
- Add ASNQ dataset
- fix: typos in tutorial to map KILT and TriviaQA
- Update quail dataset to v1.3
- Fix seqeval metric
huggingface/transformers 5 pull requests
Created an issue in huggingface/datasets that received 1 comment
[Caching] Dill globalvars() output order is not deterministic and can cause cache issues.
Dill uses dill.detect.globalvars to get the globals used by a function in a recursive dump. globalvars returns a dictionary of all the globals that…