Create your own GitHub profile
Sign up for your own profile on GitHub, the best place to host code, manage projects, and build software alongside 50 million developers.
Sign up
Pinned
1,313 contributions in the last year
Contribution activity
June 2020
Created a pull request in huggingface/nlp that received 1 comment
Add a sorting method
Add a dataset.sort(column_name=None, indices_list=None) method to sort a dataset according to a column (has to be a column with a numpy compatible …
+195
−21
•
1
comment
- Deal with multiple choice in common tests
- Split LMBert model in two
- [All models] Extend config.output_attentions with output_attentions function arguments
- first draft for blenderbot generator model
- Fix the __getattr__ method in BatchEncoding
- [isort] add matplotlib to known 3rd party dependencies
- never_split on slow tokenizers should not split
- Introduce a new tensor type for return_tensors on tokenizer for NumPy
Created an issue in huggingface/tokenizers that received 1 comment
Some rough edges with add_tokens
Here are some rough edges I identified when comparing with the python-based tokenizers in transformers. Maybe you would want to fix them?
Lower-ca…
1
comment