Pull requests: huggingface/tokenizers

Pull requests list

feat: add progress_format option for machine-readable JSON output
#1921 opened Dec 26, 2025 by podarok
6 tasks done
Upgrade GitHub Actions for Node 24 compatibility
#1916 opened Dec 20, 2025 by salmanmkc
Add Windows arm64 wheel build to Python release
#1907 opened Dec 8, 2025 by finnagin
Fix undefined names in docs/source/_ext/entities.py
#1895 opened Nov 28, 2025 by cclauss
Python: Add ruff rules for asyncio and performance
#1894 opened Nov 28, 2025 by cclauss
Implement Append normalizer
#1893 opened Nov 28, 2025 by ArthurZucker
C and C++ bindings to Tokenizers (Feature Request)
#1888 opened Nov 21, 2025 by thammegowda
Mark Python tests that need network access
#1872 opened Oct 2, 2025 by gordonmessmer
Fix unsigned integer underflow issue with truncation
#1859 opened Sep 1, 2025 by maxdebayser
feat: add cli for tokenizer and training (Feature Request)
#1842 opened Aug 6, 2025 by b00f
feat: whitespace optimize (Feature Request)
#1841 opened Aug 6, 2025 by b00f
Unused Unicode Character Filter
#1832 opened Jul 23, 2025 by sanderland
Add enforce_utf8_boundaries option to BpeTrainer
#1830 opened Jul 22, 2025 by sanderland
Faster Whitespace PreTokenizer (Drop-in Replacement)
#1822 opened Jul 7, 2025 by 8ria
Adding multiprocessing for sentencepiece_extractor
#1804 opened Jun 19, 2025 by AamodThakur
add group capture to replace (Feature Request)
#1788 opened Jun 3, 2025 by cboseak
Add Truncate pre-tokenizer
#1783 opened May 27, 2025 by ArthurZucker Draft
Make unigram cache optional
#1763 opened Apr 18, 2025 by wangrunji0408
Implement Append normalizer
#1755 opened Mar 24, 2025 by austinleedavis