Highlights
- Arctic Code Vault Contributor
Create your own GitHub profile
Sign up for your own profile on GitHub, the best place to host code, manage projects, and build software alongside 50 million developers.
Sign up
Pinned
607 contributions in the last year
Activity overview
Contribution activity
October 1, 2020
guolinke has no activity
yet for this period.
September 2020
- guolinke/tokenizers Rust
- guolinke/guolinke
- guolinke/category_encoders Python
- guolinke/recommenders Python
Created a pull request in microsoft/LightGBM that received 12 comments
- [python] Use ctypes for parameters of DLL functions for Dataset
- Use ctypes to pass parameters (fixes #3398)
- fix address alignment, required by cran
- Fix SWIG methods that return char**
- fix warnings
- [R-package] Fix incorrect num_iterations, early_stopping_round in saved models (fixes #2208, #2468)
- [R-package] use keyword arguments in internal function calls
- Improve performance of path smoothing
- Auc mu weights
- Average precision score
- fix sparse multiclass local feature contributions and add test
- stable multi-threading sum reduction
- Pr4 advanced method monotone constraints
- Fix typo in ResetConfig
- disable monotone constraint in objective functions with renew_tree_output
- improve subfeature_bynode
- [R-package] remove unnecessary comments
- Avoid segment fault in ResetConfig for GBDT in prediction (fix #3317)
- CTR and Count values for categorical features
- Add support for CUDA-based GPU build
- Trees with linear models at leaves
- Build integrated Python package library
- [R-package] added support for first_metric_only (fixes #2368)
- [R-package] bump version
- bump version
Created an issue in huggingface/tokenizers that received 11 comments
"the" token is splitted to "t" "h" "e" in large scale corpus
I train the bpe (BERTWordPiece) by my own, using the RoBERTa 160GB data. However, I found all "the" is broken. I check the learned vocab.txt, and f…
11
comments
78
contributions
in private repositories
Sep 1 – Sep 30