Skip to content
Avatar
🍉
Being busy, sorry for the late response.
🍉
Being busy, sorry for the late response.

Highlights

  • Arctic Code Vault Contributor

Organizations

@microsoft
guolinke/README.md

Anurag's github stats

Pinned

  1. A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT, GBM or MART) framework based on decision tree algorithms, used for ranking, classification and many other machine learning …

    C++ 11.6k 3k

  2. Implementation for the paper "DeepGBM: A Deep Learning Framework Distilled by GBDT for Online Prediction Tasks", which has been accepted by KDD'2019.

    Python 492 104

  3. Transformer with Untied Positional Encoding (TUPE). Code of paper "Rethinking Positional Encoding in Language Pre-training". Improve existing models like BERT.

    Python 70 8

607 contributions in the last year

Oct Nov Dec Jan Feb Mar Apr May Jun Jul Aug Sep Mon Wed Fri
Activity overview
Contributed to microsoft/LightGBM, guolinke/TUPE, guolinke/pytorch-docker and 5 other repositories
Loading

Contribution activity

October 1, 2020

guolinke has no activity yet for this period.

September 2020

Created a pull request in microsoft/LightGBM that received 12 comments

fix address alignment, required by cran

+173 −111 12 comments

Created an issue in huggingface/tokenizers that received 11 comments

"the" token is splitted to "t" "h" "e" in large scale corpus

I train the bpe (BERTWordPiece) by my own, using the RoBERTa 160GB data. However, I found all "the" is broken. I check the learned vocab.txt, and f…

11 comments
78 contributions in private repositories Sep 1 – Sep 30

Seeing something unexpected? Take a look at the GitHub profile guide.

You can’t perform that action at this time.