- Bellevue, WA
- https://deepakn94.github.io/
- @deepakn94
Highlights
- Pro
Popular repositories
- Megatron-LM (Public, forked from NVIDIA/Megatron-LM)
Ongoing research training transformer language models at scale, including: BERT & GPT-2
Python 1
- Megatron-DeepSpeed (Public, forked from bigscience-workshop/Megatron-DeepSpeed)
Ongoing research training transformer language models at scale, including: BERT & GPT-2
Python 1
276 contributions in the last year
Contribution activity
November 2023
Created 8 commits in 3 repositories
Created 2 repositories
- deepakn94/apex (Python)
  This contribution was made on Nov 27
- deepakn94/TransformerEngine (Python)
  This contribution was made on Nov 27
Created a pull request in NVIDIA/TransformerEngine that received 1 comment
Use non-deprecated PyTorch methods to silence warnings
Getting warnings of the following form using ToT TE:
/usr/local/lib/python3.10/dist-packages/transformer_engine/pytorch/attention.py:852: UserWarni…
Opened 1 other pull request in 1 repository
NVIDIA/apex: 1 merged
- Use recommended PyTorch methods to silence warnings
  This contribution was made on Nov 27
Reviewed 1 pull request in 1 repository
NVIDIA/TransformerEngine: 1 pull request
- Returning an empty tensor of param dtype for wgrad
  This contribution was made on Nov 7





