Zhanghao Wu Michaelvll

Ph.D. student @ UC Berkeley RISELab. Previously, RA @ MIT HAN Lab; Undergrad @ SJTU ACM Honors Class

508 followers · 173 following

Sky Computing Lab, UC Berkeley
Berkeley, CA

Highlights

Pro

Pinned

skypilot-org/skypilot Public

SkyPilot is a framework for easily running machine learning workloads on any cloud through a unified interface.

Python · 2.8k stars · 154 forks

lm-sys/FastChat Public

An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and FastChat-T5.

Python · 23.2k stars · 2.6k forks

ucbrise/graphtrans Public

Representing Long-Range Context for Graph Neural Networks with Global Attention

Python · 86 stars · 17 forks

mit-han-lab/lite-transformer Public

[ICLR 2020] Lite Transformer with Long-Short Range Attention

Python · 573 stars · 77 forks

DeepCCA Public

An implementation of Deep Canonical Correlation Analysis (DCCA or Deep CCA) with PyTorch.

Python · 223 stars · 60 forks

facebookresearch/fairseq Public

Facebook AI Research Sequence-to-Sequence Toolkit written in Python.

Python · 26.5k stars · 5.9k forks

1,925 contributions in the last year

Contribution activity

June 2023

Created 27 commits in 2 repositories

Created 1 repository

Michaelvll/instruct-eval Python Jun 1

Created a pull request in skypilot-org/skypilot that received 17 comments

[Core] User ray cluster causes SkyPilot cluster in INIT state

Fixes #2019 Tested (run the relevant ones): Any manual or new tests for this PR (please specify below) Reproducible script in #2019 All smo…

+66 −15 • 17 comments

Opened 20 other pull requests in 2 repositories

skypilot-org/skypilot 4 open 15 merged

Reorder CLIs Jun 16
[Refactor] Move status query into the cloud class Jun 15
[UX] Fix the message for the spot jobs in sky status Jun 13
[catalog] Make the price fetching more robust Jun 12
[ray] Fix the api called for placement group Jun 12
[test] Default to terminate on failure Jun 12
[Core] Fix log buffering issue Jun 10
[UX/minor] Remove uneccessary spot setup log Jun 9
[Dependency] Avoid buggy grpcio version Jun 9
[Spot] Fix spot pending status Jun 7
[Spot] Make the controller resources configurable Jun 6
[UI] Add cloud logos to Readme and docs Jun 5
[OCI] Add instructions for OCI Jun 5
[Identity] Make the identity loading more robust Jun 5
[Core] Avoid deduplication of the logs for multi-node job Jun 4
[Storage] Fix default storage selection Jun 3
[Storage] Fix the storage cloud checking before sky.check is called. Jun 3
[SCP] Format the scp check Jun 2
[GCP] Remove unsupported GPUs from the list_accelerators Jun 2

skypilot-org/skypilot-catalog 1 merged

Add schema description in readme Jun 15

Reviewed 43 pull requests in 2 repositories

skypilot-org/skypilot 25 pull requests

[Docs] Onprem docs merge fix Jun 16
[Refactor] Move status query into the cloud class Jun 15
UX: if a cluster becomes INIT, warn about autostop reset. Jun 15
UX: drop image_id warning, and print a hint for a corner case. Jun 14
[Docs] Mark onprem as experimental Jun 14
[Docs] Add permission setup page for the clouds Jun 13
[ray] Fix the api called for placement group Jun 13
[OCI] Support configurable boot volume size (disk_size) and performance (disk_tier) Jun 13
Speed up refresh: delay the slower ray status call & use cached IPs. Jun 13
[OCI example]: Update the OCI example task files Jun 13
[OCI fix] Nodes are not reusable if launch config changed Jun 13
UX: don't print refresh hint on status -r. Jun 13
[OCI docs] Update quota.rst Jun 12
Add docker support for SkyPilot Jun 12
[Spot] Spot job pipeline support Jun 12
[Core] Fix log buffering issue Jun 11
API: fix a possibly unbound error in core.cancel(). Jun 11
[OCI fix] Add tenancy specific prefix to zone in runtime (use general catalog file) Jun 9
[Dependency] Avoid buggy grpcio version Jun 9
Prefer to obtain the ssh_user from gcloud os-login instead of assuming that the email address is the ssh_user Jun 8
Doc: add a "Cloud Administration" page. Jun 8
Make path_size_megabytes() more robust. Jun 8
[OCI] Reduce retry times by excluding unsubscribed regions Jun 7
[Spot] Fix spot pending status Jun 7
Docs: update spot controller docs in spot-jobs.rst Jun 7

skypilot-org/skypilot-catalog 3 pull requests

Update Lambda H100 price to $1.99 Jun 15
Add schema description in readme Jun 15
[OCI] Remove tenancy prefix to make catalog file general Jun 9

Created an issue in skypilot-org/skypilot that received 2 comments

[Core] Resource leakage for sky down if a multi-node cluster is partially stopped

If a multi-node cluster is partially stopped (during autostop or by manually stopping the worker node), i.e. the cluster is in INIT state, our backend.tea…

2 comments

Opened 15 other issues in 1 repository

skypilot-org/skypilot 8 open 7 closed