Chip Huyen

@chipro

Building infra for real-time ML Teaching ML Sys Design ML Interviews book

Mountain View, CA
S-a alăturat în iunie 2008

Tweeturi

Ai blocat pe @chipro

Sigur vrei să vezi aceste Tweeturi? Vizualizarea Tweeturilor nu va debloca utilizatorul @chipro

  1. Tweet fixat
    22 iun. 2021

    An early draft of the machine learning interviews book is out 🥳 The book is open-sourced and free. Job search is a stressful process, and I hope that this effort can help in some way. Contributions and feedback are appreciated!

    Anulează acțiunea
  2. 3 ian.

    After my post on real-time machine learning last year, many people asked me how to do it. This post discusses the challenges + solutions for online prediction, online evaluation, and continual learning, with use cases and examples. Feedback appreciated!

    Anulează acțiunea
  3. 13 sept. 2021

    New blog post! Discussing: 1. Whether a data scientist should be fullstack 2. What caused the unreasonable expectation that DS should know Kubernetes 3. An overview of the tools that can help abstract away infra to allow DS to own a project end-to-end 🚀

    Anulează acțiunea
  4. 7 sept. 2021

    I was never a good systems engineer, so I always avoided the topic of compiling and optimizing ML models. However, as I work with ML on devices, the topic keeps coming up. So I spent the last 3 months learning about ML compilers. Here’s what I learned.

    Anulează acțiunea
  5. 19 aug. 2021

    Two books, nine years apart 🥳 I'm so happy. I still can't believe that a US publisher wants to publish my book! Grateful to so many people who made this happen. Designing Machine Learning Systems is scheduled for early 2022. First 3 chapters are here

    Anulează acțiunea
  6. 22 iul. 2021

    As I'm getting back to building tools, I'd like to formally apologize to every tool developer whose tool I've complained about before. It's easy to comment on how overwhelming, fragmented, and slow the ML tooling space is. It's 1000x harder to build something to make it better.

    Anulează acțiunea
  7. 8 iul. 2021

    The more I work in prod, the more I realize how hard it is to do prob & stats right. Splitting data is easy. Sampling data correctly is hard. Writing metrics is easy. Understanding what metrics measure is hard. Monitoring numbers is easy. Interpreting them is very, very hard.

    Anulează acțiunea
  8. 6 iul. 2021

    It amazes me how tool developers underestimate the onboarding process. Every time I see a new tool, I want to try it out, but 90% of the time, I give up because it takes over 2 hours to just set up the right environment and get the tool to even import.

    Anulează acțiunea
  9. 30 iun. 2021

    Now that AI can write linked lists better than us, can we get rid of this kind of interview question yet?

    Afișează acest fir
    Anulează acțiunea
  10. 27 iun. 2021

    The challenge for ML in production is to generalize to constantly changing edge cases. 2 main approaches: 1. Use massive data because more data can lead to better generalization 2. Build infra that allows models to learn to adapt in real-time Hmu if you're excited about #2!

    Anulează acțiunea
  11. 23 iun. 2021

    Takeaway from 's CVPR talk: The most successful ML projects in prod (Tesla, iPhone, Amazon drones, Zipline) are where you own the entire stack. They iterate not just ML algorithms but also: - how to collect/label data - infrastructure - hardware ML models run on

    Anulează acțiunea
  12. 16 mar. 2021

    We’ve been incredibly impressed by what our students have been able to build over the last 10 weeks. Our demo day will be open to the public 2pm - 4pm, tomorrow Wed Mar 17 (PT). Come join us and give us feedback!

    Anulează acțiunea
  13. 3 mar. 2021

    The secret to good writing is to publish it before you get bored of it.

    Anulează acțiunea
  14. 18 feb. 2021

    My students are slowly realizing that if they want to run their models in browsers, they can't avoid JavaScript 🤪

    Anulează acțiunea
  15. 9 feb. 2021

    I was blown away when I saw that JAX could auto-vectorize my training loops. It's awesome to see 3 posts on JAX in my feed last week. * * * Should I do a JAX tutorial?

    Anulează acțiunea
  16. 6 feb. 2021

    Guys, hear me out. What if instead of showing students that same convnet picture in 10 different classes, some classes teach ML students engineering principles? Idk, something like unit test, CI/CD, performance profiling, typing, encoding, databases, etc.?

    Anulează acțiunea
  17. 4 feb. 2021

    So I wrote a 5400-word lecture note on the basics of data engineering for my students, covering: * data formats (row- vs. column-based, text vs. binary) * ETL * batch processing vs. stream processing * training datasets WIP. Feedback much appreciated!

    Anulează acțiunea
  18. a redistribuit
    3 feb. 2021

    A big part of the ML workflow is in debugging. However, debugging for ML is hard! In this post, analyzes major sources of errors & their solutions at the four steps: * labeling * feature engineering * model training * model evaluation

    Afișează acest fir
    Anulează acțiunea
  19. 22 ian. 2021

    Simpson's paradox is one of many reasons why it’s important to evaluate your models on different slices of data. Model 1 outperforms model 2 on group A and group B separately, but model 2 can still outperform model 1 overall. Statistics is amazing.

    Anulează acțiunea
  20. 17 ian. 2021

    Why no/low-code platforms are exciting In most ML projects, subject matter expertise is only used for labels. In most successful ML projects, SME is also used for feature engineering, error analysis, reranking predictions, etc. No/low-code allows SMEs to get directly involved!

    Anulează acțiunea

Încărcarea pare că durează ceva timp.

Este posibil ca Twitter să fie suprasolicitat momentan sau să întâmpine mici probleme. Încearcă din nou sau vizitează Starea Twitter pentru mai multe informații.

    S-ar putea să-ți placă și

    ·