👋
Hi, I'm Romain Beaumont aka rom1504. I build and deploy ML infra to solve important problems.
Recent work:
- LLMs at YouTube
- KNNs at Criteo
- Laion5B and OpenClip in open source ML
- Mineflayer and PrismarineJS in open source javascript
🗒 Blog posts
- Semantic search at billions scale
- Semantic search with embeddings: index anything - Building scalable semantic retrieval from image, text, graph, and interaction data
- Image embeddings - Image similarity and building embeddings with modern computer vision
- Learning computer vision - A short introduction to computer vision
Blog posts with laion
- open coca
- clip H/14 index
- Laion translated and laion coco
- Clip H/14
- Laion aesthetic
- Laion400m and laion5B
Selection of papers
- Reproducible scaling laws for contrastive language-image learning
- Laion-5b: An open large-scale dataset for training next generation image-text models
- LAION-400M: Open Dataset of CLIP-Filtered 400 Million Image-Text Pairs






