Here are
220 public repositories
matching this topic...
🔮 Build cross-modal and multimodal applications on the cloud · Neural Search · Creative AI · Cloud Native · MLOps
Updated
Oct 3, 2022
Python
A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)
Updated
Aug 11, 2022
Python
🪩 Create Disco Diffusion artworks in one line
Updated
Oct 1, 2022
Python
Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework
Updated
Sep 30, 2022
Python
🧬 The data structure for unstructured multimodal data · Neural Search · Vector Search · Document Store
Updated
Oct 3, 2022
Python
Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.
Updated
Sep 28, 2022
Python
A curated list of Multimodal Related Research.
Updated
Sep 15, 2022
Python
SDK for interacting with stability.ai APIs (e.g. stable diffusion inference)
Updated
Oct 3, 2022
Jupyter Notebook
WIT (Wikipedia-based Image Text) Dataset is a large multimodal multilingual dataset comprising 37M+ image-text sets with 11M+ unique images across 100+ languages.
Easily compute clip embeddings and build a clip retrieval system with them
Updated
Oct 2, 2022
Jupyter Notebook
CVPR 2019: "Pluralistic Image Completion"
Updated
Jul 29, 2022
Python
Multi-Modal learning toolkit based on PaddlePaddle and PyTorch, supporting multiple applications such as multi-modal classification, cross-modal retrieval and image caption.
Updated
Aug 9, 2022
Python
Implementation of CoCa, Contrastive Captioners are Image-Text Foundation Models, in Pytorch
Updated
Jul 16, 2022
Python
Open-AI's DALL-E for large scale training in mesh-tensorflow.
Updated
Feb 12, 2022
Python
Platform for Situated Intelligence
An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"
Updated
Jun 1, 2022
Python
A CLI tool/python module for generating images from text using guided diffusion and CLIP from OpenAI.
Updated
Feb 8, 2022
Python
Official implementation for "Blended Diffusion for Text-driven Editing of Natural Images" [CVPR 2022]
Updated
Jun 14, 2022
Jupyter Notebook
(CVPR2021) Kaleido-BERT: Vision-Language Pre-training on Fashion Domain.
Updated
Jun 29, 2022
Python
Multi-Modal Transformer for Video Retrieval
Updated
May 10, 2021
Python
Improve this page
Add a description, image, and links to the
multimodal
topic page so that developers can more easily learn about it.
Curate this topic
Add this topic to your repo
To associate your repository with the
multimodal
topic, visit your repo's landing page and select "manage topics."
Learn more
You can’t perform that action at this time.
You signed in with another tab or window. Reload to refresh your session.
You signed out in another tab or window. Reload to refresh your session.