A simple command line tool for text to image generation, using OpenAI's CLIP and a BigGAN. Technique was originally created by https://twitter.com/advadnoun
-
Updated
Feb 6, 2022 - Python
A simple command line tool for text to image generation, using OpenAI's CLIP and a BigGAN. Technique was originally created by https://twitter.com/advadnoun
A Comparative Framework for Multimodal Recommender Systems
[TPAMI 2023] Multimodal Image Synthesis and Editing: The Generative AI Era
An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"
Effective prompting for Large Multimodal Models like GPT-4 Vision or LLaVA. 🔥
Automated modeling and machine learning framework FEDOT
A CLI tool/python module for generating images from text using guided diffusion and CLIP from OpenAI.
✨✨Woodpecker: Hallucination Correction for Multimodal Large Language Models. The first work to correct hallucinations in MLLMs.
X-VLM: Multi-Grained Vision Language Pre-Training (ICML 2022)
A knowledge base construction engine for richly formatted data
Sequence-to-Sequence Framework in PyTorch
GPT4RoI: Instruction Tuning Large Language Model on Region-of-Interest
An official implementation for " UniVL: A Unified Video and Language Pre-Training Model for Multimodal Understanding and Generation"
Attention-based multimodal fusion for sentiment analysis
An open source implementation of "Scaling Autoregressive Multi-Modal Models: Pretraining and Instruction Tuning", an all-new multi modal AI that uses just a decoder to generate both text and images
DANCE: A Deep Learning Library and Benchmark Platform for Single-Cell Analysis
A Survey on multimodal learning research.
Pytorch implementation of CVPR2020 paper “VectorNet: Encoding HD Maps and Agent Dynamics from Vectorized Representation”
This repository contains code and metadata of How2 dataset
Towards Generalist Biomedical AI
Add a description, image, and links to the multimodality topic page so that developers can more easily learn about it.
To associate your repository with the multimodality topic, visit your repo's landing page and select "manage topics."