Here are 93 public repositories matching the multimodal-learning topic.
Reading list for research topics in multimodal machine learning
A curated list of Multimodal Related Research.
Updated
Jul 29, 2021
Python
Papers, code and datasets about deep learning and multi-modal learning for video analysis
A Comparative Framework for Multimodal Recommender Systems
Updated
Feb 19, 2022
Python
Official Pytorch implementation of "OmniNet: A unified architecture for multi-modal multi-task learning" | Authors: Subhojeet Pramanik, Priyanka Agrawal, Aman Hussain
Updated
Oct 31, 2020
Python
Knowledge-Aware machine LEarning (KALE): accessible machine learning from multiple sources for interdisciplinary research, part of the 🔥 PyTorch ecosystem
Updated
Apr 7, 2022
Python
An official implementation for "CLIP4Clip: An Empirical Study of CLIP for End to End Video Clip Retrieval"
Updated
Dec 2, 2021
Python
Multimodal model for text and tabular data with HuggingFace transformers as building block for text data
Updated
Feb 16, 2022
Python
This repository contains various models targeting multimodal representation learning and multimodal fusion for downstream tasks such as multimodal sentiment analysis.
Updated
Nov 29, 2021
OpenEdge ABL
[NeurIPS 2021] Multiscale Benchmarks for Multimodal Representation Learning
A curated list of awesome vision and language resources (still under construction... stay tuned!)
Interface for easier topic modelling.
Updated
Apr 6, 2022
Python
Multi-modal Transformers Excel at Class-agnostic Object Detection
Updated
Feb 1, 2022
Python
Updated
Aug 17, 2018
Python
[AAAI 2018] Memory Fusion Network for Multi-view Sequential Learning
Updated
Aug 4, 2020
Python
my solution with 0.67 accuracy
Updated
May 21, 2019
Python
Code for the paper "VistaNet: Visual Aspect Attention Network for Multimodal Sentiment Analysis", AAAI'19
Updated
Sep 9, 2020
Python
Implementation of CVPR 2020 paper "MMTM: Multimodal Transfer Module for CNN Fusion"
Updated
Jun 16, 2020
Python
PyTorch implementation of our graph convolutional network (GCN) for human motion generation from music. Also with paired dance-music data for training!
Updated
Apr 22, 2021
Python
[ICCV 2021 Oral] Just Ask: Learning to Answer Questions from Millions of Narrated Videos
Updated
Oct 6, 2021
Jupyter Notebook
[ICLR 2019] Learning Factorized Multimodal Representations
Updated
Aug 4, 2020
Python
Source code for training Gated Multimodal Units on MM-IMDb dataset
Updated
Nov 20, 2020
Python
Code for selecting an action based on multimodal inputs; in this case, the inputs are voice and text.
Updated
Jun 7, 2021
Python
[CVPR 2022] Show Me What and Tell Me How: Video Synthesis via Multimodal Conditioning
Multi-Modal learning toolkit based on PaddlePaddle and PyTorch, supporting multiple applications such as multi-modal classification, cross-modal retrieval and image caption.
Updated
Apr 2, 2022
Python
Updated
Oct 8, 2021
Python
A paper list of pre-trained language models (PLMs).
Official implementation of the paper "MSAF: Multimodal Split Attention Fusion"
Updated
Jun 16, 2021
Python
Visually informed embedding of word (VIEW) is a tool for transferring multimodal background knowledge to NLP algorithms.
Updated
Sep 18, 2016
Python
Code for the paper "Multimodal Review Generation for Recommender Systems", WWW'19
Updated
Sep 19, 2020
Python