Here are 54 public repositories matching this topic...
A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)
Updated Oct 19, 2022 (Python)
Code for "Aligning Linguistic Words and Visual Semantic Units for Image Captioning", ACM MM 2019
Updated Oct 18, 2019 (Python)
Fully-Convolutional Point Networks for Large-Scale Point Clouds
Updated Mar 22, 2019 (Python)
Applying Common-Sense Reasoning to Multi-Modal Dense Video Captioning and Video Question Answering | Python3 | PyTorch | CNNs | Causality | Reasoning | LSTMs | Transformers | Multi-Head Self Attention | Published in IEEE Winter Conference on Applications of Computer Vision (WACV) 2021
Updated Jun 22, 2022 (Python)
A Base TensorFlow Project for Medical Report Generation
Updated Jun 16, 2019 (Python)
Python code for handling the Clotho dataset.
Updated Nov 24, 2020 (Python)
Updated Aug 17, 2021 (Python)
What and How Well You Performed? A Multitask Learning Approach to Action Quality Assessment
Updated Jul 28, 2021 (Python)
A PyTorch implementation of the Attention on Attention module (both self and guided variants) for Visual Question Answering
Updated Nov 8, 2020 (Python)
Audio captioning baseline system for DCASE 2020 challenge.
Updated Jun 22, 2022 (Python)
A Tennis dataset and models for event detection & commentary generation
Updated Aug 17, 2020 (Python)
[CVPR 2022] X-Trans2Cap: Cross-Modal Knowledge Transfer using Transformer for 3D Dense Captioning
Updated Aug 26, 2022 (Python)
My notes on some Deep Learning papers
Updated Nov 15, 2017 (Jupyter Notebook)
Toolkit for supporting the EBU-TT Live specification
Updated Sep 16, 2022 (Python)
CaMEL: Mean Teacher Learning for Image Captioning. ICPR 2022
Updated Apr 8, 2022 (Python)
S2VT (seq2seq) video captioning with Bahdanau & Luong attention, implemented in TensorFlow
Updated Apr 26, 2018 (Python)
Some papers about *diverse* image (and a few video) captioning
Official Python implementation of R3-Transformer
Updated Nov 30, 2020 (Python)
Sample app to display live captioning to a WebRTC video session with the Deepgram API.
Updated Nov 22, 2021 (JavaScript)