captioning

Star

Here are 54 public repositories matching this topic...

facebookresearch / mmf

Star

A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)

deep-learning dialog pytorch vqa pretrained-models captioning multimodal multi-tasking textvqa hateful-memes

Updated Oct 19, 2022
Python

ltguo19 / VSUA-Captioning

Star

Code for "Aligning Linguistic Words and Visual Semantic Units for Image Captioning", ACM MM 2019

nlp deep-learning pytorch captioning language-generation

Updated Oct 18, 2019
Python

drethage / fully-convolutional-point-network

Star

Fully-Convolutional Point Networks for Large-Scale Point Clouds

deep-neural-networks computer-vision deep-learning point-cloud point-clouds semantic-segmentation meshes 3d captioning

Updated Mar 22, 2019
Python

Updated Jun 22, 2022
Python

wangleihitcs / MedicalReportGeneration

Star

A Base Tensorflow Project for Medical Report Generation

tensorflow-models captioning medical-report-generate

Updated Jun 16, 2019
Python

audio-captioning / clotho-dataset

Star

Python code for handling the Clotho dataset.

audio natural-language-processing deep-learning audio-signal-processing captioning audio-captioning clotho-dataset

Updated Nov 24, 2020
Python

TheShadow29 / VidSitu

Star

[CVPR21] Visual Semantic Role Labeling for Video Understanding (https://arxiv.org/abs/2104.00990)

nlp video vision srl captioning captioning-videos vision-and-language grounding video-language event-relations semantic-roles

Updated Aug 17, 2021
Python

ParitoshParmar / MTL-AQA

Star

What and How Well You Performed? A Multitask Learning Approach to Action Quality Assessment

pytorch video-processing lstm representation-learning action-recognition video-understanding c3d video-captioning captioning fine-grained-classification multitask-learning dilated-convolution action-quality-assessment mtl-aqa fine-grained-action-recognition dilated-c3d

Updated Jul 28, 2021
Python

lucidrains / AoA-pytorch

Sponsor

Star

A Pytorch implementation of Attention on Attention module (both self and guided variants), for Visual Question Answering

vqa attention attention-mechanism captioning visual-question-answering

Updated Nov 8, 2020
Python

audio-captioning / dcase-2020-baseline

Star

Audio captioning baseline system for DCASE 2020 challenge.

machine-learning deep-neural-networks deep-learning signal-processing audio-signal-processing captioning dcase machine-listening audio-captioning dcase2020

Updated Jun 22, 2022
Python

HaydenFaulkner / Tennis

Star

A Tennis dataset and models for event detection & commentary generation

machine-learning video computer-vision mxnet dataset tennis gluon sportsanalytics fine-grained captioning eventdetection

Updated Aug 17, 2020
Python

CurryYuan / X-Trans2Cap

Star

[CVPR 2022] X-Trans2Cap: Cross-Modal Knowledge Transfer using Transformer for 3D Dense Captioning

captioning cvpr2022

Updated Aug 26, 2022
Python

elbayadm / PaperNotes

Star

My notes on some Deep Learning papers

deep-learning paper-notes seq2seq papers captioning

Updated Dec 8, 2018
HTML

alecwangcq / show-attend-and-tell

Star

captioning

Updated Nov 15, 2017
Jupyter Notebook

ebu / ebu-tt-live-toolkit

Star

Toolkit for supporting the EBU-TT Live specification

python video live captions subtitles broadcast ebu-tt subtitling captioning

Updated Sep 16, 2022
Python

aimagelab / camel

Star

CaMEL: Mean Teacher Learning for Image Captioning. ICPR 2022

computer-vision pytorch artificial-intelligence image-captioning captioning-images captioning

Updated Apr 8, 2022
Python

AdrianHsu / S2VT-seq2seq-video-captioning-attention

Star

S2VT (seq2seq) video captioning with bahdanau & luong attention implementation in Tensorflow

video deep-learning tensorflow seq2seq attention-mechanism captioning

Updated Apr 26, 2018
Python

RyanLiut / awesome-diverse-captioning

Star

Some papers about *diverse* image (a few videos) captioning

diversity captioning

Updated Jun 7, 2022

hassanhub / R3Transformer

Star

Official python implementation of R3-Transformer

transformer captioning r3-transformer

Updated Nov 30, 2020
Python

deepgram-devs / video-chat

Star

Sample app to display live captioning to a WebRTC video session with the Deepgram API.

webrtc speech-recognition speech-to-text captioning deepgram

Updated Nov 22, 2021
JavaScript

Improve this page

Add a description, image, and links to the captioning topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the captioning topic, visit your repo's landing page and select "manage topics."

Learn more

captioning

Here are 54 public repositories matching this topic...

facebookresearch / mmf

ltguo19 / VSUA-Captioning

drethage / fully-convolutional-point-network

amanchadha / iPerceive

wangleihitcs / MedicalReportGeneration

audio-captioning / clotho-dataset

TheShadow29 / VidSitu

ParitoshParmar / MTL-AQA

lucidrains / AoA-pytorch

audio-captioning / dcase-2020-baseline

HaydenFaulkner / Tennis

CurryYuan / X-Trans2Cap

elbayadm / PaperNotes

alecwangcq / show-attend-and-tell

ebu / ebu-tt-live-toolkit

aimagelab / camel

AdrianHsu / S2VT-seq2seq-video-captioning-attention

RyanLiut / awesome-diverse-captioning

hassanhub / R3Transformer

deepgram-devs / video-chat

Improve this page

Add this topic to your repo