Skip to content
A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)
Python JavaScript Other
Branch: master
Clone or download

Latest commit

vedanuj and facebook-github-bot [docs] Add challenge pages for VQA and TextVQA (#321)
Summary:
- Added tutorials for VQA Challenge and TextVQA Challenge
- Remove Other Challenges page
Pull Request resolved: #321

Reviewed By: apsdehal

Differential Revision: D22013845

Pulled By: vedanuj

fbshipit-source-id: cf0aa797fa2a3373fca9651a671caa81ab728ec8
Latest commit a6c6b46 Jun 12, 2020

Files

Permalink
Type Name Latest commit message Commit time
Failed to load latest commit information.
.circleci [feat,refactor] build,visualize dataset; refactor MultiDataset, fix HM ( May 23, 2020
.github [feature] Automatic gh-pages deployment using github actions (#320) Jun 12, 2020
docs [docs] Update citation (#313) Jun 11, 2020
mmf [fix] Add new hm checksum and disable dark mode, link to blog (#323) Jun 12, 2020
mmf_cli [fix] Add new hm checksum and disable dark mode, link to blog (#323) Jun 12, 2020
projects [fix] Pretrain config for mmimdb (#301) Jun 9, 2020
tests [fix] Sleep and retry zoo tests for more stability (#318) Jun 11, 2020
tools [feature, fix] Capture stdout in logger, fix logger initialization (#284 Jun 3, 2020
website [docs] Add challenge pages for VQA and TextVQA (#321) Jun 12, 2020
.editorconfig [enhancement] Cache CircleCI tests, add isort, black tests (#60) May 8, 2020
.flake8 [fix] Fix flake8 errors (#42) May 8, 2020
.gitignore [docs] Fix docs build and some refactoring (#98) May 8, 2020
.pre-commit-config.yaml [feature] Add MMF website, unify docs; move docs to pytorch_sphinx_th… Jun 3, 2020
CODE_OF_CONDUCT.md Update COC and Contributing Apr 22, 2019
CONTRIBUTING.md [refactor] Rename to MMF (#80) May 8, 2020
LICENSE [refactor] Rename to MMF (#80) May 8, 2020
MANIFEST.in [fix] Sdist build and add bypass_checksum feature for hm_convert (#309) Jun 9, 2020
NOTICES [chore] Add licenses to files missing it along with precommit hook (#270 May 23, 2020
README.md [docs] Update citation (#313) Jun 11, 2020
pyproject.toml [feature] Add MMF website, unify docs; move docs to pytorch_sphinx_th… Jun 3, 2020
requirements.txt [fix] Sdist build and add bypass_checksum feature for hm_convert (#309) Jun 9, 2020
setup.py [fix] phoc build during setup.py (#285) Jun 2, 2020

README.md


MMF is a modular framework for vision and language multimodal research from Facebook AI Research. MMF contains reference implementations of state-of-the-art vision and language models and has powered multiple research projects at Facebook AI Research. See full list of project inside or built on MMF here.

MMF is powered by PyTorch, allows distributed training and is un-opinionated, scalable and fast. Use MMF to bootstrap for your next vision and language multimodal research project by following the installation instructions. Take a look at list of MMF features here.

MMF also acts as starter codebase for challenges around vision and language datasets (The Hateful Memes, TextVQA, TextCaps and VQA challenges). MMF was formerly known as Pythia. The next video shows an overview of how datasets and models work inside MMF. Checkout MMF's video overview.

Installation

Follow installation instructions in the documentation.

Documentation

Learn more about MMF here.

Citation

If you use MMF in your work or use any models published in MMF, please cite:

@misc{singh2020mmf,
  author =       {Singh, Amanpreet and Goswami, Vedanuj and Natarajan, Vivek and Jiang, Yu and Chen, Xinlei and Shah, Meet and
                 Rohrbach, Marcus and Batra, Dhruv and Parikh, Devi},
  title =        {MMF: A multimodal framework for vision and language research},
  howpublished = {\url{https://github.com/facebookresearch/mmf}},
  year =         {2020}
}

License

MMF is licensed under BSD license available in LICENSE file

You can’t perform that action at this time.