Updated Jul 26, 2023 - Python
multimodal
Here are 375 public repositories matching this topic...
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
Updated Jul 25, 2023 - Python
A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)
Updated Jul 18, 2023 - Python
Large Language-and-Vision Assistant built towards multimodal GPT-4 level capabilities.
Updated Jul 26, 2023 - Python
🪩 Create Disco Diffusion artworks in one line
Updated May 16, 2023 - Python
Notes for software engineers getting up to speed on new AI developments. Serves as a datastore for https://latent.space writing and product brainstorming, with cleaned-up canonical references under the /Resources folder.
Updated Jul 26, 2023
This repository is a curated collection of links to various courses and resources about Artificial Intelligence (AI)
Updated Jun 26, 2023 - Python
Plug in and Play Implementation of Tree of Thoughts: Deliberate Problem Solving with Large Language Models that Elevates Model Reasoning by at least 70%
Updated Jun 16, 2023 - Python
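Fengshenbang-LM (封神榜大模型) is an open-source large-model system led by the Cognitive Computing and Natural Language Research Center of the IDEA Research Institute, serving as infrastructure for Chinese AIGC and cognitive intelligence.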
Updated Jul 4, 2023 - Python
InternGPT (iGPT) is an open-source demo platform where you can easily showcase your AI models. It now supports DragGAN, ChatGPT, ImageBind, multimodal chat like GPT-4, SAM, interactive image editing, and more. Try it at igpt.opengvlab.com (an online demo system supporting DragGAN, ChatGPT, ImageBind, and SAM).
Updated Jul 24, 2023 - Python
OpenMMLab Pre-training Toolbox and Benchmark
Updated Jul 26, 2023 - Python
Easily turn large sets of image URLs into an image dataset. Can download, resize, and package 100M URLs in 20h on one machine.
Updated Jul 15, 2023 - Python
Updated Jul 27, 2023 - Python
SDK for interacting with stability.ai APIs (e.g. stable diffusion inference)
Updated Jul 26, 2023 - Jupyter Notebook
Conversational AI SDK for Web for building AI assistants that converse through text chat or voice and follow up with actions in your website or web app (JavaScript, React, Angular, Vue, Ember, Electron)
Updated Jul 18, 2023
Foundation Architecture for (M)LLMs
Updated Jul 26, 2023 - Python
Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework
Updated Jun 5, 2023 - Python
Conversational AI SDK for iOS to build AI-powered voice assistants for iOS applications (Swift, Objective-C)
Updated Jul 13, 2023 - Objective-C
Conversational AI SDK for Android to build AI-powered voice assistants for Android applications (Java, Kotlin)
Updated Jul 13, 2023
Conversational AI SDK for Flutter to build AI-powered voice assistants for Flutter applications (iOS and Android)
Updated Apr 28, 2023 - Ruby