🔮 Multimodal AI services & pipelines with cloud-native stack: gRPC, Kubernetes, Docker, OpenTelemetry, Prometheus, Jaeger, etc.
-
Updated
Aug 30, 2023 - Python
🔮 Multimodal AI services & pipelines with cloud-native stack: gRPC, Kubernetes, Docker, OpenTelemetry, Prometheus, Jaeger, etc.
Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities
A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)
Visual Instruction Tuning: Large Language-and-Vision Assistant built towards multimodal GPT-4 level capabilities.
🪩 Create Disco Diffusion artworks in one line
notes for software engineers getting up to speed on new AI developments. Serves as datastore for https://latent.space writing, and product brainstorming, but has cleaned up canonical references under the /Resources folder.
This repository is a curated collection of links to various courses and resources about Artificial Intelligence (AI)
Plug in and Play Implementation of Tree of Thoughts: Deliberate Problem Solving with Large Language Models that Elevates Model Reasoning by atleast 70%
Fengshenbang-LM(封神榜大模型)是IDEA研究院认知计算与自然语言研究中心主导的大模型开源体系,成为中文AIGC和认知智能的基础设施。
InternGPT (iGPT) is an open source demo platform where you can easily showcase your AI models. Now it supports DragGAN, ChatGPT, ImageBind, multimodal chat like GPT-4, SAM, interactive image editing, etc. Try it at igpt.opengvlab.com (支持DragGAN、ChatGPT、ImageBind、SAM的在线Demo系统)
OpenMMLab Pre-training Toolbox and Benchmark
Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.
🧬 Represent, send, and store multimodal data · Neural Search · Vector Search · Document Store
Foundation Architecture for (M)LLMs
SDK for interacting with stability.ai APIs (e.g. stable diffusion inference)
Conversational AI SDK for Web to build AI assistants to converse with a text chat or voice following with actions for your website or web app (JavaScript, React, Angular, Vue, Ember, Electron)
Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework
Conversational AI SDK for iOS to build AI-powered voice assistants for iOS applications (Swift, Objective-C)
Conversational AI SDK for Android to build AI-powered voice assistants for Android applications (Java, Kotlin)
Conversational AI SDK for Flutter to build AI-powered voice assistants for Flutter applications (iOS and Android)
Add a description, image, and links to the multimodal topic page so that developers can more easily learn about it.
To associate your repository with the multimodal topic, visit your repo's landing page and select "manage topics."