multimodal

notes for software engineers getting up to speed on new AI developments. Serves as datastore for https://latent.space writing, and product brainstorming, but has cleaned up canonical references under the /Resources folder.

ai openai gpt multimodal gpt-3 prompt-engineering stable-diffusion

Updated Aug 31, 2023

SkalskiP / courses

Star

This repository is a curated collection of links to various courses and resources about Artificial Intelligence (AI)

nlp machine-learning natural-language-processing tutorial deep-neural-networks computer-vision deep-learning transformers generative-model multimodal mlops stable-diffusion

Updated Aug 16, 2023
Python

kyegomez / tree-of-thoughts

Sponsor

Star

Plug in and Play Implementation of Tree of Thoughts: Deliberate Problem Solving with Large Language Models that Elevates Model Reasoning by atleast 70%

deep-learning prompt artificial-intelligence multimodal gpt4 prompt-learning prompt-tuning prompt-engineering chatgpt

Updated Aug 28, 2023
Python

IDEA-CCNL / Fengshenbang-LM

Star

Fengshenbang-LM(封神榜大模型)是IDEA研究院认知计算与自然语言研究中心主导的大模型开源体系，成为中文AIGC和认知智能的基础设施。

transformers pytorch chinese-nlp pretrained-models distributed-training multimodal aigc

Updated Jul 4, 2023
Python

OpenGVLab / InternGPT

Star

InternGPT (iGPT) is an open source demo platform where you can easily showcase your AI models. Now it supports DragGAN, ChatGPT, ImageBind, multimodal chat like GPT-4, SAM, interactive image editing, etc. Try it at igpt.opengvlab.com (支持DragGAN、ChatGPT、ImageBind、SAM的在线Demo系统)

Updated Jul 24, 2023
Python

open-mmlab / mmpretrain

Star

OpenMMLab Pre-training Toolbox and Benchmark

deep-learning pytorch image-classification resnet pretrained-models clip mae mobilenet moco multimodal self-supervised-learning constrastive-learning beit vision-transformer swin-transformer masked-image-modeling convnext

Updated Aug 31, 2023
Python

rom1504 / img2dataset

Star

Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.

image big-data deep-learning dataset image-dataset download-images multimodal

Updated Aug 27, 2023
Python

docarray / docarray

Star

🧬 Represent, send, and store multimodal data · Neural Search · Vector Search · Document Store

Updated Aug 31, 2023
Python

microsoft / torchscale

Star

Foundation Architecture for (M)LLMs

machine-learning natural-language-processing translation computer-vision transformer speech-processing multimodal pretrained-language-model

Updated Aug 11, 2023
Python

Stability-AI / stability-sdk

Star

SDK for interacting with stability.ai APIs (e.g. stable diffusion inference)

generative-art multimodal ai-art latent-diffusion stable-diffusion

Updated Aug 28, 2023
Jupyter Notebook

alan-ai / alan-sdk-web

Star

Conversational AI SDK for Web to build AI assistants to converse with a text chat or voice following with actions for your website or web app (JavaScript, React, Angular, Vue, Ember, Electron)

machine-learning text-to-speech sdk chatbot voice voice-commands speech-recognition voice-control voice-assistant conversational-ai vui multimodal voice-interface voice-ai alan-voice alan-ai alan-studio alan-web-sdk

Updated Aug 31, 2023

OFA-Sys / OFA

Star

Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework