Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in Pytorch
Updated Sep 16, 2023 - Python
ModelScope: bring the notion of Model-as-a-Service to life.
Open Source Routing Engine for OpenStreetMap
Vector search for humans. Also available on cloud - cloud.marqo.ai
Chinese and English multimodal conversational language model
Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.
Represent, send, store and search multimodal data
An Open Toolkit for Knowledge Graph Extraction and Construction published at EMNLP2022 System Demonstrations.
A state-of-the-art-level open visual language model (multimodal pre-trained model)
A robust, all-in-one GPT interface for Discord. ChatGPT-style conversations, image generation, AI-moderation, custom indexes/knowledgebase, youtube summarizer, and more!
Recent Transformer-based CV and related works.
🥂 Gracefully face hCaptcha challenge with MoE(ONNX) embedded solution.
Video-LLaVA: Learning United Visual Representation by Alignment Before Projection
Project Page for "LISA: Reasoning Segmentation via Large Language Model"
[pip install medmnist] 18 MNIST-like Datasets for 2D and 3D Biomedical Image Classification
[NeurIPS 2023] MotionGPT: Human Motion as a Foreign Language, a unified motion-language generation model using LLMs
A collection of industry classics and cutting-edge papers in the field of recommendation/advertising/search.
SALMONN: Speech Audio Language Music Open Neural Network
A curated list of Visual Question Answering (VQA) (Image/Video Question Answering), Visual Question Generation, Visual Dialog, Visual Commonsense Reasoning, and related areas.
The TypeScript library for building multi-modal AI applications.