xinyu1205 / Recognize_Anything-Tag2Text
Code for the Recognize Anything Model and Tag2Text Model
See what the GitHub community is most excited about today.
Code for the Recognize Anything Model and Tag2Text Model
<
Linux, Jenkins, AWS, SRE, Prometheus, Docker, Python, Ansible, Git, Kubernetes, Terraform, OpenStack, SQL, NoSQL, Azure, GCP, DNS, Elastic, Network, Virtualization. DevOps Interview Questions
OpenDAN is an open source Personal AI OS , which consolidates various AI modules in one place for your personal use.
It's React, but in Python
中文LLaMA&Alpaca大语言模型+本地CPU/GPU训练部署 (Chinese LLaMA & Alpaca LLMs)
Chat with your documents offline using AI.
Implementation of the StableLM/Pythia/INCITE language models based on nanoGPT. Supports flash attention, LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.
with 100k context windows on the way, it's now feasible for every dev to have their own smol developer
[CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
A GPT-empowered penetration testing tool
TigerBot: A multi-language multi-task LLM
MiniGPT-4: Enhancing Vision-language Understanding with Advanced Large Language Models
Book_4_《矩阵力量》 | 鸢尾花书:从加减乘除到机器学习;上架!
FastAPI framework, high performance, easy to learn, fast to code, ready for production
This is my video documentation. Here you'll find code-snippets, technical documentation, templates, command reference, and whatever is needed for all my YouTube Videos.
Code for CRATE (Coding RAte reduction TransformEr).
CVE-2023-25157 - GeoServer SQL Injection - PoC
Implementation of the LLaMA language model based on nanoGPT. Supports flash attention, Int8 and GPTQ 4bit quantization, LoRA and LLaMA-Adapter fine-tuning, pre-training. Apache 2.0-licensed.
Video-LLaMA: An Instruction-tuned Audio-Visual Language Model for Video Understanding
A collection of SOTA real-time, multi-object trackers for object detectors