Stars
LLM-Seg: Bridging Image Segmentation and Large Language Model Reasoning
Train transformer language models with reinforcement learning.
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
Enjoy the magic of Diffusion models!
[CVPR 2023] Official Implementation of X-Decoder for generalized decoding for pixel, image and language
Official release of InternLM2.5 base and chat models. 1M context support
fastdup is a powerful, free tool designed to rapidly generate valuable insights from image and video datasets. It helps enhance the quality of both images and labels, while significantly reducing d…
Instruct-tune LLaMA on consumer hardware
VMamba: Visual State Space Models,code is based on mamba
This is a collection of our NAS and Vision Transformer work.
《Hello 算法》:动画图解、一键运行的数据结构与算法教程。支持 Python, Java, C++, C, C#, JS, Go, Swift, Rust, Ruby, Kotlin, TS, Dart 代码。简体版和繁体版同步更新,English version ongoing
🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch and FLAX.
OpenMMLab Multimodal Advanced, Generative, and Intelligent Creation Toolbox. Unlock the magic 🪄: Generative-AI (AIGC), easy-to-use APIs, awsome model zoo, diffusion models, for text-to-image genera…
📚 A collection of papers about Referring Image Segmentation.
Language-Driven Semantic Segmentation
FlyCV is a high-performance library for processing computer visual tasks.
solo-learn: a library of self-supervised methods for visual representation learning powered by Pytorch Lightning
OpenMMLab Pre-training Toolbox and Benchmark
Python机器学习算法技术博客,有原创干货!有code实践! 【更多内容敬请关注公众号 "算法进阶"】
Painter & SegGPT Series: Vision Foundation Models from BAAI
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.