RealTapeL

Starred repositories

IDEA-Research / Grounded-Segment-Anything

Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything

Jupyter Notebook 15,167 1,403 Updated Sep 5, 2024

IDEA-Research / GroundingDINO

[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"

Python 6,757 684 Updated Aug 12, 2024

mlfoundations / open_clip

An open source implementation of CLIP.

Python 10,323 981 Updated Nov 12, 2024

PKU-Alignment / safe-rlhf

Safe RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback

Python 1,347 120 Updated Jun 13, 2024

thu-coai / Safety-Prompts

Chinese safety prompts for evaluating and improving the safety of LLMs.

871 81 Updated Feb 27, 2024

microsoft / DeepSpeedExamples

Example models using DeepSpeed

Python 6,086 1,038 Updated Nov 7, 2024

caoyunkang / Segment-Any-Anomaly

Official implementation of "Segment Any Anomaly without Training via Hybrid Prompt Regularization (SAA+)".

Jupyter Notebook 731 75 Updated Dec 20, 2023

LilitYolyan / CutPaste

Unofficial implementation of Google "CutPaste: Self-Supervised Learning for Anomaly Detection and Localization" in PyTorch

Python 115 25 Updated Apr 21, 2022

zllrunning / face-parsing.PyTorch

Using modified BiSeNet for face parsing in PyTorch

Python 2,315 455 Updated May 21, 2023

lipku / LiveTalking

Real time interactive streaming digital human

Python 3,929 563 Updated Nov 16, 2024

xszyou / Fay

Fay is an open-source digital human framework integrating language models and digital characters. It offers retail, assistant, and agent versions for diverse applications like virtual shopping guid…

JavaScript 9,208 1,793 Updated Nov 13, 2024

coqui-ai / TTS

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

Python 35,474 4,332 Updated Aug 16, 2024

z-x-yang / Segment-and-Track-Anything

An open-source project dedicated to tracking and segmenting any objects in videos, either automatically or interactively. The primary algorithms utilized include the Segment Anything Model (SAM) fo…

Jupyter Notebook 2,850 341 Updated Apr 25, 2024

gaomingqi / Track-Anything

Track-Anything is a flexible and interactive tool for video object tracking and segmentation, based on Segment Anything, XMem, and E2FGVI.

Python 6,504 482 Updated May 31, 2024

zyc00 / Point-SAM

Point-SAM: This is the official repository of "Point-SAM: Promptable 3D Segmentation Model for Point Clouds". We provide codes for running our demo and links to download checkpoints.

Python 135 8 Updated Aug 8, 2024

CASIA-IVA-Lab / FastSAM

Fast Segment Anything

Python 7,502 709 Updated Jul 30, 2024

yizhongw / self-instruct

Aligning pretrained language models with instruction data generated by themselves.

Python 4,159 487 Updated Mar 27, 2023

vxfla / kanchil

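Kanchil (the lesser mouse-deer) is the world's smallest even-toed ungulate. This open-source project explores whether small models (under 6B parameters) can also be aligned with human preferences.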

Python 114 5 Updated Apr 1, 2023

Stability-AI / stablediffusion

High-Resolution Image Synthesis with Latent Diffusion Models

Python 39,190 5,053 Updated Oct 10, 2024

ZHKKKe / MODNet

A Trimap-Free Portrait Matting Solution in Real Time [AAAI 2022]

Python 3,819 636 Updated May 6, 2024

SysCV / sam-hq

Segment Anything in High Quality [NeurIPS 2023]

Jupyter Notebook 3,712 224 Updated Nov 18, 2024

xinntao / Real-ESRGAN

Real-ESRGAN aims at developing Practical Algorithms for General Image/Video Restoration.

Python 28,452 3,574 Updated Aug 6, 2024

sczhou / CodeFormer

[NeurIPS 2022] Towards Robust Blind Face Restoration with Codebook Lookup Transformer

Python 15,861 3,341 Updated Oct 9, 2024

DLLXW / baby-llama2-chinese

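A repository for pretraining from scratch and then SFT-tuning a small-parameter Chinese LLaMa2; it runs on a single 24 GB GPU and yields a chat-llama2 with basic Chinese question-answering ability.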

Python 2,538 311 Updated May 21, 2024

xdit-project / xDiT

xDiT: A Scalable Inference Engine for Diffusion Transformers (DiTs) with Massive Parallelism

Python 702 55 Updated Nov 18, 2024

PeterH0323 / Streamer-Sales

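Streamer-Sales (Sales Champion): a livestream sales-host LLM 🛒🎁 that presents products based on their given features, with the aim of stimulating users' desire to buy. 🚀⭐ Includes a detailed data generation pipeline ❗ 📦 It also integrates LMDeploy for accelerated inference 🚀, RAG (retrieval-augmented generation) 📚, TTS (text-to-speech) 🔊, digital human generation 🦸, an Agent that queries the web for real-time information 🌐, ASR (speech-to-text) 🎙️, a front end built on the Vue ecosystem 🍍, FastAPI …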

Python 2,582 391 Updated Nov 11, 2024

heliossun / SQ-LLaVA

Visual self-questioning for large vision-language assistant.

Python 31 2 Updated Oct 1, 2024

Haoqiu-Yan / PerceptiveAgent

Code for "Talk With Human-like Agents: Empathetic Dialogue Through Perceptible Acoustic Reception and Reaction" (ACL24)

Python 32 1 Updated Aug 6, 2024

XPixelGroup / BasicSR

Open Source Image and Video Restoration Toolbox for Super-resolution, Denoise, Deblurring, etc. Currently, it includes EDSR, RCAN, SRResNet, SRGAN, ESRGAN, EDVR, BasicVSR, SwinIR, ECBSR, etc. Also …

Python 6,858 1,196 Updated Jul 21, 2024

pytorch / torchchat

Run PyTorch LLMs locally on servers, desktop and mobile

Python 3,378 221 Updated Nov 16, 2024

vision-language-model

llava

roughness

named-entity-recognition

Java

agent

fault-diagnosis

Computer vision

Machine learning

Deep learning