Institute of Automation, Chinese Academy of Sciences
Stars
Code for ALBEF: a new vision-language pre-training method
[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model
PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
Multimodal model for text and tabular data, with HuggingFace transformers as the building block for the text data
TorchMultimodal is a PyTorch library for training state-of-the-art multimodal multi-task models at scale.
A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)
LAVIS - A One-stop Library for Language-Vision Intelligence
Open-source deep learning based unsupervised image retrieval toolbox built on PyTorch 🔥
The open-source tool for building high-quality datasets and computer vision models
OpenMMLab Pre-training Toolbox and Benchmark
PyTorch implementation of SimCLR: A Simple Framework for Contrastive Learning of Visual Representations
PyTorch implementation of SimCLR: A Simple Framework for Contrastive Learning of Visual Representations by T. Chen et al.
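Both SimCLR implementations above center on the NT-Xent (normalized temperature-scaled cross-entropy) contrastive loss from the Chen et al. paper. A minimal NumPy sketch of that loss, with the function name and batch setup being illustrative rather than taken from either repo:

```python
import numpy as np

def nt_xent_loss(z1, z2, temperature=0.5):
    """NT-Xent loss over two batches of embeddings from two augmented
    views; row i of z1 and row i of z2 are a positive pair."""
    z = np.concatenate([z1, z2], axis=0)                 # (2N, d)
    z = z / np.linalg.norm(z, axis=1, keepdims=True)     # L2-normalize
    sim = z @ z.T / temperature                          # scaled cosine sims
    np.fill_diagonal(sim, -np.inf)                       # exclude self-pairs
    n = z1.shape[0]
    # the positive for row i in the first half is row i in the second half
    pos = np.concatenate([np.arange(n, 2 * n), np.arange(n)])
    logsumexp = np.log(np.exp(sim).sum(axis=1))
    loss = -(sim[np.arange(2 * n), pos] - logsumexp)
    return loss.mean()
```

Identical views give a near-minimal loss, since each positive pair then has the maximum possible similarity relative to all negatives in the batch.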
A codebase for flexible and efficient Image Text Representation Alignment
Source code of paper "Remote Sensing Cross-Modal Image-Text Retrieval Based on Global and Local Information"
Download scripts for EPIC-KITCHENS
The source code of AMFMN and the dataset RSITMD
Parameter-Efficient Transfer Learning for Remote Sensing Image-Text Retrieval, 2023
Datasets for remote sensing images (Paper: Exploring Models and Data for Remote Sensing Image Caption Generation)
RS5M: a large-scale vision language dataset for remote sensing [TGRS]
🧀 [ACMMM'23 Oral] Official Code for “A Prior Instruction Representation Framework for Remote Sensing Image-text Retrieval”
[ICLRW 2024] Efficient Remote Sensing with Harmonized Transfer Learning and Modality Alignment
Open-source code for the paper "Enhancing Remote Sensing Vision-Language Models for Zero-Shot Scene Classification"
Mind the Gap: Understanding the Modality Gap in Multi-modal Contrastive Representation Learning
🛰️ Official repository of paper "RemoteCLIP: A Vision Language Foundation Model for Remote Sensing" (IEEE TGRS)
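The "Mind the Gap" entry above studies the modality gap: in contrastively trained models like CLIP, image and text embeddings occupy separated regions of the shared space. One common way to quantify it is the distance between the centroids of the two L2-normalized embedding clouds; a minimal sketch, with the function name and data being illustrative:

```python
import numpy as np

def modality_gap(image_emb, text_emb):
    """Euclidean distance between the centroids of L2-normalized image
    and text embedding clouds (one simple measure of the modality gap)."""
    img = image_emb / np.linalg.norm(image_emb, axis=1, keepdims=True)
    txt = text_emb / np.linalg.norm(text_emb, axis=1, keepdims=True)
    return float(np.linalg.norm(img.mean(axis=0) - txt.mean(axis=0)))
```

A gap of zero means the two clouds share a centroid; shifting one modality's embeddings by a constant offset produces a clearly nonzero gap.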