[go: up one dir, main page]

Skip to content
View CCF0211's full-sized avatar
  • Institute of Automation,Chinese Academy of Sciences

Highlights

  • Pro

Block or report CCF0211

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Code for ALBEF: a new vision-language pre-training method

Python 1,525 195 Updated Sep 20, 2022

[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model

Python 2,858 185 Updated Sep 19, 2024

PyTorch code for BLIP: Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation

Jupyter Notebook 4,694 623 Updated Aug 5, 2024

Multimodal model for text and tabular data with HuggingFace transformers as building block for text data

Python 580 82 Updated Sep 24, 2024

TorchMultimodal is a PyTorch library for training state-of-the-art multimodal multi-task models at scale.

Python 1,435 138 Updated Sep 30, 2024

A modular framework for vision & language multimodal research from Facebook AI Research (FAIR)

Python 5,486 935 Updated May 25, 2024

LAVIS - A One-stop Library for Language-Vision Intelligence

Jupyter Notebook 9,712 952 Updated Aug 23, 2024

Open source deep learning based unsupervised image retrieval toolbox built on PyTorch🔥

Python 1,165 178 Updated Jan 25, 2021

Referring Expression Datasets API

Jupyter Notebook 456 79 Updated Aug 27, 2024

The open-source tool for building high-quality datasets and computer vision models

Python 8,150 546 Updated Sep 30, 2024
Jupyter Notebook 1,121 544 Updated May 13, 2024

OpenMMLab Pre-training Toolbox and Benchmark

Python 3,392 1,055 Updated Aug 20, 2024

PyTorch implementation of SimCLR: A Simple Framework for Contrastive Learning of Visual Representations

Jupyter Notebook 2,224 460 Updated Mar 4, 2024

PyTorch implementation of SimCLR: A Simple Framework for Contrastive Learning of Visual Representations by T. Chen et al.

Python 755 164 Updated May 21, 2024

A codebase for flexible and efficient Image Text Representation Alignment

Python 14 Updated Jun 20, 2023

Source code of paper "Remote Sensing Cross-Modal Image-Text Retrieval Based on Global and Local Information"

Python 57 15 Updated Oct 25, 2023

Download scripts for EPIC-KITCHENS

Python 121 26 Updated Aug 10, 2024

The source code of AMFMN and the dataset RSITMD

Python 191 132 Updated Oct 25, 2023

Parameter-Efficient Transfer Learning for Remote Sensing Image-Text Retrieval, 2023

Python 13 Updated Jan 14, 2024

Datasets for remote sensing images (Paper:Exploring Models and Data for Remote Sensing Image Caption Generation)

172 28 Updated Nov 28, 2021

RS5M: a large-scale vision language dataset for remote sensing [TGRS]

Python 197 9 Updated Sep 29, 2024

🧀 [ACMMM'23 Oral] Official Code for “A Prior Instruction Representation Framework for Remote Sensing Image-text Retrieval”

Python 26 2 Updated Jan 19, 2024

[ICLRW 2024] Efficient Remote Sensing with Harmonized Transfer Learning and Modality Alignment

Python 30 Updated Jul 18, 2024
Python 24 2 Updated Dec 15, 2023

Chinese CLIP models with SOTA performance.

Python 47 4 Updated Aug 28, 2023

Open-source code for the paper "Enhancing Remote Sensing Vision-Language Models for Zero-Shot Scene Classification"

Python 33 Updated Sep 12, 2024

Mind the Gap: Understanding the Modality Gap in Multi-modal Contrastive Representation Learning

Jupyter Notebook 114 8 Updated Sep 26, 2022

🛰️ Official repository of paper "RemoteCLIP: A Vision Language Foundation Model for Remote Sensing" (IEEE TGRS)

Jupyter Notebook 276 17 Updated Jun 27, 2024

CLIP-like model evaluation

Jupyter Notebook 590 75 Updated Aug 16, 2024
Next