[go: up one dir, main page]

Skip to content
View lisongquan95's full-sized avatar

Block or report lisongquan95

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

The official implementation of Achieving Cross Modal Generalization with Multimodal Unified Representation (NeurIPS '23)

Python 194 5 Updated Sep 30, 2024

[CVPR 2024] ViT-Lens: Towards Omni-modal Representations

Python 164 10 Updated Jul 2, 2024

💻 🤖 A summary on our attempts at using Deep Learning approaches for Emotional Text to Speech 🔈

Jupyter Notebook 429 44 Updated Jun 26, 2024

Code for ALBEF: a new vision-language pre-training method

Python 1,564 198 Updated Sep 20, 2022

[ICLR 23 oral] The Modality Focusing Hypothesis: Towards Understanding Crossmodal Knowledge Distillation

Python 39 2 Updated Jul 10, 2023

深度学习经典、新论文逐段精读

27,142 2,448 Updated Nov 17, 2024

A curated list of Multimodal Related Research.

Python 1,315 150 Updated Aug 5, 2023

CVPR 2024 论文和开源项目合集

18,302 2,593 Updated Jul 4, 2024

VCED 可以通过你的文字描述来自动识别视频中相符合的片段进行视频剪辑。该项目基于跨模态搜索与向量检索技术搭建,通过前后端分离的模式,帮助你快速的接触新一代搜索技术。

Python 344 57 Updated Jan 11, 2024

A GPT-4 AI Tutor Prompt for customizable personalized learning experiences.

28,916 3,324 Updated Mar 25, 2024

Awesome List of Attention Modules and Plug&Play Modules in Computer Vision

Python 1,099 162 Updated May 11, 2023

📜 A Novel Facial Emotion Recognition Model Using Segmentation VGG-19 Architecture

Python 16 2 Updated May 23, 2023
86 Updated Jan 25, 2024
Python 97 17 Updated Aug 16, 2024

MELD: A Multimodal Multi-Party Dataset for Emotion Recognition in Conversation

Python 820 204 Updated Mar 10, 2024

Deployed a facial emotion recognition using neural network model which predicts the emotion from faces in images, videos and live feed from webcam.

Jupyter Notebook 11 7 Updated May 2, 2021

Efficient face emotion recognition in photos and videos

Jupyter Notebook 689 127 Updated Jul 19, 2024

Real-Time facial emotion recognition in Python

Jupyter Notebook 16 8 Updated Apr 15, 2019

Building an efficient music recommendation system which determines the emotion of user using Facial Recognition techniques.

Python 11 4 Updated Jul 23, 2021

Computer Vision module for detecting emotion, age and gender of a person in any given image, video or real time webcam. A custom VGG16 model was developed and trained on open source facial datasets…

Jupyter Notebook 113 36 Updated Jan 17, 2024

[NeurIPS 2021] Multiscale Benchmarks for Multimodal Representation Learning

HTML 492 71 Updated Jan 27, 2024

Real-time face detection and emotion/gender classification using fer2013/imdb datasets with a keras CNN model and openCV.

Python 5,606 1,595 Updated Mar 8, 2024

Real time emotion recognition

Python 1,080 363 Updated Aug 30, 2024

A real time Multimodal Emotion Recognition web app for text, sound and video inputs

Jupyter Notebook 888 290 Updated Apr 29, 2021

OpenFace – a state-of-the art tool intended for facial landmark detection, head pose estimation, facial action unit recognition, and eye-gaze estimation.

MATLAB 6,959 1,854 Updated Jun 1, 2024

Official implementation of OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on

Python 5,604 813 Updated May 13, 2024

课堂专注度及考试作弊系统、课堂动态点名。情绪识别、表情识别、姿态识别和人脸识别结合

Python 385 80 Updated Apr 5, 2024

About Code release for "FECAM: Frequency Enhanced Channel Attention Mechanism for Time Series Forecasting" ⌚

Jupyter Notebook 115 17 Updated Sep 9, 2023

全网最全Stable Diffusion全套教程,从入门到进阶,耗时三个月制作

1,378 126 Updated May 26, 2023
Next