Seeing Beyond the Brain: Masked Modeling Conditioned Diffusion Model for Human Vision Decoding

Author 1, Author 2, ..., Author N

[Paper], [Bibtex], [Supplementary], Code

Overview

Abstract

Decoding visual stimuli from brain recordings aims to deepen our understanding of the human visual system and build a solid foundation for bridging human and computer vision through the Brain-Computer Interface. However, due to the scarcity of data annotations and the complexity of underlying brain information, it is challenging to decode images with faithful details and meaningful semantics. In this work, we present MinD-Vis: Sparse Masked Brain Modeling with Double-Conditioned Latent Diffusion Model for Human Vision Decoding. Specifically, by boosting the information capacity of feature representations learned from a large-scale resting-state fMRI dataset, we show that our MinD-Vis can reconstruct highly plausible images with semantically matching details from brain recordings with very few paired annotations. We benchmarked our model qualitatively and quantitatively; the experimental results indicate that our method outperformed state-of-the-art in both semantic mapping (100-way semantic classification) and generation quality (FID) by 66% and 41% respectively. Exhaustive ablation studies are conducted to analyze our framework.

Name		Name	Last commit message	Last commit date
Latest commit History 10 Commits
figures		figures
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Seeing Beyond the Brain: Masked Modeling Conditioned Diffusion Model for Human Vision Decoding

Overview

Abstract

MinD-Vis Framework

About

Releases

Contributors 3

Languages

License

zjc062/mind-vis

Folders and files

Latest commit

History

Repository files navigation

Seeing Beyond the Brain: Masked Modeling Conditioned Diffusion Model for Human Vision Decoding

Overview

Abstract

MinD-Vis Framework

About

Resources

License

Stars

Watchers

Forks

Releases

Contributors 3

Languages