Neural-MCRL: Neural Multimodal Contrastive Representation Learning for EEG-based Visual Decoding

doi:10.48550/arXiv.2412.17337

Neural-MCRL: Neural Multimodal Contrastive Representation Learning for EEG-based Visual Decoding

Decoding neural visual representations from electroencephalogram (EEG)-based brain activity is crucial for advancing brain-machine interfaces (BMI) and has transformative potential for neural sensory rehabilitation. While multimodal contrastive representation learning (MCRL) has shown promise in neural decoding, existing methods often overlook semantic consistency and completeness within modalities and lack effective semantic alignment across modalities. This limits their ability to capture the complex representations of visual neural responses. We propose Neural-MCRL, a novel framework that achieves multimodal alignment through semantic bridging and cross-attention mechanisms, while ensuring completeness within modalities and consistency across modalities. Our framework also features the Neural Encoder with Spectral-Temporal Adaptation (NESTA), a EEG encoder that adaptively captures spectral patterns and learns subject-specific transformations. Experimental results demonstrate significant improvements in visual decoding accuracy and model generalization compared to state-of-the-art methods, advancing the field of EEG-based neural visual representation decoding in BMI. Codes will be available at: https://github.com/NZWANG/Neural-MCRL.

Publication:

arXiv e-prints

Pub Date:

December 2024

DOI:

10.48550/arXiv.2412.17337

arXiv:

arXiv:2412.17337

Bibcode:

2024arXiv241217337L

Keywords:

Computer Science - Computer Vision and Pattern Recognition

ADS

Neural-MCRL: Neural Multimodal Contrastive Representation Learning for EEG-based Visual Decoding

Abstract