└── README.md /README.md: -------------------------------------------------------------------------------- 1 | # Mamba-in-Computer-Vision 2 | 3 | Mamba-in-Vision[![Awesome](https://cdn.rawgit.com/sindresorhus/awesome/d7305f38d29fed78fa85652e3a63e154dd8e8829/media/badge.svg)](https://github.com/sindresorhus/awesome) 4 | 5 | A paper list of recent Mamba-based CV works. If you find a paper we missed, please open an issue or pull request. 6 | 7 | **Last updated: 2025/11/10** 8 | 9 | ## Mamba 10 | - (arXiv 2023.12) Mamba: Linear-Time Sequence Modeling with Selective State Spaces, [[Paper]](https://arxiv.org/pdf/2312.00752.pdf), [[Code]](https://github.com/state-spaces/mamba) 11 | 12 | ## Survey 13 | - (arXiv 2025.05) Vision Mamba in Remote Sensing: A Comprehensive Survey of Techniques, Applications and Outlook, [[Paper]](https://arxiv.org/pdf/2505.00630.pdf), [[Project]](https://github.com/BaoBao0926/Awesome-Mamba-in-Remote-Sensing) 14 | - (arXiv 2024.04) Mamba-360: Survey of State Space Models as Transformer Alternative for Long Sequence Modelling: Methods, Applications, and Challenges, [[Paper]](https://arxiv.org/pdf/2404.16112.pdf), [[Project]](https://github.com/badripatro/mamba360) 15 | - (arXiv 2024.04) A Survey on Visual Mamba, [[Paper]](https://arxiv.org/pdf/2404.15956.pdf) 16 | - (arXiv 2024.04) State Space Model for New-Generation Network Alternative to Transformers: A Survey, [[Paper]](https://arxiv.org/pdf/2404.09516.pdf), [[Project]](https://github.com/Event-AHU/Mamba_State_Space_Model_Paper_List) 17 | - (arXiv 2024.05) A Survey on Vision Mamba: Models, Applications and Challenges, [[Paper]](https://arxiv.org/pdf/2404.18861.pdf), [[Project]](https://github.com/Ruixxxx/Awesome-Vision-Mamba-Models) 18 | - (arXiv 2024.05) Vision Mamba: A Comprehensive Survey and Taxonomy, [[Paper]](https://arxiv.org/pdf/2405.04404.pdf), [[Project]](https://github.com/lx6c78/Vision-Mamba-A-Comprehensive-Survey-and-Taxonomy) 19 | - (arXiv 2024.10) A Comprehensive Survey of 
Mamba Architectures for Medical Image Analysis: Classification, Segmentation, Restoration and Beyond, [[Paper]](https://arxiv.org/pdf/2410.02362),[[Code]](https://github.com/Madhavaprasath23/Awesome-Mamba-Papers-On-Medical-Domain) 20 | - (arXiv 2024.10) Mamba in Vision: A Comprehensive Survey of Techniques and Applications, [[Paper]](https://arxiv.org/pdf/2410.03105),[[Code]](https://github.com/maklachur/Mamba-in-Computer-Vision) 21 | - (arXiv 2025.02) A Survey on Mamba Architecture for Vision Applications, [[Paper]](https://arxiv.org/pdf/2502.07161) 22 | 23 | ## Recent Papers 24 | ### Action 25 | - (arXiv 2024.03) HARMamba: Efficient Wearable Sensor Human Activity Recognition Based on Bidirectional Selective SSM, [[Paper]](https://arxiv.org/pdf/2403.20183.pdf) 26 | - (arXiv 2024.04) Simba: Mamba augmented U-ShiftGCN for Skeletal Action Recognition in Videos, [[Paper]](https://arxiv.org/pdf/2404.07645.pdf) 27 | - (arXiv 2024.09) Mamba Fusion: Learning Actions Through Questioning, [[Paper]](https://arxiv.org/pdf/2409.11513.pdf), [[Code]](https://github.com/Dongzhikang/MambaVL) 28 | - (arXiv 2024.10) SpikMamba: When SNN meets Mamba in Event-based Human Action Recognition, [[Paper]](https://arxiv.org/pdf/2410.16746.pdf), [[Code]](https://github.com/Typistchen/SpikMamba) 29 | - (arXiv 2024.12) Manta: Enhancing Mamba for Few-Shot Action Recognition of Long Sub-Sequence, [[Paper]](https://arxiv.org/pdf/2412.07481.pdf), [[Code]](https://github.com/wenbohuang1002/Manta) 30 | - (arXiv 2025.01) MS-Temba: Multi-Scale Temporal Mamba for Efficient Temporal Action Detection, [[Paper]](https://arxiv.org/pdf/2501.06138.pdf), [[Code]](https://github.com/thearkaprava/MS-Temba) 31 | - (arXiv 2025.01) MV-GMN: State Space Model for Multi-View Action Recognition, [[Paper]](https://arxiv.org/pdf/2501.13829) 32 | - (arXiv 2025.04) RadMamba: Efficient Human Activity Recognition through Radar-based Micro-Doppler-Oriented Mamba State-Space Model, [[Paper]](https://arxiv.org/pdf/2504.12039), 
[[Code]](https://github.com/lab-emi/AIRHAR) 33 | - (arXiv 2025.10) RefAtomNet++: Advancing Referring Atomic Video Action Recognition using Semantic Retrieval based Multi-Trajectory Mamba, [[Paper]](https://arxiv.org/pdf/2510.16444), [[Code]](https://github.com/KPeng9510/refAVA2) 34 | 35 | ### Adversarial Attack 36 | - (arXiv 2024.03) Understanding Robustness of Visual State Space Models for Image Classification, [[Paper]](https://arxiv.org/pdf/2403.10935.pdf) 37 | - (arXiv 2024.08) Exploring Robustness of Visual State Space model against Backdoor Attacks, [[Paper]](https://arxiv.org/pdf/2408.11679.pdf) 38 | 39 | ### Anomaly Detection 40 | - (arXiv 2024.04) MambaAD: Exploring State Space Models for Multi-class Unsupervised Anomaly Detection, [[Paper]](https://arxiv.org/pdf/2404.06564.pdf), [[Code]](https://github.com/lewandofskee/MambaAD) 41 | - (arXiv 2024.07) ALMRR: Anomaly Localization Mamba on Industrial Textured Surface with Feature Reconstruction and Refinement, [[Paper]](https://arxiv.org/pdf/2407.17705.pdf), [[Code]](https://github.com/qsc1103/ALMRR) 42 | - (arXiv 2025.01) STNMamba: Mamba-based Spatial-Temporal Normality Learning for Video Anomaly Detection, [[Paper]](https://arxiv.org/pdf/2412.20084.pdf) 43 | - (arXiv 2025.03) VADMamba: Exploring State Space Models for Fast Video Anomaly Detection, [[Paper]](https://arxiv.org/pdf/2503.21169.pdf), [[Code]](https://github.com/jLooo/VADMamba) 44 | - (arXiv 2025.04) Pyramid-based Mamba Multi-class Unsupervised Anomaly Detection, [[Paper]](https://arxiv.org/pdf/2504.03442.pdf), [[Code]](https://github.com/iqbalmlpuniud/Pyramid_Mamba) 45 | - (arXiv 2025.04) ACMamba: Fast Unsupervised Anomaly Detection via An Asymmetrical Consensus State Space Model, [[Paper]](https://arxiv.org/pdf/2504.11781.pdf) 46 | - (arXiv 2025.08) Self-Navigated Residual Mamba for Universal Industrial Anomaly Detection, [[Paper]](https://arxiv.org/pdf/2508.01591.pdf), [[Code]](https://github.com/BeJane/SNARM) 47 | 48 | ### Assessment 49 | - 
(arXiv 2024.06) Q-Mamba: On First Exploration of Vision Mamba for Image Quality Assessment, [[Paper]](https://arxiv.org/pdf/2406.09546.pdf), [[Code]](https://github.com/kenomo/ilid) 50 | - (arXiv 2025.04) MVQA: Mamba with Unified Sampling for Efficient Video Quality Assessment, [[Paper]](https://arxiv.org/pdf/2504.16003.pdf) 51 | 52 | ### Autonomous Driving 53 | - (arXiv 2024.05) DriveWorld: 4D Pre-trained Scene Understanding via World Models for Autonomous Driving, [[Paper]](https://arxiv.org/pdf/2405.04390.pdf) 54 | - (arXiv 2024.08) MambaOcc: Visual State Space Model for BEV-based Occupancy Prediction with Local Adaptive Reordering, [[Paper]](https://arxiv.org/pdf/2408.11464.pdf), [[Code]](https://github.com/Hub-Tian/MambaOcc) 55 | - (arXiv 2024.08) OccMamba: Semantic Occupancy Prediction with State Space Models, [[Paper]](https://arxiv.org/pdf/2408.09859.pdf) 56 | - (arXiv 2024.08) MambaLoc: Efficient Camera Localisation via State Space Model, [[Paper]](https://arxiv.org/pdf/2408.09680.pdf) 57 | - (arXiv 2024.09) DSDFormer: An Innovative Transformer-Mamba Framework for Robust High-Precision Driver Distraction Identification, [[Paper]](https://arxiv.org/pdf/2409.05587.pdf) 58 | - (arXiv 2024.09) CollaMamba: Efficient Collaborative Perception with Cross-Agent Spatial-Temporal State Space Model, [[Paper]](https://arxiv.org/pdf/2409.07714.pdf) 59 | - (arXiv 2024.09) CoMamba: Real-time Cooperative Perception Unlocked with State Space Models, [[Paper]](https://arxiv.org/pdf/2409.10699.pdf) 60 | - (arXiv 2025.01) H-MBA: Hierarchical MamBa Adaptation for Multi-Modal Video Understanding in Autonomous Driving, [[Paper]](https://arxiv.org/pdf/2501.04302.pdf) 61 | - (arXiv 2025.03) Trajectory Mamba: Efficient Attention-Mamba Forecasting Model Based on Selective SSM, [[Paper]](https://arxiv.org/pdf/2503.10898.pdf) 62 | - (arXiv 2025.03) MamBEV: Enabling State Space Models to Learn Birds-Eye-View Representations, [[Paper]](https://arxiv.org/pdf/2503.13858.pdf), 
[[Code]](https://github.com/amai-gsu/MamBEV/tree/main) 63 | - (arXiv 2025.07) MambaMap: Online Vectorized HD Map Construction using State Space Model, [[Paper]](https://arxiv.org/pdf/2507.20224.pdf), [[Code]](https://github.com/ZiziAmy/MambaMap) 64 | - (arXiv 2025.07) MamV2XCalib: V2X-based Target-less Infrastructure Camera Calibration with State Space Model, [[Paper]](https://arxiv.org/pdf/2507.23595.pdf), [[Code]](https://github.com/zhuyaoye/MamV2XCalib) 65 | - (arXiv 2025.09) PRISM: Progressive Rain removal with Integrated State-space Modeling, [[Paper]](https://arxiv.org/pdf/2509.26413.pdf) 66 | ### Camouflaged 67 | - (arXiv 2025.07) Mamba-based Efficient Spatio-Frequency Motion Perception for Video Camouflaged Object Detection, [[Paper]](https://arxiv.org/pdf/2507.23601.pdf) 68 | 69 | ### Classification (Backbone) 70 | - (arXiv 2024.01) Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model, [[Paper]](https://arxiv.org/pdf/2401.09417.pdf), [[Code]](https://github.com/hustvl/Vim) 71 | - (arXiv 2024.01) VMamba: Visual State Space Model, [[Paper]](https://arxiv.org/pdf/2401.10166.pdf), [[Code]](https://github.com/MzeroMiko/VMamba) 72 | - (arXiv 2024.02) Swin-UMamba: Mamba-based UNet with ImageNet-based pretraining, [[Paper]](https://arxiv.org/pdf/2402.03302.pdf), [[Code]](https://github.com/JiarunLiu/Swin-UMamba) 73 | - (arXiv 2024.02) Res-VMamba: Fine-Grained Food Category Visual Classification Using Selective State Space Models with Deep Residual Learning, [[Paper]](https://arxiv.org/pdf/2402.15761.pdf),[[Code]](https://github.com/ChiShengChen/ResVMamba) 74 | - (arXiv 2024.02) Mamba-ND: Selective State Space Modeling for Multi-Dimensional Data, [[Paper]](https://arxiv.org/pdf/2402.05892.pdf) 75 | - (arXiv 2024.03) LocalMamba: Visual State Space Model with Windowed Selective Scan, [[Paper]](https://arxiv.org/pdf/2403.09338.pdf), [[Code]](https://github.com/hunto/LocalMamba) 76 | - (arXiv 2024.03) EfficientVMamba: 
Atrous Selective Scan for Light Weight Visual Mamba, [[Paper]](https://arxiv.org/pdf/2403.09977.pdf), [[Code]](https://github.com/TerryPei/EfficientVMamba) 77 | - (arXiv 2024.03) On the low-shot transferability of [V]-Mamba, [[Paper]](https://arxiv.org/pdf/2403.10696.pdf) 78 | - (arXiv 2024.03) SiMBA: Simplified Mamba-Based Architecture for Vision and Multivariate Time series, [[Paper]](https://arxiv.org/pdf/2403.15360.pdf), [[Code]](https://github.com/badripatro/Simba) 79 | - (arXiv 2024.03) PlainMamba: Improving Non-Hierarchical Mamba in Visual Recognition, [[Paper]](https://arxiv.org/pdf/2403.17695.pdf),[[Code]](https://github.com/ChenhongyiYang/PlainMamba) 80 | - (arXiv 2024.03) MambaMixer: Efficient Selective State Space Models with Dual Token and Channel Selection, [[Paper]](https://arxiv.org/pdf/2403.19888.pdf),[[Code]](https://github.com/MambaMixer/M2) 81 | - (arXiv 2024.05) Multi-Scale VMamba: Hierarchy in Hierarchy Visual State Space Model, [[Paper]](https://arxiv.org/pdf/2405.14174.pdf),[[Code]](https://github.com/YuHengsss/MSVMamba) 82 | - (arXiv 2024.05) Scalable Visual State Space Model with Fractal Scanning, [[Paper]](https://arxiv.org/pdf/2405.14480.pdf) 83 | - (arXiv 2024.05) Mamba-R: Vision Mamba ALSO Needs Registers, [[Paper]](https://arxiv.org/pdf/2405.14858.pdf) 84 | - (arXiv 2024.05) Demystify Mamba in Vision: A Linear Attention Perspective, [[Paper]](https://arxiv.org/pdf/2405.16605.pdf),[[Code]](https://github.com/LeapLabTHU/MLLA) 85 | - (arXiv 2024.05) Vim-F: Visual State Space Model Benefiting from Learning in the Frequency Domain, [[Paper]](https://arxiv.org/pdf/2405.18679.pdf),[[Code]](https://github.com/yws-wxs/Vim-F) 86 | - (arXiv 2024.06) Autoregressive Pretraining with Mamba in Vision, [[Paper]](https://arxiv.org/pdf/2406.07537.pdf),[[Code]](https://github.com/OliverRensu/ARM) 87 | - (arXiv 2024.06) Towards Evaluating the Robustness of Visual State Space Models, 
[[Paper]](https://arxiv.org/pdf/2406.09407.pdf),[[Code]](https://github.com/HashmatShadab/MambaRobustness) 88 | - (arXiv 2024.06) MambaVision: A Hybrid Mamba-Transformer Vision Backbone, [[Paper]](https://arxiv.org/pdf/2407.08083v1),[[Code]](https://github.com/NVlabs/MambaVision) 89 | - (arXiv 2024.07) GroupMamba: Parameter-Efficient and Accurate Group Visual State Space Model, [[Paper]](https://arxiv.org/pdf/2407.13772),[[Code]](https://github.com/Amshaker/GroupMamba) 90 | - (arXiv 2024.09) Stochastic Layer-Wise Shuffle: A Good Practice to Improve Vision Mamba Training, [[Paper]](https://arxiv.org/pdf/2408.17081),[[Code]](https://github.com/huangzizheng01/ShuffleMamba) 91 | - (arXiv 2024.09) SparX: A Sparse Cross-Layer Connection Mechanism for Hierarchical Vision Mamba and Transformer Networks, [[Paper]](https://arxiv.org/pdf/2409.09649),[[Code]](https://github.com/LMMMEng/SparX) 92 | - (arXiv 2024.09) Famba-V: Fast Vision Mamba with Cross-Layer Token Fusion, [[Paper]](https://arxiv.org/pdf/2409.09808) 93 | - (arXiv 2024.09) Distillation-free Scaling of Large SSMs for Images and Videos, [[Paper]](https://arxiv.org/pdf/2409.11867) 94 | - (arXiv 2024.09) Exploring Token Pruning in Vision State Space Models, [[Paper]](https://arxiv.org/pdf/2409.18962) 95 | - (arXiv 2024.10) MAP: Unleashing Hybrid Mamba-Transformer Vision Backbone's Potential with Masked Autoregressive Pretraining, [[Paper]](https://arxiv.org/pdf/2410.00871.pdf) 96 | - (arXiv 2024.10) GlobalMamba: Global Image Serialization for Vision Mamba, [[Paper]](https://arxiv.org/pdf/2410.10316.pdf),[[Code]](https://github.com/wangck20/GlobalMamba) 97 | - (arXiv 2024.10) START: A Generalized State Space Model with Saliency-Driven Token-Aware Transformation, [[Paper]](https://arxiv.org/pdf/2410.16020.pdf),[[Code]](https://github.com/lingeringlight/START) 98 | - (arXiv 2024.10) Spatial-Mamba: Effective Visual State Space Models via Structure-Aware State Fusion, 
[[Paper]](https://arxiv.org/pdf/2410.15091.pdf),[[Code]](https://github.com/EdwardChasel/Spatial-Mamba) 99 | - (arXiv 2024.11) MambaPEFT: Exploring Parameter-Efficient Fine-Tuning for Mamba, [[Paper]](https://arxiv.org/pdf/2411.03855.pdf) 100 | - (arXiv 2024.11) MobileMamba: Lightweight Multi-Receptive Visual Mamba Network, [[Paper]](https://arxiv.org/pdf/2411.15941.pdf),[[Code]](https://github.com/lewandofskee/MobileMamba) 101 | - (arXiv 2024.11) TinyViM: Frequency Decoupling for Tiny Hybrid Vision Mamba, [[Paper]](https://arxiv.org/pdf/2411.17473.pdf),[[Code]](https://github.com/xwmaxwma/TinyViM) 102 | - (arXiv 2024.12) Faster Vision Mamba is Rebuilt in Minutes via Merged Token Re-training, [[Paper]](https://arxiv.org/pdf/2412.12496.pdf),[[Code]](https://github.com/NUS-HPC-AI-Lab/R-MeeTo) 103 | - (arXiv 2024.12) Mamba2D: A Natively Multi-Dimensional State-Space Model for Vision Tasks, [[Paper]](https://arxiv.org/pdf/2412.16146.pdf) 104 | - (arXiv 2024.12) Selective Visual Prompting in Vision Mamba, [[Paper]](https://arxiv.org/pdf/2412.08947.pdf),[[Code]](https://github.com/zhoujiahuan1991/AAAI2025-SVP) 106 | - (arXiv 2025.01) A Separable Self-attention Inspired by the State Space Model for Computer Vision, [[Paper]](https://arxiv.org/pdf/2501.02040.pdf),[[Code]](https://github.com/yws-wxs/VMINet) 107 | - (arXiv 2025.02) Fast Vision Mamba: Pooling Spatial Dimensions for Accelerated Processing, [[Paper]](https://arxiv.org/pdf/2502.00594.pdf),[[Code]](https://github.com/insitro/FastVim) 108 | - (arXiv 2025.02) DAMamba: Vision State Space Model with Dynamic Adaptive Scan, [[Paper]](https://arxiv.org/pdf/2502.12627.pdf),[[Code]](https://github.com/ltzovo/DAMamba) 109 | - (arXiv 2025.03) Spectral State Space Model for Rotation-Invariant Visual Representation Learning, 
[[Paper]](https://arxiv.org/pdf/2503.06369.pdf),[[Code]](https://github.com/Sahardastani/equivariant-vmamba) 110 | - (arXiv 2025.03) vGamba: Attentive State Space Bottleneck for efficient Long-range Dependencies in Visual Recognition, [[Paper]](https://arxiv.org/pdf/2503.21262.pdf) 111 | - (arXiv 2025.04) DefMamba: Deformable Visual State Space Model, [[Paper]](https://arxiv.org/pdf/2504.05794.pdf),[[Code]](https://github.com/leiyeliu/DefMamba) 112 | - (arXiv 2025.04) Dynamic Vision Mamba, [[Paper]](https://arxiv.org/pdf/2504.04787.pdf),[[Code]](https://github.com/NUS-HPC-AI-Lab/DyVM) 113 | - (arXiv 2025.06) InceptionMamba: An Efficient Hybrid Network with Large Band Convolution and Bottleneck Mamba, [[Paper]](https://arxiv.org/pdf/2506.08735.pdf),[[Code]](https://github.com/Wake1021/InceptionMamba) 114 | - (arXiv 2025.07) QuarterMap: Efficient Post-Training Token Pruning for Visual State Space Models, [[Paper]](https://arxiv.org/pdf/2507.09514.pdf) 115 | - (arXiv 2025.07) Training-free Token Reduction for Vision Mamba, [[Paper]](https://arxiv.org/pdf/2507.14042.pdf) 116 | - (arXiv 2025.07) A2Mamba: Attention-augmented State Space Models for Visual Recognition, [[Paper]](https://arxiv.org/pdf/2507.16624.pdf),[[Code]](https://github.com/LMMMEng/A2Mamba) 117 | - (arXiv 2025.09) VCMamba: Bridging Convolutions with Multi-Directional Mamba for Efficient Visual Representation, [[Paper]](https://arxiv.org/pdf/2509.04669.pdf),[[Code]](https://github.com/Wertyuui345/VCMamba) 118 | 119 | ### Clustering 120 | - (arXiv 2024.12) Trusted Mamba Contrastive Network for Multi-View Clustering, [[Paper]](https://arxiv.org/pdf/2412.16487.pdf) 121 | 122 | ### Completion 123 | - (arXiv 2025.01) Skip Mamba Diffusion for Monocular 3D Semantic Scene Completion, [[Paper]](https://arxiv.org/pdf/2501.07260.pdf),[[Code]](https://github.com/xrkong/skimba) 124 | - (arXiv 2025.03) Global-Aware Monocular Semantic Scene Completion with State Space Models, 
[[Paper]](https://arxiv.org/pdf/2503.06569.pdf) 125 | 126 | ### Compression 127 | - (arXiv 2024.05) MambaVC: Learned Visual Compression with Selective State Spaces, [[Paper]](https://arxiv.org/pdf/2405.15413.pdf) 128 | - (arXiv 2024.10) MambaSCI: Efficient Mamba-UNet for Quad-Bayer Patterned Video Snapshot Compressive Imaging, [[Paper]](https://arxiv.org/pdf/2405.15413.pdf) 129 | - (arXiv 2025.02) S2CFormer: Reorienting Learned Image Compression from Spatial Interaction to Channel Aggregation, [[Paper]](https://arxiv.org/pdf/2502.00700.pdf) 130 | - (arXiv 2025.02) CMamba: Learned Image Compression with State Space Models, [[Paper]](https://arxiv.org/pdf/2502.04988.pdf) 131 | - (arXiv 2025.03) MambaIC: State Space Models for High-Performance Learned Image Compression, [[Paper]](https://arxiv.org/pdf/2503.12461.pdf),[[Code]](https://github.com/AuroraZengfh/MambaIC) 132 | - (arXiv 2025.06) MambaMia: A State-Space-Model-Based Compression for Efficient Video Understanding in Large Multimodal Models, [[Paper]](https://arxiv.org/pdf/2506.13564.pdf) 133 | 134 | ### Crowd Counting 135 | - (arXiv 2024.05) VMambaCC: A Visual State Space Model for Crowd Counting, [[Paper]](https://arxiv.org/pdf/2405.03978.pdf) 136 | - (arXiv 2025.01) Mamba-MOC: A Multicategory Remote Object Counting via State Space Model, [[Paper]](https://arxiv.org/pdf/2501.06697.pdf),[[Code]](https://github.com/lp-094/Mamba-MOC) 137 | 138 | ### Deblurring 139 | - (arXiv 2024.03) Aggregating Local and Global Features via Selective State Spaces Model for Efficient Image Deblurring, [[Paper]](https://arxiv.org/pdf/2403.20106.pdf),[[Code]](https://github.com/MambaMixer/M2) 140 | - (arXiv 2024.05) Efficient Visual State Space Model for Image Deblurring, [[Paper]](https://arxiv.org/pdf/2405.14343.pdf) 141 | - (arXiv 2024.12) XYScanNet: An Interpretable State Space Model for Perceptual Image Deblurring, [[Paper]](https://arxiv.org/pdf/2412.10338.pdf) 142 | - (arXiv 2025.08) MBMamba: When Memory Buffer Meets Mamba 
for Structure-Aware Image Deblurring, [[Paper]](https://arxiv.org/pdf/2412.10338.pdf) 143 | 144 | ### Dehazing 145 | - (arXiv 2024.02) U-shaped Vision Mamba for Single Image Dehazing, [[Paper]](https://arxiv.org/pdf/2402.04139.pdf),[[Code]](https://github.com/zzr-idam) 146 | - (arXiv 2024.05) RSDehamba: Lightweight Vision Mamba for Remote Sensing Satellite Image Dehazing, [[Paper]](https://arxiv.org/pdf/2405.10030.pdf) 147 | - (arXiv 2025.05) WDMamba: When Wavelet Degradation Prior Meets Vision Mamba for Image Dehazing, [[Paper]](https://arxiv.org/pdf/2505.04369.pdf),[[Code]](https://github.com/SunJ000/WDMamba) 148 | 149 | ### Demosaicing 150 | - (arXiv 2025.03) Binarized Mamba-Transformer for Lightweight Quad Bayer HybridEVS Demosaicing, [[Paper]](https://arxiv.org/pdf/2503.16134.pdf),[[Code]](https://github.com/Clausy9/BMTNet) 151 | 152 | ### Depth 153 | - (arXiv 2024.06) MambaDepth: Enhancing Long-range Dependency for Self-Supervised Fine-Structured Monocular Depth Estimation, [[Paper]](https://arxiv.org/pdf/2406.04532.pdf),[[Code]](https://github.com/ionut-grigore99/MambaDepth) 154 | - (arXiv 2025.01) DepthMamba with Adaptive Fusion, [[Paper]](https://arxiv.org/pdf/2412.19964.pdf) 155 | - (arXiv 2025.05) LMDepth: Lightweight Mamba-based Monocular Depth Estimation for Real-World Deployment, [[Paper]](https://arxiv.org/pdf/2505.00980) 156 | - (arXiv 2025.07) Tree-Mamba: A Tree-Aware Mamba for Underwater Monocular Depth Estimation, [[Paper]](https://arxiv.org/pdf/2507.07687),[[Code]](https://github.com/WYJGR/Tree-Mamba) 157 | 158 | ### Deraining 159 | - (arXiv 2024.04) FreqMamba: Viewing Mamba from a Frequency Perspective for Image Deraining, [[Paper]](https://arxiv.org/pdf/2404.09476.pdf) 160 | - (arXiv 2024.05) Image Deraining with Frequency-Enhanced State Space Model, [[Paper]](https://arxiv.org/pdf/2405.16470.pdf) 161 | - (arXiv 2024.08) RainMamba: Enhanced Locality Learning with State Space Models for Video Deraining, 
[[Paper]](https://arxiv.org/pdf/2407.21773.pdf),[[Code]](https://github.com/TonyHongtaoWu/RainMamba) 162 | - (arXiv 2024.09) A Hybrid Transformer-Mamba Network for Single Image Deraining, [[Paper]](https://arxiv.org/pdf/2409.01148.pdf) 163 | 164 | ### Detection 165 | - (arXiv 2024.03) MiM-ISTD: Mamba-in-Mamba for Efficient Infrared Small Target Detection, [[Paper]](https://arxiv.org/pdf/2403.02148.pdf),[[Code]](https://github.com/txchen-USTC/MiM-ISTD) 166 | - (arXiv 2024.04) Fusion-Mamba for Cross-modality Object Detection, [[Paper]](https://arxiv.org/pdf/2404.09146.pdf) 167 | - (arXiv 2024.04) CFMW: Cross-modality Fusion Mamba for Multispectral Object Detection under Adverse Weather Conditions, [[Paper]](https://arxiv.org/pdf/2404.16302.pdf),[[Code]](https://github.com/lhy-zjut/CFMW) 168 | - (arXiv 2024.05) SOAR: Advancements in Small Body Object Detection for Aerial Imagery Using State Space Models and Programmable Gradients, [[Paper]](https://arxiv.org/pdf/2405.01699.pdf),[[Code]](https://github.com/yash2629/S.O.A.R) 169 | - (arXiv 2024.06) Mamba YOLO: SSMs-Based YOLO For Object Detection, [[Paper]](https://arxiv.org/pdf/2406.05835),[[Code]](https://github.com/HZAI-ZJNU/Mamba-YOLO) 170 | - (arXiv 2024.08) MonoMM: A Multi-scale Mamba-Enhanced Network for Real-time Monocular 3D Object Detection, [[Paper]](https://arxiv.org/pdf/2408.00438) 171 | - (arXiv 2024.08) MambaST: A Plug-and-Play Cross-Spectral Spatial-Temporal Fuser for Efficient Pedestrian Detection, [[Paper]](https://arxiv.org/pdf/2408.01037),[[Code]](https://github.com/XiangboGaoBarry/MambaST) 172 | - (arXiv 2024.10) Mamba Capsule Routing Towards Part-Whole Relational Camouflaged Object Detection, [[Paper]](https://arxiv.org/pdf/2410.03987),[[Code]](https://github.com/Liangbo-Cheng/mamba_capsule) 173 | - (arXiv 2024.10) HRVMamba: High-Resolution Visual State Space Model for Dense Prediction, [[Paper]](https://arxiv.org/pdf/2410.03174),[[Code]](https://github.com/zhanghao5201/HRVMamba) 174 | - (arXiv 
2024.10) MambaBEV: An efficient 3D detection model with Mamba2, [[Paper]](https://arxiv.org/pdf/2410.12673) 175 | - (arXiv 2024.11) MambaDETR: Query-based Temporal Modeling using State Space Model for Multi-View 3D Object Detection, [[Paper]](https://arxiv.org/pdf/2411.13628) 176 | - (arXiv 2024.12) MSCrackMamba: Leveraging Vision Mamba for Crack Detection in Fused Multispectral Imagery, [[Paper]](https://arxiv.org/pdf/2412.06211) 177 | - (arXiv 2024.12) COMO: Cross-Mamba Interaction and Offset-Guided Fusion for Multimodal Object Detection, [[Paper]](https://arxiv.org/pdf/2412.18076),[[Code]](https://github.com/luluyuu/COMO) 178 | - (arXiv 2025.01) SMamba: Sparse Mamba for Event-based Object Detection, [[Paper]](https://arxiv.org/pdf/2501.11971),[[Code]](https://github.com/Zizzzzzzz/SMamba) 179 | - (arXiv 2025.02) DAViMNet: SSMs-Based Domain Adaptive Object Detection, [[Paper]](https://arxiv.org/pdf/2502.11178),[[Code]](https://github.com/enesdoruk/DAVimNet) 180 | - (arXiv 2025.03) State Space Model Meets Transformer: A New Paradigm for 3D Object Detection, [[Paper]](https://arxiv.org/pdf/2503.14493),[[Code]](https://github.com/OpenSpaceAI/DEST3D) 181 | - (arXiv 2025.03) UniMamba: Unified Spatial-Channel Representation Learning with Group-Efficient Mamba for LiDAR-based 3D Object Detection, [[Paper]](https://arxiv.org/pdf/2503.12009),[[Code]](https://github.com/suhaisheng/UniMamba) 182 | - (arXiv 2025.03) MAMAT: 3D Mamba-Based Atmospheric Turbulence Removal and its Object Detection Capability, [[Paper]](https://arxiv.org/pdf/2503.17700) 183 | - (arXiv 2025.06) SAMamba: Adaptive State Space Modeling with Hierarchical Vision for Infrared Small Target Detection, [[Paper]](https://arxiv.org/pdf/2505.23214),[[Code]](https://github.com/zhengshuchen/SAMamba) 184 | - (arXiv 2025.06) ConMamba: Contrastive Vision Mamba for Plant Disease Detection, [[Paper]](https://arxiv.org/pdf/2506.03213) 185 | - (arXiv 2025.06) MambaNeXt-YOLO: A Hybrid State Space Model for Real-time 
Object Detection, [[Paper]](https://arxiv.org/pdf/2506.03654),[[Code]](https://github.com/zhengshuchen/SAMamba) 186 | - (arXiv 2025.07) Multispectral State-Space Feature Fusion: Bridging Shared and Cross-Parametric Interactions for Object Detection, [[Paper]](https://arxiv.org/pdf/2507.14643),[[Code]](https://github.com/61s61min/MS2Fusion) 187 | - (arXiv 2025.07) WaveMamba: Wavelet-Driven Mamba Fusion for RGB-Infrared Object Detection, [[Paper]](https://arxiv.org/pdf/2507.18173) 188 | 189 | ### Diffusion 190 | - (arXiv 2024.03) ZigMa: Zigzag Mamba Diffusion Model, [[Paper]](https://arxiv.org/pdf/2403.13802.pdf),[[Code]](https://github.com/CompVis/zigma) 191 | - (arXiv 2024.05) DiM: Diffusion Mamba for Efficient High-Resolution Image Synthesis, [[Paper]](https://arxiv.org/pdf/2405.14224.pdf),[[Code]](https://github.com/tyshiwo1/DiM-DiffusionMamba/) 192 | - (arXiv 2024.05) Scaling Diffusion Mamba with Bidirectional SSMs for Efficient Image and Video Generation, [[Paper]](https://arxiv.org/pdf/2405.15881.pdf) 193 | - (arXiv 2024.06) Dimba: Transformer-Mamba Diffusion Models, [[Paper]](https://arxiv.org/pdf/2406.01159),[[Code]](https://dimba-project.github.io/) 194 | - (arXiv 2024.08) LaMamba-Diff: Linear-Time High-Fidelity Diffusion Models Based on Local Attention and Mamba, [[Paper]](https://arxiv.org/pdf/2408.02615) 195 | - (arXiv 2024.09) Mamba Policy: Towards Efficient 3D Diffusion Policy with Hybrid Selective State Models, [[Paper]](https://arxiv.org/pdf/2409.07163),[[Code]](https://andycao1125.github.io/mamba_policy/) 196 | - (arXiv 2024.11) DiMSUM: Diffusion Mamba -- A Scalable and Unified Spatial-Frequency Method for Image Generation, [[Paper]](https://arxiv.org/pdf/2411.04168),[[Code]](https://github.com/VinAIResearch/DiMSUM) 197 | - (arXiv 2025.01) MANTA: Diffusion Mamba for Efficient and Effective Stochastic Long-Term Dense Anticipation, [[Paper]](https://arxiv.org/pdf/2501.08837) 198 | - (arXiv 2025.03) TFDM: Time-Variant Frequency-Based Point Cloud 
Diffusion with Mamba, [[Paper]](https://arxiv.org/pdf/2503.13004) 199 | - (arXiv 2025.04) U-Shape Mamba: State Space Model for faster diffusion, [[Paper]](https://arxiv.org/pdf/2504.13499) 200 | - (arXiv 2025.04) WeatherGen: A Unified Diverse Weather Generator for LiDAR Point Clouds via Spider Mamba Diffusion, [[Paper]](https://arxiv.org/pdf/2504.13561),[[Code]](https://github.com/wuyang98/weathergen) 201 | - (arXiv 2025.06) StateSpaceDiffuser: Bringing Long Context to Diffusion World Models, [[Paper]](https://arxiv.org/pdf/2505.22246) 202 | 203 | ### Domain 204 | - (arXiv 2024.04) DGMamba: Domain Generalization via Generalized State Space Model, [[Paper]](https://arxiv.org/pdf/2404.07794.pdf),[[Code]](https://github.com/longshaocong/DGMamba) 205 | 206 | ### Edge 207 | - (arXiv 2025.01) EDMB: Edge Detector with Mamba, [[Paper]](https://arxiv.org/pdf/2501.04846.pdf),[[Code]](https://github.com/Li-yachuan/EDMB) 208 | 209 | ### Enhancement 210 | - (arXiv 2024.04) MambaUIE&SR: Unraveling the Ocean's Secrets with Only 2.8 FLOPs, [[Paper]](https://arxiv.org/ftp/arxiv/papers/2404/2404.13884.pdf),[[Code]](https://github.com/1024AILab/MambaUIE) 211 | - (arXiv 2024.05) Retinexmamba: Retinex-based Mamba for Low-light Image Enhancement, [[Paper]](https://arxiv.org/abs/2405.03349.pdf),[[Code]](https://github.com/YhuoyuH/RetinexMamba) 212 | - (arXiv 2024.05) WaterMamba: Visual State Space Model for Underwater Image Enhancement, [[Paper]](https://arxiv.org/abs/2405.08419.pdf) 213 | - (arXiv 2024.05) MambaLLIE: Implicit Retinex-Aware Low Light Enhancement with Global-then-Local State Space, [[Paper]](https://arxiv.org/abs/2405.16105.pdf) 214 | - (arXiv 2024.06) LLEMamba: Low-Light Enhancement via Relighting-Guided Mamba with Deep Unfolding Network, [[Paper]](https://arxiv.org/abs/2406.01028.pdf) 215 | - (arXiv 2024.06) PixMamba: Leveraging State Space Models in a Dual-Level Architecture for Underwater Image Enhancement, 
[[Paper]](https://arxiv.org/abs/2406.08444.pdf),[[Code]](https://github.com/weitunglin/pixmamba) 216 | - (arXiv 2024.07) RESVMUNetX: A Low-Light Enhancement Network Based on VMamba, [[Paper]](https://arxiv.org/abs/2407.09553.pdf) 217 | - (arXiv 2024.08) Wave-Mamba: Wavelet State Space Model for Ultra-High-Definition Low-Light Image Enhancement, [[Paper]](https://arxiv.org/abs/2408.01276.pdf),[[Code]](https://github.com/AlexZou14/Wave-Mamba) 218 | - (arXiv 2024.08) ExpoMamba: Exploiting Frequency SSM Blocks for Efficient and Effective Image Enhancement, [[Paper]](https://arxiv.org/abs/2408.09650.pdf),[[Code]](https://github.com/eashanadhikarla/ExpoMamba) 219 | - (arXiv 2024.08) O-Mamba: O-shape State-Space Model for Underwater Image Enhancement, [[Paper]](https://arxiv.org/abs/2408.12816.pdf),[[Code]](https://github.com/chenydong/O-Mamba) 220 | - (arXiv 2024.09) Retinex-RAWMamba: Bridging Demosaicing and Denoising for Low-Light RAW Image Enhancement, [[Paper]](https://arxiv.org/abs/2409.07040.pdf) 221 | - (arXiv 2024.09) Semi-LLIE: Semi-supervised Contrastive Learning with Mamba-based Low-light Image Enhancement, [[Paper]](https://arxiv.org/abs/2409.16604.pdf),[[Code]](https://github.com/guanguanboy/Semi-LLIE) 222 | - (arXiv 2025.06) BSMamba: Brightness and Semantic Modeling for Long-Range Interaction in Low-Light Image Enhancement, [[Paper]](https://arxiv.org/abs/2506.18346.pdf),[[Code]](https://github.com/guanguanboy/Semi-LLIE) 223 | 224 | ### Emotion 225 | - (arXiv 2025.03) Mamba-VA: A Mamba-based Approach for Continuous Emotion Recognition in Valence-Arousal Space, [[Paper]](https://arxiv.org/pdf/2503.10104.pdf),[[Code]](https://github.com/FreedomPuppy77/Charon) 226 | 227 | ### Event Cameras 228 | - (arXiv 2024.02) State Space Models for Event Cameras, [[Paper]](https://arxiv.org/pdf/2402.15584.pdf) 229 | - (arXiv 2024.04) MambaPupil: Bidirectional Selective Recurrent model for Event-based Eye tracking, [[Paper]](https://arxiv.org/pdf/2404.12083.pdf) 230 | - 
(arXiv 2024.09) Path-adaptive Spatio-Temporal State Space Model for Event-based Recognition with Arbitrary Duration, [[Paper]](https://arxiv.org/pdf/2409.16953.pdf),[[Code]](https://github.com/jiazhou-garland/EventBind) 231 | - (arXiv 2025.03) EventMamba: Enhancing Spatio-Temporal Locality with State Space Models for Event-Based Video Reconstruction, [[Paper]](https://arxiv.org/pdf/2503.19721.pdf) 232 | - (arXiv 2025.05) PRE-Mamba: A 4D State Space Model for Ultra-High-Frequent Event Camera Deraining, [[Paper]](https://arxiv.org/pdf/2505.05307.pdf) 233 | - (arXiv 2025.06) Spatio-Temporal State Space Model For Efficient Event-Based Optical Flow, [[Paper]](https://arxiv.org/pdf/2506.07878.pdf),[[Code]](https://github.com/AhmedHumais/E-STMFlow) 234 | 235 | ### Face 236 | - (arXiv 2024.05) FER-YOLO-Mamba: Facial Expression Detection and Classification Based on Selective State Space, [[Paper]](https://arxiv.org/pdf/2405.01828.pdf),[[Code]](https://github.com/SwjtuMa/FER-YOLO-Mamba) 237 | - (arXiv 2024.09) Mamba-Enhanced Text-Audio-Video Alignment Network for Emotion Recognition in Conversations, [[Paper]](https://arxiv.org/pdf/2409.05243.pdf) 238 | - (arXiv 2025.01) WMamba: Wavelet-based Mamba for Face Forgery Detection, [[Paper]](https://arxiv.org/pdf/2501.09617.pdf) 239 | - (arXiv 2025.09) Mamba-CNN: A Hybrid Architecture for Efficient and Accurate Facial Beauty Prediction, [[Paper]](https://arxiv.org/pdf/2509.01431.pdf) 240 | - (arXiv 2025.09) SynergyNet: Fusing Generative Priors and State-Space Models for Facial Beauty Prediction, [[Paper]](https://arxiv.org/pdf/2509.17172.pdf) 241 | 242 | ### Few-Shot 243 | - (arXiv 2024.07) Mamba-FSCIL: Dynamic Adaptation with Selective State Space Model for Few-Shot Class-Incremental Learning, [[Paper]](https://arxiv.org/pdf/2407.06136.pdf),[[Code]](https://github.com/xiaojieli0903/Mamba-FSCIL) 244 | - (arXiv 2025.07) Few-Shot Object Detection via Spatial-Channel State Space Model, [[Paper]](https://arxiv.org/pdf/2507.15308.pdf) 
245 | 246 | ### Flow 247 | - (arXiv 2024.12) FlowMamba: Learning Point Cloud Scene Flow with Global Motion Propagation, [[Paper]](https://arxiv.org/pdf/2412.17366.pdf) 248 | - (arXiv 2025.02) MambaFlow: A Novel and Flow-guided State Space Model for Scene Flow Estimation, [[Paper]](https://arxiv.org/pdf/2502.16907.pdf),[[Code]](https://github.com/SCNU-RISLAB/MambaFlow) 249 | - (arXiv 2025.03) MambaFlow: A Mamba-Centric Architecture for End-to-End Optical Flow Estimation, [[Paper]](https://arxiv.org/pdf/2503.07046.pdf) 250 | - (arXiv 2025.04) DGFamba: Learning Flow Factorized State Space for Visual Domain Generalization, [[Paper]](https://arxiv.org/pdf/2504.08019.pdf) 251 | 252 | ### Fusion 253 | - (arXiv 2024.04) FusionMamba: Efficient Image Fusion with State Space Model, [[Paper]](https://arxiv.org/pdf/2404.07932.pdf) 254 | - (arXiv 2024.04) MambaDFuse: A Mamba-based Dual-phase Model for Multi-modality Image Fusion, [[Paper]](https://arxiv.org/pdf/2404.08406.pdf) 255 | - (arXiv 2024.04) FusionMamba: Dynamic Feature Enhancement for Multimodal Image Fusion with Mamba, [[Paper]](https://arxiv.org/pdf/2404.09498.pdf) 256 | - (arXiv 2024.04) A Novel State Space Model with Local Enhancement and State Sharing for Image Fusion, [[Paper]](https://arxiv.org/pdf/2404.09293.pdf) 257 | - (arXiv 2024.06) S4Fusion: Saliency-aware Selective State Space Model for Infrared Visible Image Fusion, [[Paper]](https://arxiv.org/pdf/2405.20881) 258 | - (arXiv 2024.09) Shuffle Mamba: State Space Models with Random Shuffle for Multi-Modal Image Fusion, [[Paper]](https://arxiv.org/pdf/2409.01728) 259 | - (arXiv 2024.09) Why mamba is effective? 
Exploit Linear Transformer-Mamba Network for Multi-Modality Image Fusion, [[Paper]](https://arxiv.org/pdf/2409.03223) 260 | - (arXiv 2025.03) Exploring State Space Model in Wavelet Domain: An Infrared and Visible Image Fusion Network via Wavelet Transform and State Space Model, [[Paper]](https://arxiv.org/pdf/2503.18378),[[Code]](https://github.com/Lmmh058/W-Mamba) 261 | - (arXiv 2025.08) MambaTrans: Multimodal Fusion Image Translation via Large Language Model Priors for Downstream Visual Tasks, [[Paper]](https://arxiv.org/pdf/2508.07803) 262 | 263 | ### Generation 264 | - (arXiv 2024.06) MVGamba: Unify 3D Content Generation as State Space Sequence Modeling, [[Paper]](https://arxiv.org/pdf/2406.06367) 265 | - (arXiv 2024.08) Scalable Autoregressive Image Generation with Mamba, [[Paper]](https://arxiv.org/pdf/2408.12245),[[Code]](https://github.com/hp-l33/AiM) 266 | - (arXiv 2025.02) Pushing the Boundaries of State Space Models for Image and Video Generation, [[Paper]](https://arxiv.org/pdf/2502.00972),[[Code]](https://yiconghong.me/HTH/) 267 | 268 | ### Gesture 269 | - (arXiv 2024.03) MambaTalk: Efficient Holistic Gesture Synthesis with Selective State Space Models, [[Paper]](https://arxiv.org/pdf/2403.09471.pdf) 270 | 271 | ### Graph 272 | - (arXiv 2024.01) Graph-Mamba: Towards Long-Range Graph Sequence Modeling with Selective State Spaces, [[Paper]](https://arxiv.org/pdf/2402.00789.pdf),[[Code]](https://github.com/bowang-lab/Graph-Mamba) 273 | - (arXiv 2024.02) Graph Mamba: Towards Learning on Graphs with State Space Models, [[Paper]](https://arxiv.org/pdf/2402.08678.pdf),[[Code]](https://github.com/GraphMamba/GMN) 274 | - (arXiv 2025.03) DynSTG-Mamba: Dynamic Spatio-Temporal Graph Mamba with Cross-Graph Knowledge Distillation for Gait Disorders Recognition, [[Paper]](https://arxiv.org/pdf/2503.13156.pdf) 275 | - (arXiv 2025.09) Hazy Pedestrian Trajectory Prediction via Physical Priors and Graph-Mamba, [[Paper]](https://arxiv.org/pdf/2509.24020.pdf) 276 | 277 | 
### Hyperspectral 278 | - (arXiv 2024.04) HSIMamba: Hyperspectral Imaging Efficient Feature Learning with Bidirectional State Space for Classification, [[Paper]](https://arxiv.org/pdf/2404.00272.pdf) 279 | - (arXiv 2024.04) SpectralMamba: Efficient Mamba for Hyperspectral Image Classification, [[Paper]](https://arxiv.org/pdf/2404.08489.pdf),[[Code]](https://github.com/danfenghong/SpectralMamba) 280 | - (arXiv 2024.04) HSIDMamba: Exploring Bidirectional State-Space Models for Hyperspectral Denoising, [[Paper]](https://arxiv.org/pdf/2404.09697.pdf),[[Code]](https://github.com/danfenghong/SpectralMamba) 281 | - (arXiv 2024.05) Spectral-Spatial Mamba for Hyperspectral Image Classification, [[Paper]](https://arxiv.org/pdf/2404.18401.pdf) 282 | - (arXiv 2024.05) S2Mamba: A Spatial-spectral State Space Model for Hyperspectral Image Classification, [[Paper]](https://arxiv.org/pdf/2404.18213.pdf),[[Code]](https://github.com/PURE-melo/S2Mamba) 283 | - (arXiv 2024.05) SSUMamba: Spatial-Spectral Selective State Space Model for Hyperspectral Image Denoising, [[Paper]](https://arxiv.org/pdf/2405.01726.pdf),[[Code]](https://github.com/lronkitty/SSUMamba) 284 | - (arXiv 2024.05) Mamba-in-Mamba: Centralized Mamba-Cross-Scan in Tokenized Mamba Model for Hyperspectral Image Classification, [[Paper]](https://arxiv.org/pdf/2405.12003.pdf),[[Code]](https://github.com/zhouweilian1904/Mamba-in-Mamba) 285 | - (arXiv 2024.06) DualMamba: A Lightweight Spectral-Spatial Mamba-Convolution Network for Hyperspectral Image Classification, [[Paper]](https://arxiv.org/pdf/2406.07050.pdf) 286 | - (arXiv 2024.07) HTD-Mamba: Efficient Hyperspectral Target Detection with Pyramid State Space Model, [[Paper]](https://arxiv.org/pdf/2407.06841.pdf),[[Code]](https://github.com/shendb2022/HTD-Mamba) 287 | - (arXiv 2024.07) GraphMamba: An Efficient Graph Structure Learning Vision Mamba for Hyperspectral Image Classification, 
[[Paper]](https://arxiv.org/pdf/2407.08255.pdf),[[Code]](https://github.com/ahappyyang/GraphMamba) 288 | - (arXiv 2024.08) Spatial-Spectral Morphological Mamba for Hyperspectral Image Classification, [[Paper]](https://arxiv.org/pdf/2408.01372.pdf),[[Code]](https://github.com/MHassaanButt/MorpMamba) 289 | - (arXiv 2024.08) WaveMamba: Spatial-Spectral Wavelet Mamba for Hyperspectral Image Classification, [[Paper]](https://arxiv.org/pdf/2408.01231.pdf) 290 | - (arXiv 2024.08) Multi-head Spatial-Spectral Mamba for Hyperspectral Image Classification, [[Paper]](https://arxiv.org/pdf/2408.01224.pdf),[[Code]](https://github.com/MHassaanButt/MHA_SS_Mamba) 291 | - (arXiv 2024.10) IGroupSS-Mamba: Interval Group Spatial-Spectral Mamba for Hyperspectral Image Classification, [[Paper]](https://arxiv.org/pdf/2410.05100.pdf) 292 | - (arXiv 2025.01) MambaHSI: Spatial-Spectral Mamba for Hyperspectral Image Classification, [[Paper]](https://arxiv.org/pdf/2501.04944.pdf),[[Code]](https://github.com/li-yapeng/MambaHSI) 293 | - (arXiv 2025.02) Hybrid State-Space and GRU-based Graph Tokenization Mamba for Hyperspectral Image Classification, [[Paper]](https://arxiv.org/pdf/2502.06427.pdf) 294 | - (arXiv 2025.04) HS-Mamba: Full-Field Interaction Multi-Groups Mamba for Hyperspectral Image Classification, [[Paper]](https://arxiv.org/pdf/2504.15612.pdf) 295 | - (arXiv 2025.04) MambaMoE: Mixture-of-Spectral-Spatial-Experts State Space Model for Hyperspectral Image Classification, [[Paper]](https://arxiv.org/pdf/2504.20509.pdf) 296 | - (arXiv 2025.08) PIF-Net: Ill-Posed Prior Guided Multispectral and Hyperspectral Image Fusion via Invertible Mamba and Fusion-Aware LoRA, [[Paper]](https://arxiv.org/pdf/2508.00453.pdf) 297 | - (arXiv 2025.09) Hyperspectral Mamba for Hyperspectral Object Tracking, [[Paper]](https://arxiv.org/pdf/2509.08265.pdf),[[Code]](https://github.com/lgao001/HyMamba) 298 | 299 | ### Inpainting 300 | - (arXiv 2024.07) MxT: Mamba x Transformer for Image Inpainting, 
[[Paper]](https://arxiv.org/pdf/2407.16126.pdf) 301 | - (arXiv 2024.11) SEM-Net: Efficient Pixel Modelling for image inpainting with Spatially Enhanced SSM, [[Paper]](https://arxiv.org/pdf/2411.06318.pdf),[[Code]](https://github.com/ChrisChen1023/SEM-Net) 302 | 303 | ### Instance Segmentation 304 | - (arXiv 2025.08) UIS-Mamba: Exploring Mamba for Underwater Instance Segmentation via Dynamic Tree Scan and Hidden State Weaken, [[Paper]](https://arxiv.org/pdf/2508.00421.pdf),[[Code]](https://github.com/Maricalce/UIS-Mamba) 305 | 306 | ### Keypoint 307 | - (arXiv 2024.12) MamKPD: A Simple Mamba Baseline for Real-Time 2D Keypoint Detection, [[Paper]](https://arxiv.org/pdf/2412.01422.pdf),[[Code]](https://mamkpd.github.io/index.html) 308 | 309 | ### Knowledge Distillation 310 | - (arXiv 2024.09) Unleashing the Potential of Mamba: Boosting a LiDAR 3D Sparse Detector by Using Cross-Model Knowledge Distillation, [[Paper]](https://arxiv.org/pdf/2409.11018.pdf) 311 | - (arXiv 2024.11) Vision Mamba Distillation for Low-resolution Fine-grained Image Classification, [[Paper]](https://arxiv.org/pdf/2411.17980.pdf),[[Code]](https://github.com/boa2004plaust/ViMD) 312 | - (arXiv 2025.02) Multimodal Mamba: Decoder-only Multimodal State Space Model via Quadratic to Linear Distillation, [[Paper]](https://arxiv.org/pdf/2502.13145.pdf),[[Code]](https://github.com/hustvl/mmMamba) 313 | - (arXiv 2025.06) Diffusion Transformer-to-Mamba Distillation for High-Resolution Image Generation, [[Paper]](https://arxiv.org/pdf/2506.18999.pdf) 314 | 315 | ### LLM 316 | - (arXiv 2024.03) DenseMamba: State Space Models with Dense Hidden Connection for Efficient Large Language Models, [[Paper]](https://arxiv.org/pdf/2403.00818.pdf),[[Code]](https://github.com/WailordHe/DenseSSM) 317 | - (arXiv 2024.05) Meteor: Mamba-based Traversal of Rationale for Large Language and Vision Models, [[Paper]](https://arxiv.org/pdf/2405.15574),[[Code]](https://github.com/ByungKwanLee/Meteor) 318 | - (arXiv 2024.07) 
ML-Mamba: Efficient Multi-Modal Large Language Model Utilizing Mamba-2, [[Paper]](https://arxiv.org/pdf/2407.19832) 319 | - (arXiv 2024.09) Shaking Up VLMs: Comparing Transformers and Structured State Space Models for Vision & Language Modeling, [[Paper]](https://arxiv.org/pdf/2409.05395),[[Code]](https://github.com/gpantaz/vl_mamba) 320 | 321 | ### Matching 322 | - (arXiv 2025.02) MambaGlue: Fast and Robust Local Feature Matching With Mamba, [[Paper]](https://arxiv.org/pdf/2502.00462),[[Code]](https://github.com/url-kaist/MambaGlue) 323 | - (arXiv 2025.03) JamMa: Ultra-lightweight Local Feature Matching with Joint Mamba, [[Paper]](https://arxiv.org/pdf/2503.03437),[[Code]](https://github.com/leoluxxx/JamMa) 324 | - (arXiv 2025.09) Similarity-Aware Selective State-Space Modeling for Semantic Correspondence, [[Paper]](https://arxiv.org/pdf/2509.24318),[[Code]](https://cvlab.postech.ac.kr/project/MambaMatcher/) 325 | 326 | ### Medical 327 | - (arXiv 2024.01) U-Mamba: Enhancing Long-range Dependency for Biomedical Image Segmentation, [[Paper]](https://arxiv.org/pdf/2401.04722.pdf), [[Code]](https://github.com/bowang-lab/U-Mamba) 328 | - (arXiv 2024.01) SegMamba: Long-range Sequential Modeling Mamba For 3D Medical Image Segmentation, [[Paper]](https://arxiv.org/pdf/2401.13560.pdf), [[Code]](https://github.com/ge-xing/SegMamba) 329 | - (arXiv 2024.01) Vivim: a Video Vision Mamba for Medical Video Object Segmentation, [[Paper]](https://arxiv.org/pdf/2401.14168.pdf), [[Code]](https://github.com/scott-yjyang/Vivim) 330 | - (arXiv 2024.01) MambaMorph: a Mamba-based Backbone with Contrastive Feature Learning for Deformable MR-CT Registration, [[Paper]](https://arxiv.org/pdf/2401.13934.pdf), [[Code]](https://github.com/guo-stone/mambamorph) 331 | - (arXiv 2024.02) VM-UNet: Vision Mamba UNet for Medical Image Segmentation, [[Paper]](https://arxiv.org/pdf/2402.02491.pdf),[[Code]](https://github.com/JCruan519/VM-UNet) 332 | - (arXiv 2024.02) nnMamba: 3D Biomedical Image 
Segmentation, Classification and Landmark Detection with State Space Model,[[Paper]](https://arxiv.org/pdf/2402.03526.pdf),[[Code]](https://github.com/lhaof/nnMamba) 333 | - (arXiv 2024.02) FD-Vision Mamba for Endoscopic Exposure Correction, [[Paper]](https://arxiv.org/abs/2402.06378) 334 | - (arXiv 2024.02) Semi-Mamba-UNet: Pixel-Level Contrastive Cross-Supervised Visual Mamba-based UNet for Semi-Supervised Medical Image Segmentation, [[Paper]](https://arxiv.org/pdf/2402.07245.pdf),[[Code]](https://github.com/ziyangwang007/Mamba-UNet) 335 | - (arXiv 2024.02) Marrying Perona Malik Diffusion with Mamba for Efficient Pediatric Echocardiographic Left Ventricular Segmentation,[[Paper]](https://arxiv.org/pdf/2402.08506.pdf) 336 | - (arXiv 2024.02) Weak-Mamba-UNet: Visual Mamba Makes CNN and ViT Work Better for Scribble-based Medical Image Segmentation,[[Paper]](https://arxiv.org/pdf/2402.10887.pdf),[[Code]](https://github.com/ziyangwang007/Mamba-UNet) 337 | - (arXiv 2024.03) MedMamba: Vision Mamba for Medical Image Classification,[[Paper]](https://arxiv.org/pdf/2403.03849.pdf),[[Code]](https://github.com/YubiaoYue/MedMamba) 338 | - (arXiv 2024.03) MambaMIR: An Arbitrary-Masked Mamba for Joint Medical Image Reconstruction and Uncertainty Estimation,[[Paper]](https://arxiv.org/pdf/2402.18451.pdf),[[Code]](https://github.com/ayanglab/MambaMIR) 339 | - (arXiv 2024.03) MamMIL: Multiple Instance Learning for Whole Slide Images with State Space Models,[[Paper]](https://arxiv.org/pdf/2403.05160.pdf) 340 | - (arXiv 2024.03) LightM-UNet: Mamba Assists in Lightweight UNet for Medical Image Segmentation,[[Paper]](https://arxiv.org/pdf/2403.05246.pdf),[[Code]](https://github.com/MrBlankness/LightM-UNet) 341 | - (arXiv 2024.03) MambaMIL: Enhancing Long Sequence Modeling with Sequence Reordering in Computational Pathology,[[Paper]](https://arxiv.org/pdf/2403.06800.pdf),[[Code]](https://github.com/isyangshu/MambaMIL) 342 | - (arXiv 2024.03) VM-UNET-V2 Rethinking Vision Mamba UNet for 
Medical Image Segmentation,[[Paper]](https://arxiv.org/pdf/2403.09157.pdf),[[Code]](https://github.com/nobodyplayer1/VM-UNetV2) 343 | - (arXiv 2024.03) MD-Dose: A Diffusion Model based on the Mamba for Radiotherapy Dose Prediction,[[Paper]](https://arxiv.org/pdf/2403.08479.pdf),[[Code]](https://github.com/flj19951219/mamba_dose) 344 | - (arXiv 2024.03) Large Window-based Mamba UNet for Medical Image Segmentation: Beyond Convolution and Self-attention,[[Paper]](https://arxiv.org/pdf/2403.07332.pdf),[[Code]](https://github.com/wjh892521292/LMa-UNet) 345 | - (arXiv 2024.03) ProMamba: Prompt-Mamba for polyp segmentation,[[Paper]](https://arxiv.org/pdf/2403.13660.pdf),[[Code]](https://github.com/wjh892521292/LMa-UNet) 346 | - (arXiv 2024.03) H-vmunet: High-order Vision Mamba UNet for Medical Image Segmentation,[[Paper]](https://arxiv.org/pdf/2403.13642.pdf),[[Code]](https://github.com/wurenkai/H-vmunet) 347 | - (arXiv 2024.03) Rotate to Scan: UNet-like Mamba with Triplet SSM Module for Medical Image Segmentation,[[Paper]](https://arxiv.org/pdf/2403.17701.pdf) 348 | - (arXiv 2024.03) Integrating Mamba Sequence Model and Hierarchical Upsampling Network for Accurate Semantic Segmentation of Multiple Sclerosis Legion,[[Paper]](https://arxiv.org/pdf/2403.17432.pdf) 349 | - (arXiv 2024.03) UltraLight VM-UNet: Parallel Vision Mamba Significantly Reduces Parameters for Skin Lesion Segmentation,[[Paper]](https://arxiv.org/pdf/2403.20035.pdf),[[Code]](https://github.com/wurenkai/UltraLight-VM-UNet) 350 | - (arXiv 2024.04) T-Mamba: Frequency-Enhanced Gated Long-Range Dependency for Tooth 3D CBCT Segmentation,[[Paper]](https://arxiv.org/pdf/2404.01065.pdf),[[Code]](https://github.com/isbrycee/T-Mamba) 351 | - (arXiv 2024.04) ViM-UNet: Vision Mamba for Biomedical Segmentation,[[Paper]](https://arxiv.org/pdf/2404.07705.pdf),[[Code]](https://github.com/constantinpape/torch-em/blob/main/vimunet.md) 352 | - (arXiv 2024.04) SurvMamba: State Space Model with Multi-grained Multi-modal 
Interaction for Survival Prediction,[[Paper]](https://arxiv.org/pdf/2404.08027.pdf) 353 | - (arXiv 2024.04) Vim4Path: Self-Supervised Vision Mamba for Histopathology Images,[[Paper]](https://arxiv.org/pdf/2404.13222.pdf),[[Code]](https://github.com/AtlasAnalyticsLab/Vim4Path) 354 | - (arXiv 2024.04) Sparse Reconstruction of Optical Doppler Tomography Based on State Space Model,[[Paper]](https://arxiv.org/pdf/2404.17484.pdf) 355 | - (arXiv 2024.05) AC-MAMBASEG: An adaptive convolution and Mamba-based architecture for enhanced skin lesion segmentation,[[Paper]](https://arxiv.org/pdf/2405.03011.pdf),[[Code]](https://github.com/vietthanh2710/AC-MambaSeg) 356 | - (arXiv 2024.05) HC-Mamba: Vision MAMBA with Hybrid Convolutional Techniques for Medical Image Segmentation,[[Paper]](https://arxiv.org/pdf/2405.05007.pdf) 357 | - (arXiv 2024.05) VM-DDPM: Vision Mamba Diffusion for Medical Image Synthesis,[[Paper]](https://arxiv.org/pdf/2405.05667.pdf) 358 | - (arXiv 2024.05) I2I-Mamba: Multi-modal medical image synthesis via selective state space modeling,[[Paper]](https://arxiv.org/pdf/2405.14022.pdf) 359 | - (arXiv 2024.05) MUCM-Net: A Mamba Powered UCM-Net for Skin Lesion Segmentation,[[Paper]](https://arxiv.org/pdf/2405.15925.pdf),[[Code]](https://github.com/chunyuyuan/MUCM-Net) 360 | - (arXiv 2024.06) MHS-VM: Multi-Head Scanning in Parallel Subspaces for Vision Mamba,[[Paper]](https://arxiv.org/pdf/2406.05992),[[Code]](https://github.com/PixDeep/MHS-VM) 361 | - (arXiv 2024.06) Vision Mamba: Cutting-Edge Classification of Alzheimer's Disease with 3D MRI Scans,[[Paper]](https://arxiv.org/pdf/2406.05757) 362 | - (arXiv 2024.06) Soft Masked Mamba Diffusion Model for CT to MRI Conversion,[[Paper]](https://arxiv.org/abs/2406.15910),[[Code]](https://github.com/wongzbb/DiffMa-Diffusion-Mamba) 363 | - (arXiv 2024.06) MMR-Mamba: Multi-Contrast MRI Reconstruction with Mamba and Spatial-Frequency Information Fusion,[[Paper]](https://arxiv.org/abs/2406.18950) 364 | - (arXiv 2024.07) 
Vision Mamba for Classification of Breast Ultrasound Images,[[Paper]](https://arxiv.org/abs/2407.03552) 365 | - (arXiv 2024.07) SliceMamba for Medical Image Segmentation,[[Paper]](https://arxiv.org/abs/2407.08481) 366 | - (arXiv 2024.07) SR-Mamba: Effective Surgical Phase Recognition with State Space Model,[[Paper]](https://arxiv.org/abs/2407.08333),[[Code]](https://github.com/rcao-hk/SR-Mamba) 367 | - (arXiv 2024.07) GFE-Mamba: Mamba-based AD Multi-modal Progression Assessment via Generative Feature Extraction from MCI,[[Paper]](https://arxiv.org/abs/2407.15719),[[Code]](https://github.com/Tinysqua/GFE-Mamba) 368 | - (arXiv 2024.08) PhysMamba: Leveraging Dual-Stream Cross-Attention SSD for Remote Physiological Measurement,[[Paper]](https://arxiv.org/abs/2408.01077) 369 | - (arXiv 2024.08) HMT-UNet: A hybird Mamba-Transformer Vision UNet for Medical Image Segmentation, [[Paper]](https://arxiv.org/pdf/2408.11289),[[Code]](https://github.com/simzhangbest/HMT-Unet) 370 | - (arXiv 2024.08) Costal Cartilage Segmentation with Topology Guided Deformable Mamba: Method and Benchmark, [[Paper]](https://arxiv.org/pdf/2408.07444) 371 | - (arXiv 2024.08) LoG-VMamba: Local-Global Vision Mamba for Medical Image Segmentation, [[Paper]](https://arxiv.org/pdf/2408.14415),[[Code]](https://github.com/Oulu-IMEDS/LoG-VMamba) 372 | - (arXiv 2024.08) ShapeMamba-EM: Fine-Tuning Foundation Model with Local Shape Descriptors and Mamba Blocks for 3D EM Image Segmentation, [[Paper]](https://arxiv.org/pdf/2408.14114) 373 | - (arXiv 2024.08) SpineMamba: Enhancing 3D Spinal Segmentation in Clinical Imaging through Residual Visual Mamba Layers and Shape Priors, [[Paper]](https://arxiv.org/pdf/2408.15887) 374 | - (arXiv 2024.09) Serp-Mamba: Advancing High-Resolution Retinal Vessel Segmentation with Selective State-Space Model, [[Paper]](https://arxiv.org/pdf/2409.04356) 375 | - (arXiv 2024.09) MpoxMamba: A Grouped Mamba-based Lightweight Hybrid Network for Mpox Detection, 
[[Paper]](https://arxiv.org/pdf/2409.04218),[[Code]](https://github.com/YubiaoYue/MpoxMamba) 376 | - (arXiv 2024.09) Microscopic-Mamba: Revealing the Secrets of Microscopic Images with Just 4M Parameters, [[Paper]](https://arxiv.org/pdf/2409.07896),[[Code]](https://github.com/zs1314/Microscopic-Mamba) 377 | - (arXiv 2024.09) OCTAMamba: A State-Space Model Approach for Precision OCTA Vasculature Segmentation, [[Paper]](https://arxiv.org/pdf/2409.08000),[[Code]](https://github.com/zs1314/OCTAMamba) 378 | - (arXiv 2024.09) Learning Brain Tumor Representation in 3D High-Resolution MR Images via Interpretable State Space Models, [[Paper]](https://arxiv.org/pdf/2409.07746),[[Code]](https://github.com/WinstonHuTiger/mamba_mae) 379 | - (arXiv 2024.09) MedSegMamba: 3D CNN-Mamba Hybrid Architecture for Brain Segmentation, [[Paper]](https://arxiv.org/pdf/2409.08307) 380 | - (arXiv 2024.09) Tri-Plane Mamba: Efficiently Adapting Segment Anything Model for 3D Medical Images, [[Paper]](https://arxiv.org/pdf/2409.08492),[[Code]](https://github.com/xmed-lab/TP-Mamba) 381 | - (arXiv 2024.09) SkinMamba: A Precision Skin Lesion Segmentation Architecture with Cross-Scale Global State Modeling and Frequency Boundary Guidance, [[Paper]](https://arxiv.org/pdf/2409.10890),[[Code]](https://github.com/zs1314/SkinMamba) 382 | - (arXiv 2024.09) MambaClinix: Hierarchical Gated Convolution and Mamba-Based U-Net for Enhanced 3D Medical Image Segmentation, [[Paper]](https://arxiv.org/pdf/2409.12533),[[Code]](https://github.com/CYB08/MambaClinix-PyTorch) 383 | - (arXiv 2024.09) MambaRecon: MRI Reconstruction with Structured State Space Models, [[Paper]](https://arxiv.org/pdf/2409.12401),[[Code]](https://github.com/yilmazkorkmaz1/MambaRecon) 384 | - (arXiv 2024.09) SPRMamba: Surgical Phase Recognition for Endoscopic Submucosal Dissection with Mamba, [[Paper]](https://arxiv.org/pdf/2409.12108) 385 | - (arXiv 2024.09) PhysMamba: Efficient Remote Physiological Measurement with SlowFast Temporal 
Difference Mamba, [[Paper]](https://arxiv.org/pdf/2409.12031),[[Code]](https://github.com/Chaoqi31/PhysMamba) 386 | - (arXiv 2024.09) Classification of Gleason Grading in Prostate Cancer Histopathology Images Using Deep Learning Techniques: YOLO, Vision Transformers, and Vision Mamba, [[Paper]](https://arxiv.org/pdf/2409.17122) 387 | - (arXiv 2024.10) MedVisionLlama: Leveraging Pre-Trained Large Language Model Layers to Enhance Medical Image Segmentation, [[Paper]](https://arxiv.org/pdf/2410.02458),[[Code]](https://bit.ly/3zf2CVs) 388 | - (arXiv 2024.10) SlimSeiz: Efficient Channel-Adaptive Seizure Prediction Using a Mamba-Enhanced Network, [[Paper]](https://arxiv.org/pdf/2410.09998),[[Code]](https://github.com/guoruilu/SlimSeiz) 389 | - (arXiv 2024.10) Taming Mambas for Voxel Level 3D Medical Image Segmentation, [[Paper]](https://arxiv.org/pdf/2410.15496),[[Code]](https://anonymous.4open.science/r/WACV2025-TamingMamba/README.md) 390 | - (arXiv 2024.10) R2Gen-Mamba: A Selective State Space Model for Radiology Report Generation, [[Paper]](https://arxiv.org/pdf/2410.18135),[[Code]](https://github.com/YonghengSun1997/R2Gen-Mamba) 391 | - (arXiv 2024.10) Advancing Efficient Brain Tumor Multi-Class Classification -- New Insights from the Vision Mamba Model in Transfer Learning, [[Paper]](https://arxiv.org/pdf/2410.21872) 392 | - (arXiv 2024.10) MLLA-UNet: Mamba-like Linear Attention in an Efficient U-Shape Model for Medical Image Segmentation, [[Paper]](https://arxiv.org/pdf/2410.23738),[[Code]](https://github.com/csyfjiang/MLLA-UNet) 393 | - (arXiv 2024.11) MedSora: Optical Flow Representation Alignment Mamba Diffusion Model for Medical Video Generation, [[Paper]](https://arxiv.org/pdf/2411.01647),[[Code]](https://wongzbb.github.io/MedSora/) 394 | - (arXiv 2024.11) KAN-Mamba FusionNet: Redefining Medical Image Segmentation with Non-Linear Modeling, [[Paper]](https://arxiv.org/pdf/2411.01647) 395 | - (arXiv 2024.12) MambaU-Lite: A Lightweight Model based on Mamba and 
Integrated Channel-Spatial Attention for Skin Lesion Segmentation, [[Paper]](https://arxiv.org/pdf/2412.01405),[[Code]](https://github.com/nqnguyen812/MambaU-Lite) 396 | - (arXiv 2024.12) 2DMamba: Efficient State Space Model for Image Representation with Applications on Giga-Pixel Whole Slide Image Classification, [[Paper]](https://arxiv.org/pdf/2412.00678),[[Code]](https://github.com/AtlasAnalyticsLab/2DMamba) 397 | - (arXiv 2024.12) SkelMamba: A State Space Model for Efficient Skeleton Action Recognition of Neurological Disorders, [[Paper]](https://arxiv.org/pdf/2411.19544) 398 | - (arXiv 2024.12) S3-Mamba: Small-Size-Sensitive Mamba for Lesion Segmentation, [[Paper]](https://arxiv.org/pdf/2412.14546),[[Code]](https://github.com/ErinWang2023/S3-Mamba) 399 | - (arXiv 2024.12) SAM-Mamba: Mamba Guided SAM Architecture for Generalized Zero-Shot Polyp Segmentation, [[Paper]](https://arxiv.org/pdf/2412.08482) 400 | - (arXiv 2025.01) HCMA-UNet: A Hybrid CNN-Mamba UNet with Inter-Slice Self-Attention for Efficient Breast Cancer Segmentation, [[Paper]](https://arxiv.org/pdf/2501.00751),[[Code]](https://anonymous.4open.science/r/ICME2025_HCMA-UNet/README.md) 401 | - (arXiv 2025.01) Merging Context Clustering with Visual State Space Models for Medical Image Segmentation, [[Paper]](https://arxiv.org/pdf/2501.01618),[[Code]](https://github.com/zymissy/CCViM) 402 | - (arXiv 2025.01) KM-UNet KAN Mamba UNet for medical image segmentation, [[Paper]](https://arxiv.org/pdf/2501.02559),[[Code]](https://github.com/2760613195/KM_UNet) 403 | - (arXiv 2025.01) MSV-Mamba: A Multiscale Vision Mamba Network for Echocardiography Segmentation, [[Paper]](https://arxiv.org/pdf/2501.07120) 404 | - (arXiv 2025.01) DM-Mamba: Dual-domain Multi-scale Mamba for MRI reconstruction, [[Paper]](https://arxiv.org/pdf/2501.08163) 405 | - (arXiv 2025.01) Surface Vision Mamba: Leveraging Bidirectional State Space Model for Efficient Spherical Manifold Representation, 
[[Paper]](https://arxiv.org/pdf/2501.14679),[[Code]](https://github.com/Rongzhao-He/surface-vision-mamba) 406 | - (arXiv 2025.02) Topology-Aware Wavelet Mamba for Airway Structure Segmentation in Postoperative Recurrent Nasopharyngeal Carcinoma CT Scans, [[Paper]](https://arxiv.org/pdf/2502.14363) 407 | - (arXiv 2025.02) CardiacMamba: A Multimodal RGB-RF Fusion Framework with State Space Models for Remote Physiological Measurement, [[Paper]](https://arxiv.org/pdf/2502.13624),[[Code]](https://github.com/WuZheng42/CardiacMamba) 408 | - (arXiv 2025.02) MobileViM: A Light-weight and Dimension-independent Vision Mamba for 3D Medical Image Analysis, [[Paper]](https://arxiv.org/pdf/2502.13524),[[Code]](https://github.com/anthonyweidai/MobileViM_3D/) 409 | - (arXiv 2025.02) A Reverse Mamba Attention Network for Pathological Liver Segmentation, [[Paper]](https://arxiv.org/pdf/2502.18232),[[Code]](https://github.com/JunZengz/RMAMamba) 410 | - (arXiv 2025.02) EndoMamba: An Efficient Foundation Model for Endoscopic Videos via Hierarchical Pre-training, [[Paper]](https://arxiv.org/pdf/2502.19090) 411 | - (arXiv 2025.03) From Claims to Evidence: A Unified Framework and Critical Analysis of CNN vs. Transformer vs. 
Mamba in Medical Image Segmentation, [[Paper]](https://arxiv.org/pdf/2503.01306),[[Code]](https://github.com/AI-in-Cardiovascular-Medicine/nnUZoo) 412 | - (arXiv 2025.03) SparseMamba-PCL: Scribble-Supervised Medical Image Segmentation via SAM-Guided Progressive Collaborative Learning, [[Paper]](https://arxiv.org/pdf/2503.01633),[[Code]](https://github.com/QLYCode/SparseMamba-PCL) 413 | - (arXiv 2025.03) COMMA: Coordinate-aware Modulated Mamba Network for 3D Dispersed Vessel Segmentation, [[Paper]](https://arxiv.org/pdf/2503.02332),[[Code]](https://github.com/shigen-StoneRoot/COMMA) 414 | - (arXiv 2025.03) XFMamba: Cross-Fusion Mamba for Multi-View Medical Image Classification, [[Paper]](https://arxiv.org/pdf/2503.02619),[[Code]](https://github.com/XZheng0427/XFMamba) 415 | - (arXiv 2025.03) SegResMamba: An Efficient Architecture for 3D Medical Image Segmentation, [[Paper]](https://arxiv.org/pdf/2503.07766) 416 | - (arXiv 2025.03) Multi-Modal Mamba Modeling for Survival Prediction (M4Survive): Adapting Joint Foundation Model Representations, [[Paper]](https://arxiv.org/pdf/2503.10057),[[Code]](https://github.com/microsoft/healthcareai-examples) 417 | - (arXiv 2025.03) MM-UNet: Meta Mamba UNet for Medical Image Segmentation, [[Paper]](https://arxiv.org/pdf/2503.17540) 418 | - (arXiv 2025.03) Cross-Modal Interactive Perception Network with Mamba for Lung Tumor Segmentation in PET-CT Images, [[Paper]](https://arxiv.org/pdf/2503.17261),[[Code]](https://github.com/mj129/CIPA) 419 | - (arXiv 2025.03) Mamba-3D as Masked Autoencoders for Accurate and Data-Efficient Analysis of Medical Ultrasound Videos, [[Paper]](https://arxiv.org/pdf/2503.20258),[[Code]](https://github.com/HenryZhou19/E-ViM3) 420 | - (arXiv 2025.03) A Comprehensive Analysis of Mamba for 3D Volumetric Medical Image Segmentation, [[Paper]](https://arxiv.org/pdf/2503.19308) 421 | - (arXiv 2025.03) Prompt-Guided Dual-Path UNet with Mamba for Medical Image Segmentation, 
[[Paper]](https://arxiv.org/pdf/2503.19589v1) 422 | - (arXiv 2025.03) ASP-VMUNet: Atrous Shifted Parallel Vision Mamba U-Net for Skin Lesion Segmentation, [[Paper]](https://arxiv.org/pdf/2503.19427),[[Code]](https://github.com/BaoBao0926/ASP-VMUNet) 423 | - (arXiv 2025.04) Hierarchical Feature Learning for Medical Point Clouds via State Space Model, [[Paper]](https://arxiv.org/pdf/2504.13015),[[Code]](https://github.com/wlsdzyzl/flemme) 424 | - (arXiv 2025.04) Mamba-Sea: A Mamba-based Framework with Global-to-Local Sequence Augmentation for Generalizable Medical Image Segmentation, [[Paper]](https://arxiv.org/pdf/2504.17515),[[Code]](https://github.com/orange-czh/Mamba-Sea) 425 | - (arXiv 2025.04) Mamba Based Feature Extraction And Adaptive Multilevel Feature Fusion For 3D Tumor Segmentation From Multi-modal Medical Image, [[Paper]](https://arxiv.org/pdf/2504.21281) 426 | - (arXiv 2025.05) Topo-VM-UNetV2: Encoding Topology into Vision Mamba UNet for Polyp Segmentation, [[Paper]](https://arxiv.org/pdf/2505.06210) 427 | - (arXiv 2025.05) GaMNet: A Hybrid Network with Gabor Fusion and NMamba for Efficient 3D Glioma Segmentation, [[Paper]](https://arxiv.org/pdf/2505.05520) 428 | - (arXiv 2025.05) MambaControl: Anatomy Graph-Enhanced Mamba ControlNet with Fourier Refinement for Diffusion-Based Disease Trajectory Prediction, [[Paper]](https://arxiv.org/pdf/2505.09965) 429 | - (arXiv 2025.06) DM-SegNet: Dual-Mamba Architecture for 3D Medical Image Segmentation with Global Context Modeling, [[Paper]](https://arxiv.org/pdf/2506.05297) 430 | - (arXiv 2025.06) FMaMIL: Frequency-Driven Mamba Multi-Instance Learning for Weakly Supervised Lesion Segmentation in Medical Images, [[Paper]](https://arxiv.org/pdf/2506.07652) 431 | - (arXiv 2025.06) Hybrid Vision Transformer-Mamba Framework for Autism Diagnosis via Eye-Tracking Analysis, [[Paper]](https://arxiv.org/pdf/2506.06886) 432 | - (arXiv 2025.06) FAMSeg: Fetal Femur and Cranial Ultrasound Segmentation Using Feature-Aware 
Attention and Mamba Enhancement, [[Paper]](https://arxiv.org/pdf/2506.07431) 433 | - (arXiv 2025.06) InceptionMamba: Efficient Multi-Stage Feature Enhancement with Selective State Space Model for Microscopic Medical Image Segmentation, [[Paper]](https://arxiv.org/pdf/2506.12208) 434 | - (arXiv 2025.06) MS-UMamba: An Improved Vision Mamba Unet for Fetal Abdominal Medical Image Segmentation, [[Paper]](https://arxiv.org/pdf/2506.12441) 435 | - (arXiv 2025.06) Unleashing Diffusion and State Space Models for Medical Image Segmentation, [[Paper]](https://arxiv.org/pdf/2506.12747),[[Code]](https://github.com/Rows21/KMax-Mamba) 436 | - (arXiv 2025.06) MARL-MambaContour: Unleashing Multi-Agent Deep Reinforcement Learning for Active Contour Optimization in Medical Image Segmentation, [[Paper]](https://arxiv.org/pdf/2506.18679) 437 | - (arXiv 2025.06) Holistic Surgical Phase Recognition with Hierarchical Input Dependent State Space Models, [[Paper]](https://arxiv.org/pdf/2506.21330) 438 | - (arXiv 2025.07) Unified Medical Image Segmentation with State Space Modeling Snake, [[Paper]](https://arxiv.org/pdf/2507.12760) 439 | - (arXiv 2025.07) DeSamba: Decoupled Spectral Adaptive Framework for 3D Multi-Sequence MRI Lesion Classification, [[Paper]](https://arxiv.org/pdf/2507.15487) 440 | - (arXiv 2025.07) Mamba-OTR: a Mamba-based Solution for Online Take and Release Detection from Untrimmed Egocentric Video, [[Paper]](https://arxiv.org/pdf/2507.16342) 441 | - (arXiv 2025.07) Differential-UMamba: Rethinking Tumor Segmentation Under Limited Data Scenarios, [[Paper]](https://arxiv.org/pdf/2507.18177) 442 | - (arXiv 2025.07) MCM: Mamba-based Cardiac Motion Tracking using Sequential Images in MRI, [[Paper]](https://arxiv.org/pdf/2507.17678),[[Code]](https://github.com/yjh-0104/MCM) 443 | - (arXiv 2025.07) Mammo-Mamba: A Hybrid State-Space and Transformer Architecture with Sequential Mixture of Experts for Multi-View Mammography, [[Paper]](https://arxiv.org/pdf/2507.17662) 444 | - 
(arXiv 2025.07) SP-Mamba: Spatial-Perception State Space Model for Unsupervised Medical Anomaly Detection, [[Paper]](https://arxiv.org/pdf/2507.19076),[[Code]](https://github.com/Ray-RuiPan/SP-Mamba) 445 | - (arXiv 2025.07) FaRMamba: Frequency-based learning and Reconstruction aided Mamba for Medical Segmentation, [[Paper]](https://arxiv.org/pdf/2507.20056) 446 | - (arXiv 2025.07) MambaVesselNet++: A Hybrid CNN-Mamba Architecture for Medical Image Segmentation, [[Paper]](https://arxiv.org/pdf/2507.19931),[[Code]](https://github.com/CC0117/MambaVesselNet) 447 | - (arXiv 2025.08) Improving Spatial Transcriptomics Prediction with Hybrid State Space-Vision Transformer Backbone in Pathology Vision Foundation Models, [[Paper]](https://arxiv.org/pdf/2508.00383),[[Code]](https://github.com/deepnoid-ai/MVHybrid) 448 | - (arXiv 2025.08) SSFMamba: Symmetry-driven Spatial-Frequency Feature Fusion for 3D Medical Image Segmentation, [[Paper]](https://arxiv.org/pdf/2508.03069) 449 | - (arXiv 2025.08) AMD-Mamba: A Phenotype-Aware Multi-Modal Framework for Robust AMD Prognosis, [[Paper]](https://arxiv.org/pdf/2508.02957) 450 | - (arXiv 2025.08) ClinicalFMamba: Advancing Clinical Assessment using Mamba-based Multimodal Neuroimaging Fusion, [[Paper]](https://arxiv.org/pdf/2508.03008) 451 | - (arXiv 2025.08) Text Embedded Swin-UMamba for DeepLesion Segmentation, [[Paper]](https://arxiv.org/pdf/2508.06453),[[Code]](https://github.com/ruida/LLM-Swin-UMamba) 452 | - (arXiv 2025.08) HiFi-Mamba: Dual-Stream W-Laplacian Enhanced Mamba for High-Fidelity MRI Reconstruction, [[Paper]](https://arxiv.org/pdf/2508.09179) 453 | - (arXiv 2025.08) UltraLight Med-Vision Mamba for Classification of Neoplastic Progression in Tubular Adenomas, [[Paper]](https://arxiv.org/pdf/2508.09339) 454 | - (arXiv 2025.08) Diversity-enhanced Collaborative Mamba for Semi-supervised Medical Image Segmentation, [[Paper]](https://arxiv.org/pdf/2508.13712),[[Code]](https://github.com/ShumengLI/DCMamba) 455 | - (arXiv 
2025.08) SRMA-Mamba: Spatial Reverse Mamba Attention Network for Pathological Liver Segmentation in MRI Volumes, [[Paper]](https://arxiv.org/pdf/2508.12410),[[Code]](https://github.com/JunZengz/SRMA-Mamba) 456 | - (arXiv 2025.09) SFD-Mamba2Net: Structure-Guided Frequency-Enhanced Dual-Stream Mamba2 Network for Coronary Artery Segmentation, [[Paper]](https://arxiv.org/pdf/2509.08934) 457 | - (arXiv 2025.09) SemaMIL: Semantic Reordering with Retrieval-Guided State Space Modeling for Whole Slide Image Classification, [[Paper]](https://arxiv.org/pdf/2509.00442) 458 | - (arXiv 2025.09) SpectMamba: Integrating Frequency and State Space Models for Enhanced Medical Image Detection, [[Paper]](https://arxiv.org/pdf/2509.01080) 459 | - (arXiv 2025.09) Joint-OCTAMamba: An OCTA Joint Segmentation Network Based on Feature Enhanced Mamba, [[Paper]](https://arxiv.org/pdf/2509.11649),[[Code]](https://github.com/lc-sfis/Joint-OCTAMamba) 460 | - (arXiv 2025.09) U-Mamba2: Scaling State Space Models for Dental Anatomy Segmentation in CBCT, [[Paper]](https://arxiv.org/pdf/2509.12069),[[Code]](https://github.com/zhiqin1998/UMamba2) 461 | - (arXiv 2025.09) CECT-Mamba: a Hierarchical Contrast-enhanced-aware Model for Pancreatic Tumor Subtyping from Multi-phase CECT, [[Paper]](https://arxiv.org/pdf/2509.12777) 462 | - (arXiv 2025.09) HybridMamba: A Dual-domain Mamba for 3D Medical Image Segmentation, [[Paper]](https://arxiv.org/pdf/2509.14609) 463 | - (arXiv 2025.09) Surgical-MambaLLM: Mamba2-enhanced Multimodal Large Language Model for VQLA in Robotic Surgery, [[Paper]](https://arxiv.org/pdf/2509.16618) 464 | - (arXiv 2025.09) ME-Mamba: Multi-Expert Mamba with Efficient Knowledge Capture and Fusion for Multimodal Survival Analysis, [[Paper]](https://arxiv.org/pdf/2509.16900) 465 | - (arXiv 2025.09) U-Mamba2-SSL for Semi-Supervised Tooth and Pulp Segmentation in CBCT, [[Paper]](https://arxiv.org/pdf/2509.20154),[[Code]](https://github.com/zhiqin1998/UMamba2) 466 | - (arXiv 2025.09) 
SlideMamba: Entropy-Based Adaptive Fusion of GNN and Mamba for Enhanced Representation Learning in Digital Pathology, [[Paper]](https://arxiv.org/pdf/2509.21239) 467 | - (arXiv 2025.09) MSD-KMamba: Bidirectional Spatial-Aware Multi-Modal 3D Brain Segmentation via Multi-scale Self-Distilled Fusion Strategy, [[Paper]](https://arxiv.org/pdf/2509.23677),[[Code]](https://github.com/daimao-zhang/MSD-KMamba) 468 | - (arXiv 2025.10) Cortical-SSM: A Deep State Space Model for EEG and ECoG Motor Imagery Decoding, [[Paper]](https://arxiv.org/pdf/2510.15371),[[Code]](https://cortical-ssm-u90sg.kinsta.page/) 469 | - (arXiv 2025.10) t-Mamba3D: A Time-Aware Spatio-Temporal State-Space Model for Breast Cancer Risk Prediction, [[Paper]](https://arxiv.org/pdf/2510.19003) 470 | - (arXiv 2025.10) EMRRG: Efficient Fine-Tuning Pre-trained X-ray Mamba Networks for Radiology Report Generation, [[Paper]](https://arxiv.org/pdf/2510.16776),[[Code]](https://github.com/Event-AHU/Medical_Image_Analysis) 471 | - (arXiv 2025.10) MambaX-Net: Dual-Input Mamba-Enhanced Cross-Attention Network for Longitudinal MRI Segmentation, [[Paper]](https://arxiv.org/pdf/2510.17529) 472 | - (arXiv 2025.10) CausalMamba: Scalable Conditional State Space Models for Neural Causal Inference, [[Paper]](https://arxiv.org/pdf/2510.17318) 473 | 474 | ### Mesh 475 | - (arXiv 2024.05) HandSSCA: 3D Hand Mesh Reconstruction with State Space Channel Attention from RGB images,[[Paper]](https://arxiv.org/pdf/2405.01066.pdf) 476 | - (arXiv 2025.04) Mesh Mamba: A Unified State Space Model for Saliency Prediction in Non-Textured and Textured Meshes,[[Paper]](https://arxiv.org/pdf/2504.01466.pdf),[[Code]](https://github.com/kaviezhang/MeshMamba) 477 | - (arXiv 2025.04) VM-BHINet:Vision Mamba Bimanual Hand Interaction Network for 3D Interacting Hand Mesh Recovery From a Single RGB Image, [[Paper]](https://arxiv.org/pdf/2504.14618.pdf) 478 | - (arXiv 2025.07) MeshMamba: State Space Models for Articulated 3D Mesh Generation and 
Reconstruction, [[Paper]](https://arxiv.org/pdf/2507.15212.pdf) 479 | 480 | ### MIL 481 | - (arXiv 2024.08) Mamba2MIL: State Space Duality Based Multiple Instance Learning for Computational Pathology,[[Paper]](https://arxiv.org/pdf/2408.15032.pdf), [[Code]](https://github.com/YuqiZhang-Buaa/Mamba2MIL) 482 | 483 | ### Mixture of Experts 484 | - (arXiv 2024.01) MoE-Mamba: Efficient Selective State Space Models with Mixture of Experts, [[Paper]](https://arxiv.org/pdf/2401.04081.pdf) 485 | - (arXiv 2024.01) BlackMamba: Mixture of Experts for State-Space Models, [[Paper]](https://arxiv.org/pdf/2402.01771.pdf), [[Code]](https://github.com/Zyphra/BlackMamba) 486 | - (arXiv 2025.01) Mixture-of-Mamba: Enhancing Multi-Modal State-Space Models with Modality-Aware Sparsity, [[Paper]](https://arxiv.org/pdf/2501.16295.pdf), [[Code]](https://github.com/Weixin-Liang/Mixture-of-Mamba) 487 | 488 | ### Motion 489 | - (arXiv 2024.03) Motion Mamba: Efficient and Long Sequence Motion Generation with Hierarchical and Bidirectional Selective SSM, [[Paper]](https://arxiv.org/pdf/2403.07487.pdf), [[Code]](https://steve-zeyu-zhang.github.io/MotionMamba) 490 | - (arXiv 2024.04) Text-controlled Motion Mamba: Text-Instructed Temporal Grounding of Human Motion, [[Paper]](https://arxiv.org/pdf/2404.11375.pdf) 491 | - (arXiv 2024.04) HumMUSS: Human Motion Understanding using State Space Models, [[Paper]](https://arxiv.org/pdf/2404.10880.pdf) 492 | - (arXiv 2024.04) MambaMOS: LiDAR-based 3D Moving Object Segmentation with Motion-aware State Space Model, [[Paper]](https://arxiv.org/pdf/2404.12794.pdf), [[Code]](https://github.com/Terminal-K/MambaMOS) 493 | - (arXiv 2024.05) SMCD: High Realism Motion Style Transfer via Mamba-based Diffusion, [[Paper]](https://arxiv.org/pdf/2405.02844.pdf) 494 | - (arXiv 2024.07) InfiniMotion: Mamba Boosts Memory in Transformer for Arbitrary Long Motion Generation, [[Paper]](https://arxiv.org/pdf/2407.10061.pdf), 
[[Code]](https://steve-zeyu-zhang.github.io/InfiniMotion/) 495 | - (arXiv 2024.08) Pedestrian Motion Prediction Using Transformer-based Behavior Clustering and Data-Driven Reachability Analysis, [[Paper]](https://arxiv.org/pdf/2407.10061.pdf) 496 | - (arXiv 2024.11) KMM: Key Frame Mask Mamba for Extended Motion Generation, [[Paper]](https://arxiv.org/pdf/2411.06481.pdf), [[Code]](https://github.com/steve-zeyu-zhang/KMM) 497 | - (arXiv 2025.03) HiSTF Mamba: Hierarchical Spatiotemporal Fusion with Multi-Granular Body-Spatial Modeling for High-Fidelity Text-to-Motion Generation, [[Paper]](https://arxiv.org/pdf/2503.06897.pdf) 498 | - (arXiv 2025.05) Dyadic Mamba: Long-term Dyadic Human Motion Synthesis, [[Paper]](https://arxiv.org/pdf/2505.09827.pdf) 499 | - (arXiv 2025.06) InterMamba: Efficient Human-Human Interaction Generation with Adaptive Spatio-Temporal Mamba, [[Paper]](https://arxiv.org/pdf/2506.03084.pdf) 500 | - (arXiv 2025.08) EgoMusic-driven Human Dance Motion Estimation with Skeleton Mamba, [[Paper]](https://arxiv.org/pdf/2508.10522.pdf), [[Code]](https://zquang2202.github.io/SkeletonMamba/) 501 | - (arXiv 2025.09) Can SSD-Mamba2 Unlock Reinforcement Learning for End-to-End Motion Control, [[Paper]](https://arxiv.org/pdf/2509.07593.pdf) 502 | 503 | ### Multimodal 504 | - (arXiv 2024.03) VL-Mamba: Exploring State Space Models for Multimodal Learning,[[Paper]](https://arxiv.org/pdf/2403.13600.pdf),[[Code]](https://github.com/ZhengYu518/VL-Mamba) 505 | - (arXiv 2024.09) DepMamba: Progressive Fusion Mamba for Multimodal Depression Detection,[[Paper]](https://arxiv.org/pdf/2409.15936.pdf),[[Code]](https://github.com/Jiaxin-Ye/DepMamba) 506 | - (arXiv 2024.12) AlignMamba: Enhancing Multimodal Mamba with Local and Global Cross-modal Alignment,[[Paper]](https://arxiv.org/pdf/2412.00833.pdf) 507 | - (arXiv 2025.01) AVS-Mamba: Exploring Temporal and Multi-modal Mamba for Audio-Visual 
Segmentation,[[Paper]](https://arxiv.org/pdf/2501.07810.pdf),[[Code]](https://github.com/SitongGong/AVS-Mamba) 508 | - (arXiv 2025.03) OmniMamba: Efficient and Unified Multimodal Understanding and Generation via State Space Models,[[Paper]](https://arxiv.org/pdf/2503.08686.pdf),[[Code]](https://github.com/hustvl/OmniMamba) 509 | - (arXiv 2025.06) Cross-modal State Space Modeling for Real-time RGB-thermal Wild Scene Semantic Segmentation,[[Paper]](https://arxiv.org/pdf/2506.17869.pdf),[[Code]](https://github.com/xiaodonguo/CMSSM) 510 | 511 | ### Multi-Task 512 | - (arXiv 2024.07) MTMamba: Enhancing Multi-Task Dense Scene Understanding by Mamba-Based Decoders,[[Paper]](https://arxiv.org/pdf/2407.02228.pdf),[[Code]](https://github.com/EnVision-Research/MTMamba) 513 | - (arXiv 2024.08) MTMamba++: Enhancing Multi-Task Dense Scene Understanding via Mamba-Based Decoders,[[Paper]](https://arxiv.org/pdf/2408.15101.pdf),[[Code]](https://github.com/EnVision-Research/MTMamba) 514 | 515 | ### OCR 516 | - (arXiv 2024.01) LOCOST: State-Space Models for Long Document Abstractive Summarization, [[Paper]](https://arxiv.org/pdf/2401.17919.pdf),[[Code]](https://github.com/flbbb/locost-summarization) 517 | - (arXiv 2024.10) Adaptive Multi Scale Document Binarisation Using Vision Mamba, [[Paper]](https://arxiv.org/pdf/2410.22811.pdf) 518 | 519 | ### OOD 520 | - (arXiv 2024.05) CLIP-Mamba: CLIP Pretrained Mamba Models with OOD and Hessian Evaluation,[[Paper]](https://arxiv.org/pdf/2404.19394.pdf),[[Code]](https://github.com/raytrun/mamba-clip) 521 | 522 | ### Point Cloud 523 | - (arXiv 2024.02) PointMamba: A Simple State Space Model for Point Cloud Analysis, [[Paper]](https://arxiv.org/pdf/2402.10739.pdf),[[Code]](https://github.com/LMD0311/PointMamba) 524 | - (arXiv 2024.02) Point Cloud Mamba: Point Cloud Learning via State Space Model, [[Paper]](https://arxiv.org/pdf/2403.00762.pdf),[[Code]](https://github.com/zhang-tao-whu/PCM) 525 | - (arXiv 2024.03) Point Mamba: A Novel Point Cloud 
Backbone Based on State Space Model with Octree-Based Ordering Strategy, [[Paper]](https://arxiv.org/pdf/2403.06467.pdf),[[Code]](https://github.com/IRMVLab/Point-Mamba) 526 | - (arXiv 2024.04) 3DMambaComplete: Exploring Structured State Space Model for Point Cloud Completion, [[Paper]](https://arxiv.org/pdf/2404.07106.pdf) 527 | - (arXiv 2024.04) Mamba3D: Enhancing Local Features for 3D Point Cloud Analysis via State Space Model, [[Paper]](https://arxiv.org/pdf/2404.14966.pdf) 528 | - (arXiv 2024.05) PoinTramba: A Hybrid Transformer-Mamba Framework for Point Cloud Analysis, [[Paper]](https://arxiv.org/pdf/2405.15463.pdf),[[Code]](https://github.com/xiaoyao3302/PoinTramba) 529 | - (arXiv 2024.06) PointABM:Integrating Bidirectional State Space Model with Multi-Head Self-Attention for Point Cloud Analysis, [[Paper]](https://arxiv.org/pdf/2406.06069) 530 | - (arXiv 2024.06) Voxel Mamba: Group-Free State Space Models for Point Cloud based 3D Object Detection, [[Paper]](https://arxiv.org/pdf/2406.06069),[[Code]](https://github.com/gwenzhang/Voxel-Mamba) 531 | - (arXiv 2024.06) Mamba24/8D: Enhancing Global Interaction in Point Clouds via State Space Model, [[Paper]](https://arxiv.org/pdf/2406.17442) 532 | - (arXiv 2024.07) Serialized Point Mamba: A Serialized Point Cloud Mamba Segmentation Model, [[Paper]](https://arxiv.org/pdf/2407.12319) 533 | - (arXiv 2024.07) PointDGMamba: Domain Generalization of Point Cloud Classification via Generalized State Space Model, [[Paper]](https://arxiv.org/pdf/2408.13574) 534 | - (arXiv 2024.08) MambaPlace:Text-to-Point-Cloud Cross-Modal Place Recognition with Attention Mamba Mechanisms, [[Paper]](https://arxiv.org/pdf/2408.15740),[[Code]](https://github.com/nuozimiaowu/MambaPlace/tree/main) 535 | - (arXiv 2024.10) MBPU: A Plug-and-Play State Space Model for Point Cloud Upsamping with Fast Point Rendering, [[Paper]](https://arxiv.org/pdf/2410.15941) 536 | - (arXiv 2024.11) NIMBA: Towards Robust and Principled Processing of Point Clouds 
With SSMs, [[Paper]](https://arxiv.org/pdf/2411.00151) 537 | - (arXiv 2024.11) STREAM: A Universal State-Space Model for Sparse Geometric Data, [[Paper]](https://arxiv.org/pdf/2411.12603) 538 | - (arXiv 2025.04) Efficient Spiking Point Mamba for Point Cloud Analysis, [[Paper]](https://arxiv.org/pdf/2504.14371) 539 | - (arXiv 2025.04) WeatherGen: A Unified Diverse Weather Generator for LiDAR Point Clouds via Spider Mamba Diffusion, [[Paper]](https://arxiv.org/pdf/2504.13561),[[Code]](https://github.com/wuyang98/weathergen) 540 | - (arXiv 2025.05) PillarMamba: Learning Local-Global Context for Roadside Point Cloud via Hybrid State Space Model, [[Paper]](https://arxiv.org/pdf/2505.05397) 541 | - (arXiv 2025.06) Polar Hierarchical Mamba: Towards Streaming LiDAR Object Detection with Point Clouds as Egocentric Sequences, [[Paper]](https://arxiv.org/pdf/2506.06944),[[Code]](https://github.com/meilongzhang/Polar-Hierarchical-Mamba) 542 | - (arXiv 2025.06) StruMamba3D: Exploring Structural Mamba for Self-supervised Point Cloud Representation Learning, [[Paper]](https://arxiv.org/pdf/2506.21541) 543 | - (arXiv 2025.07) SRMambaV2: Biomimetic Attention for Sparse Point Cloud Upsampling in Autonomous Driving, [[Paper]](https://arxiv.org/pdf/2507.17479) 544 | - (arXiv 2025.07) PointLAMA: Latent Attention meets Mamba for Efficient Point Cloud Pretraining, [[Paper]](https://arxiv.org/pdf/2507.17296) 545 | - (arXiv 2025.07) HydraMamba: Multi-Head State Space Model for Global Point Cloud Learning, [[Paper]](https://arxiv.org/pdf/2507.19778),[[Code]](https://github.com/Point-Cloud-Learning/HydraMamba) 546 | - (arXiv 2025.08) UST-SSM: Unified Spatio-Temporal State Space Models for Point Cloud Video Modeling, [[Paper]](https://arxiv.org/pdf/2508.14604),[[Code]](https://github.com/wangzy01/UST-SSM) 547 | 548 | ### Pose 549 | - (arXiv 2024.08) Pose Magic: Efficient and Temporally Consistent Human Pose Estimation with a Hybrid Mamba-GCN Network, 
[[Paper]](https://arxiv.org/pdf/2408.02922) 550 | - (arXiv 2024.08) PoseMamba: Monocular 3D Human Pose Estimation with Bidirectional Global-Local Spatio-Temporal State Space Model, [[Paper]](https://arxiv.org/pdf/2408.03540) 551 | - (arXiv 2025.04) EMO-X: Efficient Multi-Person Pose and Shape Estimation in One-Stage, [[Paper]](https://arxiv.org/pdf/2504.08718) 552 | - (arXiv 2025.07) A Structure-aware and Motion-adaptive Framework for 3D Human Pose Estimation with Mamba, [[Paper]](https://arxiv.org/pdf/2507.19852) 553 | - (arXiv 2025.09) MV-SSM: Multi-View State Space Modeling for 3D Human Pose Estimation, [[Paper]](https://arxiv.org/pdf/2509.00649),[[Code]](https://aviralchharia.github.io/MV-SSM/) 554 | 555 | 556 | ### Pruning 557 | - (arXiv 2025.05) Efficient Unstructured Pruning of Mamba State-Space Models for Resource-Constrained Environments, [[Paper]](https://arxiv.org/pdf/2505.08299) 558 | 559 | ### Quantization 560 | - (arXiv 2025.01) PTQ4VM: Post-Training Quantization for Visual Mamba, [[Paper]](https://arxiv.org/pdf/2412.20386),[[Code]](https://github.com/YoungHyun197/ptq4vm) 561 | - (arXiv 2025.01) QMamba: Post-Training Quantization for Vision State Space Models, [[Paper]](https://arxiv.org/pdf/2501.13624) 562 | - (arXiv 2025.03) ViM-VQ: Efficient Post-Training Vector Quantization for Visual Mamba, [[Paper]](https://arxiv.org/pdf/2503.09509) 563 | - (arXiv 2025.03) OuroMamba: A Data-Free Quantization Framework for Vision Mamba Models, [[Paper]](https://arxiv.org/pdf/2503.10959) 564 | 565 | ### Recognition 566 | - (arXiv 2024.05) MemoryMamba: Memory-Augmented State Space Model for Defect Recognition, [[Paper]](https://arxiv.org/pdf/2405.03673.pdf) 567 | - (arXiv 2024.05) OverlapMamba: Novel Shift State Space Model for LiDAR-based Place Recognition, [[Paper]](https://arxiv.org/pdf/2405.03673.pdf) 568 | - (arXiv 2024.07) An Empirical Study of Mamba-based Pedestrian Attribute Recognition, 
[[Paper]](https://arxiv.org/pdf/2407.10374.pdf),[[Code]](https://github.com/Event-AHU/OpenPAR) 569 | 570 | ### Reconstruction 571 | - (arXiv 2024.03) Gamba: Marry Gaussian Splatting with Mamba for single view 3D reconstruction, [[Paper]](https://arxiv.org/pdf/2403.18795.pdf), [[Code]](https://github.com/SkyworkAI/Gamba) 572 | - (arXiv 2024.05) GMSR:Gradient-Guided Mamba for Spectral Reconstruction from RGB Images, [[Paper]](https://arxiv.org/pdf/2405.07777.pdf),[[Code]](https://github.com/wxy11-27/GMSR) 573 | - (arXiv 2024.11) M3D: Dual-Stream Selective State Spaces and Depth-Driven Framework for High-Fidelity Single-View 3D Reconstruction, [[Paper]](https://arxiv.org/pdf/2411.12635),[[Code]](https://github.com/AnnnnnieZhang/M3D) 574 | 575 | ### Referring 576 | - (arXiv 2024.03) ReMamber: Referring Image Segmentation with Mamba Twister, [[Paper]](https://arxiv.org/pdf/2403.17839.pdf) 577 | - (arXiv 2024.10) MambaPainter: Neural Stroke-Based Rendering in a Single Step, [[Paper]](https://arxiv.org/pdf/2410.12524.pdf),[[Code]](https://github.com/STomoya/MambaPainter) 578 | 579 | ### Registration 580 | - (arXiv 2024.04) VMambaMorph: a Visual Mamba-based Framework with Cross-Scan Module for Deformable 3D Image Registration, [[Paper]](https://arxiv.org/pdf/2404.05105.pdf),[[Code]](https://github.com/ziyangwang007/VMambaMorph) 581 | - (arXiv 2024.07) Mamba? 
Catch The Hype Or Rethink What Really Helps for Image Registration, [[Paper]](https://arxiv.org/pdf/2407.19274.pdf),[[Code]](https://github.com/BailiangJ/rethink-reg) 582 | - (arXiv 2024.11) MambaReg: Mamba-Based Disentangled Convolutional Sparse Coding for Unsupervised Deformable Multi-Modal Image Registration, [[Paper]](https://arxiv.org/pdf/2411.01399.pdf) 583 | - (arXiv 2024.11) XPoint: A Self-Supervised Visual-State-Space based Architecture for Multispectral Image Registration, [[Paper]](https://arxiv.org/pdf/2411.07430.pdf),[[Code]](https://github.com/canyagmur/XPoint) 584 | - (arXiv 2025.06) MT-PCR: A Hybrid Mamba-Transformer with Spatial Serialization for Hierarchical Point Cloud Registration, [[Paper]](https://arxiv.org/pdf/2506.13183.pdf) 585 | 586 | ### Re-Identification 587 | - (arXiv 2024.12) MambaPro: Multi-Modal Object Re-Identification with Mamba Aggregation and Synergistic Prompt, [[Paper]](https://arxiv.org/pdf/2412.10707.pdf),[[Code]](https://github.com/924973292/MambaPro) 588 | 589 | ### Remote Sensing 590 | - (arXiv 2024.03) RSMamba: Remote Sensing Image Classification with State Space Model, [[Paper]](https://arxiv.org/pdf/2403.19654.pdf),[[Code]](https://github.com/KyanChen/RSMamba) 591 | - (arXiv 2024.04) RS-Mamba for Large Remote Sensing Image Dense Prediction, [[Paper]](https://arxiv.org/pdf/2404.02668.pdf),[[Code]](https://github.com/walking-shadow/Official_Remote_Sensing_Mamba) 592 | - (arXiv 2024.04) RS3Mamba: Visual State Space Model for Remote Sensing Images Semantic Segmentation, [[Paper]](https://arxiv.org/pdf/2404.02457.pdf),[[Code]](https://github.com/sstary/SSRS) 593 | - (arXiv 2024.04) Samba: Semantic Segmentation of Remotely Sensed Images with State Space Model, [[Paper]](https://arxiv.org/pdf/2404.01705.pdf),[[Code]](https://github.com/zhuqinfeng1999/Samba) 594 | - (arXiv 2024.04) ChangeMamba: Remote Sensing Change Detection with Spatio-Temporal State Space Model, 
[[Paper]](https://arxiv.org/pdf/2404.03425.pdf),[[Code]](https://github.com/ChenHongruixuan/MambaCD) 595 | - (arXiv 2024.05) RSCaMa: Remote Sensing Image Change Captioning with State Space Model, [[Paper]](https://arxiv.org/pdf/2404.18895.pdf),[[Code]](https://github.com/Chen-Yang-Liu/RSCaMa) 596 | - (arXiv 2024.05) Frequency-Assisted Mamba for Remote Sensing Image Super-Resolution, [[Paper]](https://arxiv.org/pdf/2405.04964.pdf) 597 | - (arXiv 2024.05) Rethinking Scanning Strategies with Vision Mamba in Semantic Segmentation of Remote Sensing Imagery: An Experimental Study, [[Paper]](https://arxiv.org/pdf/2405.08493.pdf) 598 | - (arXiv 2024.05) CM-UNet: Hybrid CNN-Mamba UNet for Remote Sensing Image Semantic Segmentation, [[Paper]](https://arxiv.org/pdf/2405.10530.pdf),[[Code]](https://github.com/XiaoBuL/CM-UNet) 599 | - (arXiv 2024.06) CDMamba: Remote Sensing Image Change Detection with Mamba, [[Paper]](https://arxiv.org/pdf/2406.04207.pdf),[[Code]](https://github.com/zmoka-zht/CDMamba) 600 | - (arXiv 2024.06) HDMba: Hyperspectral Remote Sensing Imagery Dehazing with State Space Model, [[Paper]](https://arxiv.org/pdf/2406.05700),[[Code]](https://github.com/RsAI-lab/HDMba) 601 | - (arXiv 2024.06) PyramidMamba: Rethinking Pyramid Feature Fusion with Selective Space State Model for Semantic Segmentation of Remote Sensing Imagery, [[Paper]](https://arxiv.org/pdf/2406.10828),[[Code]](https://github.com/WangLibo1995/GeoSeg) 602 | - (arXiv 2024.07) A Mamba-based Siamese Network for Remote Sensing Change Detection, [[Paper]](https://arxiv.org/pdf/2407.06839),[[Code]](https://github.com/JayParanjape/M-CD) 603 | - (arXiv 2024.07) DMM: Disparity-guided Multispectral Mamba for Oriented Object Detection in Remote Sensing, [[Paper]](https://arxiv.org/pdf/2407.08132),[[Code]](https://github.com/Another-0/DMM) 604 | - (arXiv 2024.08) UNetMamba: Efficient UNet-Like Mamba for Semantic Segmentation of High-Resolution Remote Sensing Images, 
[[Paper]](https://arxiv.org/pdf/2408.11545),[[Code]](https://github.com/EnzeZhu2001/UNetMamba) 605 | - (arXiv 2024.09) UV-Mamba: A DCN-Enhanced State Space Model for Urban Village Boundary Identification in High-Resolution Remote Sensing Images, [[Paper]](https://arxiv.org/pdf/2409.03431) 606 | - (arXiv 2024.09) PPMamba: A Pyramid Pooling Local Auxiliary SSM-Based Model for Remote Sensing Image Semantic Segmentation, [[Paper]](https://arxiv.org/pdf/2409.06309) 607 | - (arXiv 2024.09) SITSMamba for Crop Classification based on Satellite Image Time Series, [[Paper]](https://arxiv.org/pdf/2409.09673),[[Code]](https://github.com/XiaoleiQinn/SITSMamba) 608 | - (arXiv 2024.10) RemoteDet-Mamba: A Hybrid Mamba-CNN Network for Multi-modal Object Detection in Remote Sensing Images, [[Paper]](https://arxiv.org/pdf/2410.13532) 609 | - (arXiv 2025.01) CD-Lamba: Boosting Remote Sensing Change Detection via a Cross-Temporal Locally Adaptive State Space Model, [[Paper]](https://arxiv.org/pdf/2501.15455),[[Code]](https://github.com/xwmaxwma/rschange) 610 | - (arXiv 2025.02) SatMamba: Development of Foundation Models for Remote Sensing Imagery Using State Space Models, [[Paper]](https://arxiv.org/pdf/2502.00435),[[Code]](https://github.com/mdchuc/HRSFM) 611 | - (arXiv 2025.03) DynamicVis: An Efficient and General Visual Foundation Model for Remote Sensing Image Understanding, [[Paper]](https://arxiv.org/pdf/2503.16426),[[Code]](https://github.com/KyanChen/DynamicVis) 612 | - (arXiv 2025.03) 2DMCG:2D Mamba with Change Flow Guidance for Change Detection in Remote Sensing, [[Paper]](https://arxiv.org/pdf/2503.00521) 613 | - (arXiv 2025.03) M3amba: CLIP-driven Mamba Model for Multi-modal Remote Sensing Classification, [[Paper]](https://arxiv.org/pdf/2503.06446),[[Code]](https://github.com/kaka-Cao/M3amba) 614 | - (arXiv 2025.03) RoMA: Scaling up Mamba-based Foundation Models for Remote Sensing, [[Paper]](https://arxiv.org/pdf/2503.10392),[[Code]](https://github.com/MiliLab/RoMA) 615 | - 
(arXiv 2025.06) OSDMamba: Enhancing Oil Spill Detection from Remote Sensing Images Using Selective State Space Model, [[Paper]](https://arxiv.org/pdf/2506.18006),[[Code]](https://github.com/Chenshuaiyu1120/Oil-Spill-detection) 616 | - (arXiv 2025.07) AtrousMamaba: An Atrous-Window Scanning Visual State Space Model for Remote Sensing Change Detection, [[Paper]](https://arxiv.org/pdf/2507.16172) 617 | - (arXiv 2025.09) CSFMamba: Cross State Fusion Mamba Operator for Multimodal Remote Sensing Image Classification, [[Paper]](https://arxiv.org/pdf/2509.00677) 618 | - (arXiv 2025.09) DC-Mamba: Bi-temporal deformable alignment and scale-sparse enhancement for remote sensing change detection, [[Paper]](https://arxiv.org/pdf/2509.15563) 619 | - (arXiv 2025.09) SwinMamba: A hybrid local-global mamba framework for enhancing semantic segmentation of remotely sensed images, [[Paper]](https://arxiv.org/pdf/2509.20918) 620 | 621 | ### Restoration 622 | - (arXiv 2024.02) A Simple Baseline for Image Restoration with State-Space Model, [[Paper]](https://arxiv.org/pdf/2402.15648.pdf),[[Code]](https://github.com/csguoh/MambaIR) 623 | - (arXiv 2024.03) VmambaIR: Visual State Space Model for Image Restoration, [[Paper]](https://arxiv.org/pdf/2403.11423.pdf),[[Code]](https://github.com/AlphacatPlus/VmambaIR) 624 | - (arXiv 2024.03) Serpent: Scalable and Efficient Image Restoration via Multi-scale Structured State Space Models, [[Paper]](https://arxiv.org/pdf/2403.17902.pdf) 625 | - (arXiv 2024.08) Multi-Scale Representation Learning for Image Restoration with State-Space Model, [[Paper]](https://arxiv.org/pdf/2408.10145.pdf) 626 | - (arXiv 2024.12) Multi-dimensional Visual Prompt Enhanced Image Restoration via Mamba-Transformer Aggregation, [[Paper]](https://arxiv.org/pdf/2412.15845.pdf),[[Code]](https://github.com/12138-chr/MTAIR) 627 | - (arXiv 2025.01) Directing Mamba to Complex Textures: An Efficient Texture-Aware State Space Model for Image Restoration, 
[[Paper]](https://arxiv.org/pdf/2501.16583.pdf) 628 | - (arXiv 2025.01) MatIR: A Hybrid Mamba-Transformer Image Restoration Model, [[Paper]](https://arxiv.org/pdf/2501.18401.pdf),[[Code]](https://github.com/wenjuan7275/MatIR) 629 | - (arXiv 2025.03) DehazeMamba: SAR-guided Optical Remote Sensing Image Dehazing with Adaptive State Space Model, [[Paper]](https://arxiv.org/pdf/2503.13073.pdf),[[Code]](https://github.com/mmic-lcl/Datasets-and-benchmark-code) 630 | - (arXiv 2025.03) Q-MambaIR: Accurate Quantized Mamba for Efficient Image Restoration, [[Paper]](https://arxiv.org/pdf/2503.21970.pdf) 631 | - (arXiv 2025.04) DPMambaIR:All-in-One Image Restoration via Degradation-Aware Prompt State Space Model, [[Paper]](https://arxiv.org/pdf/2504.17732.pdf) 632 | - (arXiv 2025.06) ControlMambaIR: Conditional Controls with State-Space Model for Image Restoration, [[Paper]](https://arxiv.org/pdf/2506.02633.pdf) 633 | - (arXiv 2025.06) M2Restore: Mixture-of-Experts-based Mamba-CNN Fusion Framework for All-in-One Image Restoration, [[Paper]](https://arxiv.org/pdf/2506.07814.pdf) 634 | - (arXiv 2025.06) EAMamba: Efficient All-Around Vision State Space Model for Image Restoration, [[Paper]](https://arxiv.org/pdf/2506.22246.pdf),[[Code]](https://github.com/daidaijr/EAMamba) 635 | - (arXiv 2025.09) VAMamba: An Efficient Visual Adaptive Mamba for Image Restoration, [[Paper]](https://arxiv.org/pdf/2509.23601.pdf),[[Code]](https://github.com/WaterHQH/VAMamba) 636 | - (arXiv 2025.11) WaMaIR: Image Restoration via Multiscale Wavelet Convolutions and Mamba-based Channel Modeling with Texture Enhancement, [[Paper]](https://arxiv.org/pdf/2510.16765.pdf) 637 | 638 | ### Retrieval 639 | - (arXiv 2024.08) MUSE: Mamba is Efficient Multi-scale Learner for Text-video Retrieval, [[Paper]](https://arxiv.org/pdf/2408.10575.pdf),[[Code]](https://github.com/hrtang22/MUSE) 640 | - (arXiv 2025.06) MamFusion: Multi-Mamba with Temporal Fusion for Partially Relevant Video Retrieval, 
[[Paper]](https://arxiv.org/pdf/2506.03473.pdf),[[Code]](https://github.com/Vision-Multimodal-Lab-HZCU/MamFusion) 641 | - (arXiv 2025.06) MambaHash: Visual State Space Deep Hashing Model for Large-Scale Image Retrieval, [[Paper]](https://arxiv.org/pdf/2506.16353.pdf),[[Code]](https://github.com/shuaichaochao/MambaHash) 642 | 643 | ### Robot 644 | - (arXiv 2024.06) RoboMamba: Multimodal State Space Model for Efficient Robot Reasoning and Manipulation, [[Paper]](https://arxiv.org/pdf/2406.04339.pdf),[[Code]](https://github.com/lmzpai/roboMamba) 645 | - (arXiv 2024.08) OMEGA: Efficient Occlusion-Aware Navigation for Air-Ground Robot in Dynamic Environments via State Space Model, [[Paper]](https://arxiv.org/pdf/2408.10618.pdf),[[Code]](https://github.com/jmwang0117/Occ-Mamba) 646 | - (arXiv 2024.09) GraspMamba: A Mamba-based Language-driven Grasp Detection Framework with Hierarchical Feature Learning, [[Paper]](https://arxiv.org/pdf/2409.14403) 647 | - (arXiv 2024.11) VMGNet: A Low Computational Complexity Robotic Grasping Network Based on VMamba with Multi-Scale Feature Fusion, [[Paper]](https://arxiv.org/pdf/2411.12520) 648 | - (arXiv 2025.09) LocoMamba: Vision-Driven Locomotion via End-to-End Deep Reinforcement Learning with Mamba, [[Paper]](https://arxiv.org/pdf/2508.11849),[[Code]](https://github.com/allen-quad-robot/locomamba) 649 | 650 | ### Salient 651 | - (arXiv 2024.10) MambaSOD: Dual Mamba-Driven Cross-Modal Fusion Network for RGB-D Salient Object Detection, [[Paper]](https://arxiv.org/pdf/2410.15015),[[Code]](https://github.com/YueZhan721/MambaSOD) 652 | - (arXiv 2024.11) LFSamba: Marry SAM with Mamba for Light Field Salient Object Detection, [[Paper]](https://arxiv.org/pdf/2411.06652),[[Code]](https://github.com/liuzywen/LFScribble) 653 | - (arXiv 2025.03) SSNet: Saliency Prior and State Space Model-based Network for Salient Object Detection in RGB-D Images, [[Paper]](https://arxiv.org/pdf/2503.02270) 654 | - (arXiv 2025.09) LEAF-Mamba: Local Emphatic and 
Adaptive Fusion State Space Model for RGB-D Salient Object Detection, [[Paper]](https://arxiv.org/pdf/2509.18683) 655 | 656 | ### Self supervised learning 657 | - (arXiv 2024.08) MambaMIM: Pre-training Mamba with State Space Token-interpolation, [[Paper]](https://arxiv.org/pdf/2408.08070.pdf),[[Code]](https://github.com/FengheTan9/MambaMIM) 658 | 659 | ### Semantic Segmentation 660 | - (arXiv 2024.04) Sigma: Siamese Mamba Network for Multi-Modal Semantic Segmentation, [[Paper]](https://arxiv.org/pdf/2404.04256.pdf),[[Code]](https://github.com/zifuwan/Sigma) 661 | - (arXiv 2024.06) Vision Mamba-based autonomous crack segmentation on concrete, asphalt, and masonry surfaces, [[Paper]](https://arxiv.org/pdf/2404.16518.pdf) 662 | - (arXiv 2024.07) Mamba meets crack segmentation, [[Paper]](https://arxiv.org/pdf/2407.15714.pdf) 663 | - (arXiv 2024.11) Deformable Mamba for Wide Field of View Segmentation, [[Paper]](https://arxiv.org/pdf/2411.16481.pdf),[[Code]](https://github.com/JieHu1996/DeformableMamba) 664 | - (arXiv 2024.12) SegMAN: Omni-scale Context Modeling with State Space Models and Local Attention for Semantic Segmentation, [[Paper]](https://arxiv.org/pdf/2412.11890.pdf),[[Code]](https://github.com/yunxiangfu2001/SegMAN) 665 | - (arXiv 2025.03) SCSegamba: Lightweight Structure-Aware Vision Mamba for Crack Segmentation in Structures, [[Paper]](https://arxiv.org/pdf/2503.01113.pdf),[[Code]](https://github.com/Karl1109/SCSegamba) 666 | - (arXiv 2025.04) Mamba as a Bridge: Where Vision Foundation Models Meet Vision Language Models for Domain-Generalized Semantic Segmentation, [[Paper]](https://arxiv.org/pdf/2504.03193.pdf),[[Code]](https://github.com/devinxzhang/MFuser) 667 | - (arXiv 2025.06) ECMNet:Lightweight Semantic Segmentation with Efficient CNN-Mamba Network, [[Paper]](https://arxiv.org/pdf/2506.08629.pdf),[[Code]](https://github.com/devinxzhang/MFuser) 668 | - (arXiv 2025.07) HybridTM: Combining Transformer and Mamba for 3D Semantic Segmentation, 
[[Paper]](https://arxiv.org/pdf/2507.18575.pdf),[[Code]](https://github.com/deepinact/HybridTM) 669 | - (arXiv 2025.07) LIDAR: Lightweight Adaptive Cue-Aware Fusion Vision Mamba for Multimodal Segmentation of Structural Cracks, [[Paper]](https://arxiv.org/pdf/2507.22477.pdf),[[Code]](https://github.com/Karl1109/LIDAR-Mamba) 670 | 671 | ### Shadow 672 | - (arXiv 2024.11) ShadowMamba: State-Space Model with Boundary-Region Selective Scan for Shadow Removal, [[Paper]](https://arxiv.org/pdf/2411.03260.pdf) 673 | 674 | ### SLAM 675 | - (arXiv 2025.01) MambaVO: Deep Visual Odometry Based on Sequential Matching Refinement and Training Smoothing, [[Paper]](https://arxiv.org/pdf/2412.20082.pdf) 676 | 677 | ### Spatiotemporal Forecasting 678 | - (arXiv 2024.03) VMRNN: Integrating Vision Mamba and LSTM for Efficient and Accurate Spatiotemporal Forecasting, [[Paper]](https://arxiv.org/pdf/2403.16536.pdf),[[Code]](https://github.com/yyyujintang/VMRNN-PyTorch) 679 | - (arXiv 2025.06) Damba-ST: Domain-Adaptive Mamba for Efficient Urban Spatio-Temporal Prediction, [[Paper]](https://arxiv.org/pdf/2506.18939.pdf) 680 | 681 | ### Speech 682 | - (arXiv 2024.10) CleanUMamba: A Compact Mamba Network for Speech Denoising using Channel Pruning, [[Paper]](https://arxiv.org/pdf/2410.11062.pdf),[[Code]](https://github.com/lab-emi/CleanUMamba) 683 | - (arXiv 2024.11) SAV-SE: Scene-aware Audio-Visual Speech Enhancement with Selective State Space Model, [[Paper]](https://arxiv.org/pdf/2411.07751.pdf),[[Code]](https://avsepage.github.io/) 684 | 685 | ### State Space Model (SSM) 686 | - (NeurIPS 2020) HiPPO: Recurrent Memory with Optimal Polynomial Projections, [[Paper]](https://arxiv.org/pdf/2008.07669.pdf),[[Code]](https://github.com/HazyResearch/hippo-code) 687 | - (ICLR 2022) Efficiently Modeling Long Sequences with Structured State Spaces, [[Paper]](https://arxiv.org/pdf/2111.00396.pdf),[[Code]](https://github.com/state-spaces/s4) 688 | - (ICLR 2023) Hungry Hungry Hippos: Toward Language 
Modeling with State Space Models, [[Paper]](https://arxiv.org/pdf/2212.14052.pdf),[[Code]](https://github.com/HazyResearch/H3) 689 | - (arXiv 2024.01) MambaByte: Token-free Selective State Space Model, [[Paper]](https://arxiv.org/pdf/2401.13660.pdf),[[Code]](https://github.com/lucidrains/MEGABYTE-pytorch) 690 | - (arXiv 2024.02) Can Mamba Learn How to Learn? A Comparative Study on In-Context Learning Tasks, [[Paper]](https://arxiv.org/pdf/2402.04248.pdf) 691 | - (arXiv 2024.02) Hierarchical State Space Models for Continuous Sequence-to-Sequence Modeling, [[Paper]](https://arxiv.org/pdf/2402.10211.pdf),[[Code]](https://github.com/raunaqbhirangi/hiss/tree/main) 692 | - (arXiv 2024.05) MambaOut: Do We Really Need Mamba for Vision, [[Paper]](https://arxiv.org/pdf/2405.07992.pdf),[[Code]](https://github.com/yuweihao/MambaOut) 693 | - (arXiv 2024.07) VSSD: Vision Mamba with Non-Causal State Space Duality, [[Paper]](https://arxiv.org/pdf/2407.18559.pdf),[[Code]](https://github.com/YuHengsss/VSSD) 694 | - (arXiv 2024.08) DeMansia: Mamba Never Forgets Any Tokens, [[Paper]](https://arxiv.org/pdf/2408.01986.pdf),[[Code]](https://github.com/catalpaaa/DeMansia) 695 | - (arXiv 2024.09) Saliency Unification through Mamba for Visual Attention Modeling, [[Paper]](https://arxiv.org/pdf/2406.17815.pdf),[[Code]](https://github.com/Arhosseini77/SUM) 696 | - (arXiv 2024.12) V"Mean"ba: Visual State Space Models only need 1 hidden dimension, [[Paper]](https://arxiv.org/pdf/2412.16602.pdf) 697 | - (arXiv 2025.03) Ancestral Mamba: Enhancing Selective Discriminant Space Model with Online Visual Prototype Learning for Efficient and Robust Discriminant Approach, [[Paper]](https://arxiv.org/pdf/2503.22729.pdf) 698 | 699 | ### Stereo 700 | - (arXiv 2025.04) StereoMamba: Real-time and Robust Intraoperative Stereo Disparity Estimation via Long-range Spatial Dependencies, [[Paper]](https://arxiv.org/pdf/2504.17401.pdf),[[Code]](https://github.com/MichaelWangGo/StereoMamba) 701 | 702 | ### Style 
Transfer 703 | - (arXiv 2024.05) StyleMamba : State Space Model for Efficient Text-driven Image Style Transfer, [[Paper]](https://arxiv.org/pdf/2405.05027.pdf) 704 | - (arXiv 2024.09) Mamba-ST: State Space Model for Efficient Style Transfer, [[Paper]](https://arxiv.org/pdf/2409.10385.pdf),[[Code]](https://github.com/FilippoBotti/MambaST) 705 | - (arXiv 2025.03) SaMam: Style-aware State Space Model for Arbitrary Image Style Transfer, [[Paper]](https://arxiv.org/pdf/2503.15934.pdf) 706 | 707 | ### Super-Resolution 708 | - (arXiv 2024.05) DVMSR: Distillated Vision Mamba for Efficient Super-Resolution, [[Paper]](https://arxiv.org/pdf/2405.03008.pdf),[[Code]](https://github.com/nathan66666/DVMSR) 709 | - (arXiv 2024.05) IRSRMamba: Infrared Image Super-Resolution via Mamba-based Wavelet Transform Feature Modulation Model, [[Paper]](https://arxiv.org/pdf/2405.09873.pdf),[[Code]](https://github.com/yongsongH/IRSRMamba) 710 | - (arXiv 2024.06) Mamba-based Light Field Super-Resolution with Efficient Subspace Scanning, [[Paper]](https://arxiv.org/pdf/2406.16083.pdf) 711 | - (arXiv 2024.07) Self-Prior Guided Mamba-UNet Networks for Medical Image Super-Resolution, [[Paper]](https://arxiv.org/pdf/2407.05993.pdf) 712 | - (arXiv 2024.07) Deform-Mamba Network for MRI Super-Resolution, [[Paper]](https://arxiv.org/pdf/2407.05969.pdf) 713 | - (arXiv 2024.08) QMambaBSR: Burst Image Super-Resolution with Query State Space Model, [[Paper]](https://arxiv.org/pdf/2408.08665.pdf) 714 | - (arXiv 2024.08) MambaCSR: Dual-Interleaved Scanning for Compressed Image Super-Resolution With SSMs, [[Paper]](https://arxiv.org/pdf/2408.11758.pdf),[[Code]](https://github.com/Event-AHU/MambaEVT) 715 | - (arXiv 2024.10) Hi-Mamba: Hierarchical Mamba for Efficient Image Super-Resolution, [[Paper]](https://arxiv.org/pdf/2410.10140.pdf) 716 | - (arXiv 2024.11) Arbitrary-Scale Super-Resolution via Scaleable State Space Model, 
[[Paper]](https://arxiv.org/pdf/2411.11906.pdf),[[Code]](https://github.com/xiapeizhe12138/S3Mamba-ArbSR) 717 | - (arXiv 2024.12) MPSI: Mamba enhancement model for pixel-wise sequential interaction Image Super-Resolution, [[Paper]](https://arxiv.org/pdf/2412.07222.pdf) 718 | - (arXiv 2025.01) HSRMamba: Contextual Spatial-Spectral State Space Model for Single Hyperspectral Super-Resolution, [[Paper]](https://arxiv.org/pdf/2501.18500.pdf) 719 | - (arXiv 2025.02) MambaLiteSR: Image Super-Resolution with Low-Rank Mamba using Knowledge Distillation, [[Paper]](https://arxiv.org/pdf/2502.14090.pdf) 720 | - (arXiv 2025.03) Burst Image Super-Resolution with Mamba, [[Paper]](https://arxiv.org/pdf/2503.19634.pdf) 721 | - (arXiv 2025.03) Lightweight Light Field Image Super-Resolution with State Space Model, [[Paper]](https://arxiv.org/pdf/2503.19253.pdf) 722 | - (arXiv 2025.06) Self-supervised ControlNet with Spatio-Temporal Mamba for Real-world Video Super-resolution, [[Paper]](https://arxiv.org/pdf/2506.01040.pdf),[[Code]](https://github.com/HaixiaBi1982/ECP_Mamba) 723 | - (arXiv 2025.06) MambaVSR: Content-Aware Scanning State Space Model for Video Super-Resolution, [[Paper]](https://arxiv.org/pdf/2506.11768.pdf) 724 | - (arXiv 2025.06) VSRM: A Robust Mamba-Based Framework for Video Super-Resolution, [[Paper]](https://arxiv.org/pdf/2506.22762.pdf) 725 | - (arXiv 2025.07) GPSMamba: A Global Phase and Spectral Prompt-guided Mamba for Infrared Image Super-Resolution, [[Paper]](https://arxiv.org/pdf/2507.18998.pdf),[[Code]](https://github.com/yongsongH/GPSMamba) 726 | - (arXiv 2025.08) Guided Depth Map Super-Resolution via Multi-Scale Fusion U-shaped Mamba Network, [[Paper]](https://arxiv.org/pdf/2508.00248.pdf) 727 | - (arXiv 2025.08) Trajectory-aware Shifted State Space Models for Online Video Super-Resolution, [[Paper]](https://arxiv.org/pdf/2508.10453.pdf) 728 | - (arXiv 2025.09) First-order State Space Model for Lightweight Image Super-resolution, 
[[Paper]](https://arxiv.org/pdf/2509.08458.pdf) 729 | - (arXiv 2025.09) Exploring Non-Local Spatial-Angular Correlations with a Hybrid Mamba-Transformer Framework for Light Field Super-Resolution, [[Paper]](https://arxiv.org/pdf/2509.04824.pdf) 730 | - (arXiv 2025.09) Gather-Scatter Mamba: Accelerating Propagation with Efficient State Space Model, [[Paper]](https://arxiv.org/pdf/2510.00862.pdf),[[Code]](https://github.com/Ko-Lani/GSMamba) 731 | 732 | ### T2V 733 | - (arXiv 2025.06) M4V: Multi-Modal Mamba for Text-to-Video Generation, [[Paper]](https://arxiv.org/pdf/2506.10915.pdf),[[Code]](https://github.com/huangjch526/M4V) 734 | 735 | ### Tracking 736 | - (arXiv 2024.05) Mamba-FETrack: Frame-Event Tracking via State Space Model, [[Paper]](https://arxiv.org/pdf/2404.18174.pdf),[[Code]](https://github.com/Event-AHU/Mamba_FETrack) 737 | - (arXiv 2024.08) RGBT Tracking via All-layer Multimodal Interactions with Progressive Fusion Mamba, [[Paper]](https://arxiv.org/pdf/2408.08827.pdf) 738 | - (arXiv 2024.08) MambaEVT: Event Stream based Visual Object Tracking using State Space Model, [[Paper]](https://arxiv.org/pdf/2408.10487.pdf),[[Code]](https://github.com/Event-AHU/Mamba_FETrack) 739 | - (arXiv 2024.08) MambaTrack: A Simple Baseline for Multiple Object Tracking with State Space Model, [[Paper]](https://arxiv.org/pdf/2408.09178.pdf) 740 | - (arXiv 2024.09) FMRFT: Fusion Mamba and DETR for Query Time Sequence Intersection Fish Tracking, [[Paper]](https://arxiv.org/pdf/2409.01148.pdf) 741 | - (arXiv 2024.10) Samba: Synchronized Set-of-Sequences Modeling for Multiple Object Tracking, [[Paper]](https://arxiv.org/pdf/2410.01806.pdf),[[Code]](https://github.com/mattiasegu/sambamotr) 742 | - (arXiv 2024.11) MambaXCTrack: Mamba-based Tracker with SSM Cross-correlation and Motion Prompt for Ultrasound Needle Tracking, [[Paper]](https://arxiv.org/pdf/2411.08395.pdf) 743 | - (arXiv 2024.12) MambaNUT: Nighttime UAV Tracking via Mamba and Adaptive Curriculum Learning, 
[[Paper]](https://arxiv.org/pdf/2412.00626.pdf) 744 | - (arXiv 2024.12) MambaLCT: Boosting Tracking via Long-term Context State Space Model, [[Paper]](https://arxiv.org/pdf/2412.13615.pdf),[[Code]](https://github.com/GXNU-ZhongLab/MambaLCT) 745 | - (arXiv 2024.12) Robust Tracking via Mamba-based Context-aware Token Learning, [[Paper]](https://arxiv.org/pdf/2412.13611.pdf),[[Code]](https://github.com/GXNU-ZhongLab/TemTrack) 746 | - (arXiv 2025.04) S3MOT: Monocular 3D Object Tracking with Selective State Space Model, [[Paper]](https://arxiv.org/pdf/2504.18068.pdf),[[Code]](https://github.com/bytepioneerX/s3mot)  747 | - (arXiv 2025.05) SMMT: Siamese Motion Mamba with Self-attention for Thermal Infrared Target Tracking, [[Paper]](https://arxiv.org/pdf/2505.04088.pdf) 748 | - (arXiv 2025.06) SportMamba: Adaptive Non-Linear Multi-Object Tracking with State Space Models for Team Sports, [[Paper]](https://arxiv.org/pdf/2506.03335.pdf) 749 | - (arXiv 2025.06) Mamba-FETrack V2: Revisiting State Space Model for Frame-Event based Visual Object Tracking, [[Paper]](https://arxiv.org/pdf/2510.17860.pdf) 750 | 751 | ### TTA 752 | - (arXiv 2024.07) Test-Time Adaptation with State-Space Models, [[Paper]](https://arxiv.org/pdf/2407.12492.pdf) 753 | 754 | ### UAV 755 | - (arXiv 2025.09) DEPF: A UAV Multispectral Object Detector with Dual-Domain Enhancement and Priority-Guided Mamba Fusion, [[Paper]](https://arxiv.org/pdf/2509.07327.pdf) 756 | - (arXiv 2025.10) DMTrack: Deformable State-Space Modeling for UAV Multi-Object Tracking with Kalman Fusion and Uncertainty-Aware Association, [[Paper]](https://arxiv.org/pdf/2509.07327.pdf) 757 | 758 | ### Video 759 | - (arXiv 2024.03) VideoMamba: State Space Model for Efficient Video Understanding, [[Paper]](https://arxiv.org/pdf/2403.06977.pdf),[[Code]](https://github.com/OpenGVLab/VideoMamba) 760 | - (arXiv 2024.03) Video Mamba Suite: State Space Model as a Versatile Alternative for Video Understanding, 
[[Paper]](https://arxiv.org/pdf/2403.09626.pdf),[[Code]](https://github.com/OpenGVLab/video-mamba-suite) 761 | - (arXiv 2024.03) SSM Meets Video Diffusion Models: Efficient Video Generation with Structured State Spaces, [[Paper]](https://arxiv.org/pdf/2403.07711.pdf),[[Code]](https://github.com/shim0114/SSM-Meets-Video-Diffusion-Models) 762 | - (arXiv 2024.04) SpikeMba: Multi-Modal Spiking Saliency Mamba for Temporal Video Grounding, [[Paper]](https://arxiv.org/pdf/2404.01174.pdf) 763 | - (arXiv 2024.05) Matten: Video Generation with Mamba-Attention, [[Paper]](https://arxiv.org/pdf/2405.03025.pdf) 764 | - (arXiv 2024.05) MAMBA4D: Efficient Long-Sequence Point Cloud Video Understanding with Disentangled Spatial-Temporal State Space Models, [[Paper]](https://arxiv.org/pdf/2405.14338.pdf) 765 | - (arXiv 2024.06) VideoMambaPro: A Leap Forward for Mamba in Video Understanding, [[Paper]](https://arxiv.org/pdf/2406.19006.pdf),[[Code]](https://github.com/hotfinda/VideoMambaPro) 766 | - (arXiv 2024.07) VFIMamba: Video Frame Interpolation with State Space Models, [[Paper]](https://arxiv.org/pdf/2407.02315.pdf) 767 | - (arXiv 2024.07) VideoMamba: Spatio-Temporal Selective State Space Model, [[Paper]](https://arxiv.org/pdf/2407.08476.pdf),[[Code]](https://github.com/jinyjelly/VideoMamba) 768 | - (arXiv 2024.07) DemMamba: Alignment-free Raw Video Demoireing with Frequency-assisted Spatio-Temporal Mamba, [[Paper]](https://arxiv.org/pdf/2408.10679.pdf) 769 | - (arXiv 2024.12) Look Every Frame All at Once: Video-Ma2mba for Efficient Long-form Video Understanding with Multi-Axis Gradient Checkpointing, [[Paper]](https://arxiv.org/pdf/2411.19460.pdf),[[Code]](https://ivy-lvlm.github.io/Video-MA2MBA/) 770 | - (arXiv 2024.12) Efficient Self-Supervised Video Hashing with Selective State Spaces, [[Paper]](https://arxiv.org/pdf/2412.14518.pdf),[[Code]](https://github.com/gimpong/AAAI25-S5VH) 771 | - (arXiv 2025.04) Learning Phase Distortion with Selective State Space Models for Video 
Turbulence Mitigation, [[Paper]](https://arxiv.org/pdf/2504.02697.pdf),[[Code]](https://github.com/xg416/MambaTM) 772 | - (arXiv 2025.04) Audio-visual Controlled Video Diffusion with Masked Selective State Spaces Modeling for Natural Talking Head Generation, [[Paper]](https://arxiv.org/pdf/2504.02542.pdf),[[Code]](https://github.com/harlanhong/ACTalker) 773 | - (arXiv 2025.06) Dual Branch VideoMamba with Gated Class Token Fusion for Violence Detection, [[Paper]](https://arxiv.org/pdf/2506.03162.pdf) 774 | - (arXiv 2025.06) MLVTG: Mamba-Based Feature Alignment and LLM-Driven Purification for Multi-Modal Video Temporal Grounding, [[Paper]](https://arxiv.org/pdf/2506.08512.pdf) 775 | - (arXiv 2025.06) MoMa: Modulating Mamba for Adapting Image Foundation Models to Video Recognition, [[Paper]](https://arxiv.org/pdf/2506.23283.pdf) 776 | 777 | ### VQA 778 | - (arXiv 2025.08) Text-conditioned State Space Model For Domain-generalized Change Detection Visual Question Answering, [[Paper]](https://arxiv.org/pdf/2508.08974.pdf),[[Code]](https://github.com/Elman295/TCSSM) 779 | 780 | ### Zero-Shot Learning 781 | - (arXiv 2024.08) ZeroMamba: Exploring Visual State Space Model for Zero-Shot Learning, [[Paper]](https://arxiv.org/pdf/2408.14868.pdf),[[Code]](https://anonymous.4open.science/r/ZeroMamba/README.md) 782 | 783 | ### Other 784 | - (arXiv 2024.02) Pan-Mamba: Effective pan-sharpening with State Space Model, [[Paper]](https://arxiv.org/pdf/2402.12192.pdf),[[Code]](https://github.com/alexhe101/Pan-Mamba) 785 | - (arXiv 2024.04) InsectMamba: Insect Pest Classification with State Space Model, [[Paper]](https://arxiv.org/pdf/2404.03611.pdf) 786 | - (arXiv 2024.08) MambaDS: Near-Surface Meteorological Field Downscaling with Topography Constrained Selective State Space Modeling, [[Paper]](https://arxiv.org/pdf/2408.10854.pdf) 787 | - (arXiv 2024.08) ColorMamba: Towards High-quality NIR-to-RGB Spectral Translation with Mamba, 
[[Paper]](https://arxiv.org/pdf/2408.08087.pdf),[[Code]](https://github.com/AlexYangxx/ColorMamba) 788 | - (arXiv 2024.10) ECMamba: Consolidating Selective State Space Model with Retinex Guidance for Efficient Multiple Exposure Correction, [[Paper]](https://arxiv.org/pdf/2410.21535.pdf),[[Code]](https://github.com/LowlevelAI/ECMamba) 789 | - (arXiv 2024.11) RAWMamba: Unified sRGB-to-RAW De-rendering With State Space Model, [[Paper]](https://arxiv.org/pdf/2411.11717.pdf) 790 | - (arXiv 2024.12) Image Forgery Localization with State Space Models, [[Paper]](https://arxiv.org/pdf/2412.11214.pdf) 791 | - (arXiv 2025.01) Detail Matters: Mamba-Inspired Joint Unfolding Network for Snapshot Spectral Compressive Imaging, [[Paper]](https://arxiv.org/pdf/2501.01262.pdf) 792 | - (arXiv 2025.02) TransMamba: Fast Universal Architecture Adaption from Transformers to Mamba, [[Paper]](https://arxiv.org/pdf/2502.15130.pdf) 793 | - (arXiv 2025.03) HiCMamba: Enhancing Hi-C Resolution and Identifying 3D Genome Structures with State Space Modeling, [[Paper]](https://arxiv.org/pdf/2503.10713.pdf) 794 | - (arXiv 2025.03) BS-Mamba for Black-Soil Area Detection On the Qinghai-Tibetan Plateau, [[Paper]](https://arxiv.org/pdf/2503.12495.pdf) 795 | - (arXiv 2025.06) RiverMamba: A State Space Model for Global River Discharge and Flood Forecasting, [[Paper]](https://arxiv.org/pdf/2505.22535.pdf) 796 | - (arXiv 2025.06) RAUM-Net: Regional Attention and Uncertainty-aware Mamba Network, [[Paper]](https://arxiv.org/pdf/2506.21905.pdf),[[Code]](https://github.com/wxqnl/RAUM) 797 | - (arXiv 2025.08) DeflareMamba: Hierarchical Vision Mamba for Contextually Consistent Lens Flare Removal, [[Paper]](https://arxiv.org/pdf/2508.02113.pdf),[[Code]](https://github.com/BNU-ERC-ITEA/DeflareMamba) 798 | - (arXiv 2025.08) RoadMamba: A Dual Branch Visual State Space Model for Road Surface Classification, [[Paper]](https://arxiv.org/pdf/2508.01210.pdf) 799 | - (arXiv 2025.08) Guiding WaveMamba with Frequency Maps 
for Image Debanding, [[Paper]](https://arxiv.org/pdf/2508.11331.pdf),[[Code]](https://github.com/xinyiW915/Debanding-PCS2025) 800 | - (arXiv 2025.08) D2-Mamba: Dual-Scale Fusion and Dual-Path Scanning with SSMs for Shadow Removal, [[Paper]](https://arxiv.org/pdf/2508.12750.pdf),[[Code]](https://github.com/xinyiW915/Debanding-PCS2025) 801 | - (arXiv 2025.09) CD-Mamba: Cloud detection with long-range spatial dependency modeling, [[Paper]](https://arxiv.org/pdf/2509.04729.pdf),[[Code]](https://github.com/kunzhan/CD-Mamba) 802 | - (arXiv 2025.09) FLARE-SSM: Deep State Space Models with Influence-Balanced Loss for 72-Hour Solar Flare Prediction, [[Paper]](https://arxiv.org/pdf/2509.09988.pdf) 803 | 804 | ## Contact & Feedback 805 | If you have any suggestions about this project, feel free to contact me. 806 | - [e-mail: yzhangcst[at]gmail.com] 807 | 808 | --------------------------------------------------------------------------------