└── README.md /README.md: -------------------------------------------------------------------------------- 1 | # Awesome-Vision-Mamba 2 | ✨✨Latest Papers on Vision Mamba and Related Areas 3 | 4 | ## Survey 5 | - A Survey on Mamba Architecture for Vision Applications [[arxiv]](https://arxiv.org/pdf/2502.07161) 6 | - Mamba in Vision: A Comprehensive Survey of Techniques and Applications [[arxiv]](https://arxiv.org/pdf/2410.03105) 7 | - Vision Mamba: A Comprehensive Survey and Taxonomy [[arxiv]](https://arxiv.org/pdf/2405.04404) 8 | - A Survey on Vision Mamba: Models, Applications and Challenges [[arxiv]](https://arxiv.org/pdf/2404.18861) 9 | - Mamba-360: Survey of State Space Models as Transformer Alternative for Long Sequence Modelling: Methods, Applications, and Challenges [[arxiv]](https://arxiv.org/pdf/2404.16112) 10 | - A Survey on Visual Mamba [[arxiv]](https://arxiv.org/pdf/2404.15956.pdf) 11 | - State Space Model for New-Generation Network Alternative to Transformers: A Survey [[arxiv]](https://arxiv.org/pdf/2404.09516.pdf) 12 | 13 | ## Computer Vision 14 | - DynamicVis: An Efficient and General Visual Foundation Model for Remote Sensing Image Understanding [[arxiv]](https://arxiv.org/pdf/2503.16426) [[code]](https://github.com/KyanChen/DynamicVis) 15 | - ACMamba: Fast Unsupervised Anomaly Detection via An Asymmetrical Consensus State Space Model [[arxiv]](https://arxiv.org/pdf/2504.11781) 16 | - VideoMAP: Toward Scalable Mamba-based Video Autoregressive Pretraining [[arxiv]](https://arxiv.org/pdf/2503.12332) [[code]](https://github.com/yunzeliu/MAP) 17 | - MaTVLM: Hybrid Mamba-Transformer for Efficient Vision-Language Modeling [[arxiv]](https://arxiv.org/pdf/2503.13440) [[code]](https://github.com/hustvl/MaTVLM) 18 | - VAMBA: Understanding Hour-Long Videos with Hybrid Mamba-Transformers [[arxiv]](https://arxiv.org/pdf/2503.11579) [[code]](https://tiger-ai-lab.github.io/Vamba/) 19 | - OmniMamba: Efficient and Unified Multimodal Understanding and Generation 
via State Space Models [[arxiv]](https://arxiv.org/pdf/2503.08686) [[code]](https://github.com/hustvl/OmniMamba) 20 | - ViM-VQ: Efficient Post-Training Vector Quantization for Visual Mamba [[arxiv]](https://arxiv.org/pdf/2503.09509) 21 | - MambaFlow: A Novel and Flow-guided State Space Model for Scene Flow Estimation [[arxiv]](https://arxiv.org/pdf/2502.16907) [[code]](https://github.com/SCNU-RISLAB/MambaFlow) 22 | - TransMamba: Fast Universal Architecture Adaption from Transformers to Mamba [[arxiv]](https://arxiv.org/pdf/2502.15130) 23 | - DAMamba: Vision State Space Model with Dynamic Adaptive Scan [[arxiv]](https://arxiv.org/pdf/2502.12627) [[code]](https://github.com/ltzovo/DAMamba) 24 | - Multimodal Mamba: Decoder-only Multimodal State Space Model via Quadratic to Linear Distillation [[arxiv]](https://arxiv.org/pdf/2502.13145) [[code]](https://github.com/hustvl/mmMamba) 25 | - Hybrid State-Space and GRU-based Graph Tokenization Mamba for Hyperspectral Image Classification [[arxiv]](https://arxiv.org/pdf/2502.06427) 26 | - CMamba: Learned Image Compression with State Space Models [[arxiv]](https://arxiv.org/pdf/2502.04988) 27 | - DCT-Mamba3D: Spectral Decorrelation and Spatial-Spectral Feature Extraction for Hyperspectral Image Classification [[arxiv]](https://arxiv.org/pdf/2502.01986) 28 | - Surface Vision Mamba: Leveraging Bidirectional State Space Model for Efficient Spherical Manifold Representation [[arxiv]](https://arxiv.org/pdf/2501.14679) 29 | - Mixture-of-Mamba: Enhancing Multi-Modal State-Space Models with Modality-Aware Sparsity [[arxiv]](https://arxiv.org/pdf/2501.16295) [[code]](https://github.com/Weixin-Liang/Mixture-of-Mamba) 30 | - CD-Lamba: Boosting Remote Sensing Change Detection via a Cross-Temporal Locally Adaptive State Space Model [[arxiv]](https://arxiv.org/pdf/2501.15455) [[code]](https://github.com/xwmaxwma/rschange) 31 | - QMamba: Post-Training Quantization for Vision State Space Models [[arxiv]](https://arxiv.org/pdf/2501.13624) 32 | 
- SMamba: Sparse Mamba for Event-based Object Detection [[arxiv]](https://arxiv.org/pdf/2501.11971) [[code]](https://github.com/Zizzzzzzz/SMamba) 33 | - WMamba: Wavelet-based Mamba for Face Forgery Detection [[arxiv]](https://arxiv.org/pdf/2501.09617) 34 | - MANTA: Diffusion Mamba for Efficient and Effective Stochastic Long-Term Dense Anticipation [[arxiv]](https://arxiv.org/pdf/2501.08837) 35 | - AVS-Mamba: Exploring Temporal and Multi-modal Mamba for Audio-Visual Segmentation [[arxiv]](https://arxiv.org/pdf/2501.07810) [[code]](https://github.com/SitongGong/AVS-Mamba) 36 | - Skip Mamba Diffusion for Monocular 3D Semantic Scene Completion [[arxiv]](https://arxiv.org/pdf/2501.07260) [[code]](https://github.com/xrkong/skimba) 37 | - Mamba-MOC: A Multicategory Remote Object Counting via State Space Model [[arxiv]](https://arxiv.org/pdf/2501.06697) [[code]](https://github.com/lp-094/Mamba-MOC) 38 | - MS-Temba: Multi-Scale Temporal Mamba for Efficient Temporal Action Detection [[arxiv]](https://arxiv.org/pdf/2501.06138) [[code]](https://github.com/thearkaprava/MS-Temba) 39 | - MambaHSI: Spatial-Spectral Mamba for Hyperspectral Image Classification [[arxiv]](https://arxiv.org/pdf/2501.04944) [[code]](https://github.com/li-yapeng/MambaHSI) 40 | - Detail Matters: Mamba-Inspired Joint Unfolding Network for Snapshot Spectral Compressive Imaging [[arxiv]](https://arxiv.org/pdf/2501.01262) [[code]](https://github.com/Mengjie-s/MiJUN) 41 | - DepthMamba with Adaptive Fusion [[arxiv]](https://arxiv.org/pdf/2412.19964) 42 | - MaIR: A Locality- and Continuity-Preserving Mamba for Image Restoration [[arxiv]](https://arxiv.org/pdf/2412.20066) 43 | - MambaVO: Deep Visual Odometry Based on Sequential Matching Refinement and Training Smoothing [[arxiv]](https://arxiv.org/pdf/2412.20082) 44 | - STNMamba: Mamba-based Spatial-Temporal Normality Learning for Video Anomaly Detection [[arxiv]](https://arxiv.org/pdf/2412.20084) 45 | - PTQ4VM: Post-Training Quantization for Visual Mamba 
[[arxiv]](https://arxiv.org/pdf/2412.20386) [[code]](https://github.com/YoungHyun197/ptq4vm) 46 | - Enhancing Online Continual Learning with Plug-and-Play State Space Model and Class-Conditional Mixture of Discretization [[arxiv]](https://arxiv.org/pdf/2412.18177) 47 | - COMO: Cross-Mamba Interaction and Offset-Guided Fusion for Multimodal Object Detection [[arxiv]](https://arxiv.org/pdf/2412.18076) [[code]](https://github.com/luluyuu/COMO) 48 | - V“Mean”ba: Visual State Space Models only need 1 hidden dimension [[arxiv]](https://arxiv.org/pdf/2412.16602) 49 | - FlowMamba: Learning Point Cloud Scene Flow with Global Motion Propagation [[arxiv]](https://arxiv.org/pdf/2412.17366) 50 | - Empathetic Response in Audio-Visual Conversations Using Emotion Preference Optimization and MambaCompressor [[arxiv]](https://arxiv.org/pdf/2412.17572) 51 | - Trusted Mamba Contrastive Network for Multi-View Clustering [[arxiv]](https://arxiv.org/pdf/2412.16487) 52 | - Multi-dimensional Visual Prompt Enhanced Image Restoration via Mamba-Transformer Aggregation [[arxiv]](https://arxiv.org/pdf/2412.15845) [[code]](https://github.com/12138-chr/MTAIR) 53 | - Mamba2D: A Natively Multi-Dimensional State-Space Model for Vision Tasks [[arxiv]](https://arxiv.org/pdf/2412.16146) 54 | - Efficient Self-Supervised Video Hashing with Selective State Spaces [[arxiv]](https://arxiv.org/pdf/2412.14518) [[code]](https://github.com/gimpong/AAAI25-S5VH) 55 | - Robust Tracking via Mamba-based Context-aware Token Learning [[arxiv]](https://arxiv.org/pdf/2412.13611) [[code]](https://github.com/GXNU-ZhongLab/TemTrack) 56 | - MambaLCT: Boosting Tracking via Long-term Context State Space Model [[arxiv]](https://arxiv.org/pdf/2412.13615) [[code]](https://github.com/GXNU-ZhongLab/MambaLCT) 57 | - Faster Vision Mamba is Rebuilt in Minutes via Merged Token Re-training [[arxiv]](https://arxiv.org/pdf/2412.12496) 58 | - MambaPro: Multi-Modal Object Re-Identification with Mamba Aggregation and Synergistic Prompt 
[[arxiv]](https://arxiv.org/pdf/2412.10707) [[code]](https://github.com/924973292/MambaPro) 59 | - SegMAN: Omni-scale Context Modeling with State Space Models and Local Attention for Semantic Segmentation [[arxiv]](https://arxiv.org/pdf/2412.11890) [[code]](https://github.com/yunxiangfu2001/SegMAN) 60 | - Image Forgery Localization with State Space Models [[arxiv]](https://arxiv.org/pdf/2412.11214) 61 | - XYScanNet: An Interpretable State Space Model for Perceptual Image Deblurring [[arxiv]](https://arxiv.org/pdf/2412.10338) 62 | - Selective Visual Prompting in Vision Mamba [[arxiv]](https://arxiv.org/pdf/2412.08947) [[code]](https://github.com/zhoujiahuan1991/AAAI2025-SVP) 63 | - MPSI: Mamba enhancement model for pixel-wise sequential interaction Image Super-Resolution [[arxiv]](https://arxiv.org/pdf/2412.07222) 64 | - LOMA: Language-assisted Semantic Occupancy Network via Triplane Mamba [[arxiv]](https://arxiv.org/pdf/2412.08388) 65 | - Manta: Enhancing Mamba for Few-Shot Action Recognition of Long Sub-Sequence [[arxiv]](https://arxiv.org/pdf/2412.07481) [[code]](https://github.com/wenbohuang1002/Manta) 66 | - MSCrackMamba: Leveraging Vision Mamba for Crack Detection in Fused Multispectral Imagery [[arxiv]](https://arxiv.org/pdf/2412.06211) 67 | - MambaNUT: Nighttime UAV Tracking via Mamba and Adaptive Curriculum Learning [[arxiv]](https://arxiv.org/pdf/2412.00626) 68 | - AlignMamba: Enhancing Multimodal Mamba with Local and Global Cross-modal Alignment [[arxiv]](https://arxiv.org/pdf/2412.00833) 69 | - MamKPD: A Simple Mamba Baseline for Real-Time 2D Keypoint Detection [[arxiv]](https://arxiv.org/pdf/2412.01422) 70 | - Vision Mamba Distillation for Low-resolution Fine-grained Image Classification [[arxiv]](https://arxiv.org/pdf/2411.17980) [[code]](https://github.com/boa2004plaust/ViMD) 71 | - BadScan: An Architectural Backdoor Attack on Visual State Space Models [[arxiv]](https://arxiv.org/pdf/2411.17283) 72 | - FTMoMamba: Motion Generation with Frequency and 
Text State Space Models [[arxiv]](https://arxiv.org/pdf/2411.17532) 73 | - TinyViM: Frequency Decoupling for Tiny Hybrid Vision Mamba [[arxiv]](https://arxiv.org/pdf/2411.17473) [[code]](https://github.com/xwmaxwma/TinyViM) 74 | - Deformable Mamba for Wide Field of View Segmentation [[arxiv]](https://arxiv.org/pdf/2411.16481) [[code]](https://github.com/JieHu1996/DeformableMamba) 75 | - MobileMamba: Lightweight Multi-Receptive Visual Mamba Network [[arxiv]](https://arxiv.org/pdf/2411.15941) [[code]](https://github.com/lewandofskee/MobileMamba) 76 | - MambaTrack: Exploiting Dual-Enhancement for Night UAV Tracking [[arxiv]](https://arxiv.org/pdf/2411.15761) 77 | - Mamba-CL: Optimizing Selective State Space Model in Null Space for Continual Learning [[arxiv]](https://arxiv.org/pdf/2411.15469) 78 | - MambaVLT: Time-Evolving Multimodal State Space Model for Vision-Language Tracking [[arxiv]](https://arxiv.org/pdf/2411.15459) 79 | - Event USKT : U-State Space Model in Knowledge Transfer for Event Cameras [[arxiv]](https://arxiv.org/pdf/2411.15276) 80 | - EfficientViM: Efficient Vision Mamba with Hidden State Mixer based State Space Duality [[arxiv]](https://arxiv.org/pdf/2411.15241) [[code]](https://github.com/mlvlab/EfficientViM) 81 | - OSMamba: Omnidirectional Spectral Mamba with Dual-Domain Prior Generator for Exposure Correction [[arxiv]](https://arxiv.org/pdf/2411.15255) 82 | - MambaIRv2: Attentive State Space Restoration [[arxiv]](https://arxiv.org/pdf/2411.15269) [[code]](https://github.com/csguoh/MambaIR) 83 | - MambaDETR: Query-based Temporal Modeling using State Space Model for Multi-View 3D Object Detection [[arxiv]](https://arxiv.org/pdf/2411.13628) 84 | - M3D: Dual-Stream Selective State Spaces and Depth-Driven Framework for High-Fidelity Single-View 3D Reconstruction [[arxiv]](https://arxiv.org/pdf/2411.12635) [[code]](https://github.com/AnnnnnieZhang/M3D) 85 | - S3Mamba: Arbitrary-Scale Super-Resolution via Scaleable State Space Model 
[[arxiv]](https://arxiv.org/pdf/2411.11906) [[code]](https://github.com/xiapeizhe12138/S3Mamba-ArbSR) 86 | - RAWMamba: Unified sRGB-to-RAW De-rendering With State Space Model [[arxiv]](https://arxiv.org/pdf/2411.11717) 87 | - MambaPEFT: Exploring Parameter-Efficient Fine-Tuning for Mamba [[arxiv]](https://arxiv.org/pdf/2411.03855) 88 | - ShadowMamba: State-Space Model with Boundary-Region Selective Scan for Shadow Removal [[arxiv]](https://arxiv.org/pdf/2411.03260) 89 | - Adaptive Multi Scale Document Binarisation Using Vision Mamba [[arxiv]](https://arxiv.org/pdf/2410.22811) 90 | - ECMamba: Consolidating Selective State Space Model with Retinex Guidance for Efficient Multiple Exposure Correction [[arxiv]](https://arxiv.org/pdf/2410.21535) [[code]](https://github.com/LowlevelAI/ECMamba) 91 | - SpikMamba: When SNN meets Mamba in Event-based Human Action Recognition [[arxiv]](https://arxiv.org/pdf/2410.16746) [[code]](https://github.com/Typistchen/SpikMamba) 92 | - MambaSOD: Dual Mamba-Driven Cross-Modal Fusion Network for RGB-D Salient Object Detection [[arxiv]](https://arxiv.org/pdf/2410.15015) [[code]](https://github.com/YueZhan721/MambaSOD) 93 | - Spatial-Mamba: Effective Visual State Space Models via Structure-Aware State Fusion [[arxiv]](https://arxiv.org/pdf/2410.15091) [[code]](https://github.com/EdwardChasel/Spatial-Mamba) 94 | - MBPU: A Plug-and-Play State Space Model for Point Cloud Upsamping with Fast Point Rendering [[arxiv]](https://arxiv.org/pdf/2410.15941) 95 | - START: A Generalized State Space Model with Saliency-Driven Token-Aware Transformation [[arxiv]](https://arxiv.org/pdf/2410.16020) [[code]](https://github.com/lingeringlight/START) 96 | - MambaSCI: Efficient Mamba-UNet for Quad-Bayer Patterned Video Snapshot Compressive Imaging [[arxiv]](https://arxiv.org/pdf/2410.14214) [[code]](https://github.com/PAN083/MambaSCI) 97 | - RemoteDet-Mamba: A Hybrid Mamba-CNN Network for Multi-modal Object Detection in Remote Sensing Images 
[[arxiv]](https://arxiv.org/pdf/2410.13532) 98 | - MambaPainter: Neural Stroke-Based Rendering in a Single Step [[arxiv]](https://arxiv.org/pdf/2410.12524) [[code]](https://github.com/STomoya/MambaPainter) 99 | - MambaBEV: An efficient 3D detection model with Mamba2 [[arxiv]](https://arxiv.org/pdf/2410.12673) 100 | - Hi-Mamba: Hierarchical Mamba for Efficient Image Super-Resolution [[arxiv]](https://arxiv.org/pdf/2410.10140) 101 | - GlobalMamba: Global Image Serialization for Vision Mamba [[arxiv]](https://arxiv.org/pdf/2410.10316) [[code]](https://github.com/wangck20/GlobalMamba) 102 | - V2M: Visual 2-Dimensional Mamba for Image Representation Learning [[arxiv]](https://arxiv.org/pdf/2410.10382) [[code]](https://github.com/wangck20/V2M) 103 | - CountMamba: Exploring Multi-directional Selective State-Space Models for Plant Counting [[arxiv]](https://arxiv.org/pdf/2410.07528) 104 | - MatMamba: A Matryoshka State Space Model [[arxiv]](https://arxiv.org/pdf/2410.06718) [[code]](https://github.com/ScaledFoundations/MatMamba) 105 | - EMMA: Empowering Multi-modal Mamba with Structural and Hierarchical Alignment [[arxiv]](https://arxiv.org/pdf/2410.05938) 106 | - Monocular Visual Place Recognition in LiDAR Maps via Cross-Modal State Space Model and Multi-View Matching [[arxiv]](https://arxiv.org/pdf/2410.06285) 107 | - QuadMamba: Learning Quadtree-based Selective Scan for Visual State Space Model [[arxiv]](https://arxiv.org/pdf/2410.06806) [[code]](https://github.com/VISIONSJTU/QuadMamba) 108 | - HRVMamba: High-Resolution Visual State Space Model for Dense Prediction [[arxiv]](https://arxiv.org/pdf/2410.03174) [[code]](https://github.com/zhanghao5201/HRVMamba) 109 | - Mamba Capsule Routing Towards Part-Whole Relational Camouflaged Object Detection [[arxiv]](https://arxiv.org/pdf/2410.03987) [[code]](https://github.com/Liangbo-Cheng/mamba_capsule) 110 | - IGroupSS-Mamba: Interval Group Spatial-Spectral Mamba for Hyperspectral Image Classification 
[[arxiv]](https://arxiv.org/pdf/2410.05100) 111 | - Samba: Synchronized Set-of-Sequences Modeling for Multiple Object Tracking [[arxiv]](https://arxiv.org/pdf/2410.01806) [[code]](https://sambamotr.github.io/) 112 | - Exploring Token Pruning in Vision State Space Models [[arxiv]](https://arxiv.org/pdf/2409.18962) 113 | - Hybrid Mamba for Few-Shot Segmentation [[arxiv]](https://arxiv.org/pdf/2409.19613) [[code]](https://github.com/Sam1224/HMNet) 114 | - MaskMamba: A Hybrid Mamba-Transformer Model for Masked Image Generation [[arxiv]](https://arxiv.org/pdf/2409.19937) 115 | - MAP: Unleashing Hybrid Mamba-Transformer Vision Backbone's Potential with Masked Autoregressive Pretraining [[arxiv]](https://arxiv.org/pdf/2410.00871) 116 | - Path-adaptive Spatio-Temporal State Space Model for Event-based Recognition with Arbitrary Duration [[arxiv]](https://arxiv.org/pdf/2409.16953) [[code]](https://vlislab22.github.io/pastssm/) 117 | - DepMamba: Progressive Fusion Mamba for Multimodal Depression Detection [[arxiv]](https://arxiv.org/pdf/2409.15936) [[code]](https://github.com/Jiaxin-Ye/DepMamba) 118 | - GraspMamba: A Mamba-based Language-driven Grasp Detection Framework with Hierarchical Feature Learning [[arxiv]](https://arxiv.org/pdf/2409.14403) 119 | - Mamba Fusion: Learning Actions Through Questioning [[arxiv]](https://arxiv.org/pdf/2409.11513) [[code]](https://github.com/Dongzhikang/MambaVL) 120 | - PhysMamba: Efficient Remote Physiological Measurement with SlowFast Temporal Difference Mamba [[arxiv]](https://arxiv.org/pdf/2409.12031) [[code]](https://github.com/Chaoqi31/PhysMamba) 121 | - SparX: A Sparse Cross-Layer Connection Mechanism for Hierarchical Vision Mamba and Transformer Networks [[arxiv]](https://arxiv.org/pdf/2409.09649) [[code]](https://github.com/LMMMEng/SparX) 122 | - Famba-V: Fast Vision Mamba with Cross-Layer Token Fusion [[arxiv]](https://arxiv.org/pdf/2409.09808) 123 | - Mamba-ST: State Space Model for Efficient Style Transfer 
[[arxiv]](https://arxiv.org/pdf/2409.10385) [[code]](https://github.com/FilippoBotti/MambaST) 124 | - SITSMamba for Crop Classification based on Satellite Image Time Series [[arxiv]](https://arxiv.org/pdf/2409.09673) [[code]](https://github.com/XiaoleiQinn/SITSMamba) 125 | - CoMamba: Real-time Cooperative Perception Unlocked with State Space Models [[arxiv]](https://arxiv.org/pdf/2409.10699) 126 | - Unleashing the Potential of Mamba: Boosting a LiDAR 3D Sparse Detector by Using Cross-Model Knowledge Distillation [[arxiv]](https://arxiv.org/pdf/2409.11018) 127 | - Mamba-YOLO-World: Marrying YOLO-World with Mamba for Open-Vocabulary Detection [[arxiv]](https://arxiv.org/pdf/2409.08513) 128 | - CollaMamba: Efficient Collaborative Perception with Cross-Agent Spatial-Temporal State Space Model [[arxiv]](https://arxiv.org/pdf/2409.07714) 129 | - Mamba Policy: Towards Efficient 3D Diffusion Policy with Hybrid Selective State Models [[arxiv]](https://arxiv.org/pdf/2409.07163) [[code]](https://andycao1125.github.io/mamba_policy/) 130 | - PPMamba: A Pyramid Pooling Local Auxiliary SSM-Based Model for Remote Sensing Image Semantic Segmentation [[arxiv]](https://arxiv.org/pdf/2409.06309) 131 | - Shaking Up VLMs: Comparing Transformers and Structured State Space Models for Vision & Language Modeling [[arxiv]](https://arxiv.org/pdf/2409.05395) 132 | - Mamba-Enhanced Text-Audio-Video Alignment Network for Emotion Recognition in Conversations [[arxiv]](https://arxiv.org/pdf/2409.05243) [[code]](https://github.com/Alena-Xinran/MaTAV) 133 | - Why mamba is effective? 
Exploit Linear Transformer-Mamba Network for Multi-Modality Image Fusion [[arxiv]](https://arxiv.org/pdf/2409.03223) 134 | - UV-Mamba: A DCN-Enhanced State Space Model for Urban Village Boundary Identification in High-Resolution Remote Sensing Images [[arxiv]](https://arxiv.org/pdf/2409.03431) 135 | - Shuffle Mamba: State Space Models with Random Shuffle for Multi-Modal Image Fusion [[arxiv]](https://arxiv.org/pdf/2409.01728) 136 | - FMRFT: Fusion Mamba and DETR for Query Time Sequence Intersection Fish Tracking [[arxiv]](https://arxiv.org/pdf/2409.01148) 137 | - EDCSSM: Edge Detection with Convolutional State Space Model [[arxiv]](https://arxiv.org/pdf/2409.01609) 138 | - Stochastic Layer-Wise Shuffle: A Good Practice to Improve Vision Mamba Training [[arxiv]](https://arxiv.org/pdf/2408.17081) [[code]](https://github.com/huangzizheng01/ShuffleMamba) 139 | - MambaPlace: Text-to-Point-Cloud Cross-Modal Place Recognition with Attention Mamba Mechanisms [[arxiv]](https://arxiv.org/pdf/2408.15740) [[code]](https://github.com/nuozimiaowu/MambaPlace/tree/main) 140 | - ZeroMamba: Exploring Visual State Space Model for Zero-Shot Learning [[arxiv]](https://arxiv.org/pdf/2408.14868) [[code]](https://anonymous.4open.science/r/ZeroMamba) 141 | - MTMamba++: Enhancing Multi-Task Dense Scene Understanding via Mamba-Based Decoders [[arxiv]](https://arxiv.org/pdf/2408.15101) [[code]](https://github.com/EnVision-Research/MTMamba) 142 | - PointDGMamba: Domain Generalization of Point Cloud Classification via Generalized State Space Model [[arxiv]](https://arxiv.org/pdf/2408.13574) 143 | - O-Mamba: O-shape State-Space Model for Underwater Image Enhancement [[arxiv]](https://arxiv.org/pdf/2408.12816) [[code]](https://github.com/chenydong/O-Mamba) 144 | - MambaOcc: Visual State Space Model for BEV-based Occupancy Prediction with Local Adaptive Reordering [[arxiv]](https://arxiv.org/pdf/2408.11464) [[code]](https://github.com/Hub-Tian/MambaOcc) 145 | - UNetMamba: Efficient UNet-Like Mamba 
for Semantic Segmentation of High-Resolution Remote Sensing Images [[arxiv]](https://arxiv.org/pdf/2408.11545) [[code]](https://github.com/EnzeZhu2001/UNetMamba) 146 | - Exploring Robustness of Visual State Space model against Backdoor Attacks [[arxiv]](https://arxiv.org/pdf/2408.11679) 147 | - MambaCSR: Dual-Interleaved Scanning for Compressed Image Super-Resolution With SSMs [[arxiv]](https://arxiv.org/pdf/2408.11758) 148 | - MambaTrack: A Simple Baseline for Multiple Object Tracking with State Space Model [[arxiv]](https://arxiv.org/pdf/2408.09178) 149 | - ExpoMamba: Exploiting Frequency SSM Blocks for Efficient and Effective Image Enhancement [[arxiv]](https://arxiv.org/pdf/2408.09650) [[code]](https://github.com/eashanadhikarla/ExpoMamba) 150 | - MambaLoc: Efficient Camera Localisation via State Space Model [[arxiv]](https://arxiv.org/pdf/2408.09680) 151 | - OccMamba: Semantic Occupancy Prediction with State Space Models [[arxiv]](https://arxiv.org/pdf/2408.09859) 152 | - Multi-Scale Representation Learning for Image Restoration with State-Space Model [[arxiv]](https://arxiv.org/pdf/2408.10145) 153 | - MambaDS: Near-Surface Meteorological Field Downscaling with Topography Constrained Selective State Space Modeling [[arxiv]](https://arxiv.org/pdf/2408.10854) 154 | - MambaEVT: Event Stream based Visual Object Tracking using State Space Model [[arxiv]](https://arxiv.org/pdf/2408.10487) [[code]](https://github.com/Event-AHU/MambaEVT) 155 | - MUSE: Mamba is Efficient Multi-scale Learner for Text-video Retrieval [[arxiv]](https://arxiv.org/pdf/2408.10575) [[code]](https://github.com/hrtang22/MUSE) 156 | - DemMamba: Alignment-free Raw Video Demoireing with Frequency-assisted Spatio-Temporal Mamba [[arxiv]](https://arxiv.org/pdf/2408.10679) 157 | - QMambaBSR: Burst Image Super-Resolution with Query State Space Model [[arxiv]](https://arxiv.org/pdf/2408.08665) 158 | - RGBT Tracking via All-layer Multimodal Interactions with Progressive Fusion Mamba 
[[arxiv]](https://arxiv.org/pdf/2408.08827) 159 | - ColorMamba: Towards High-quality NIR-to-RGB Spectral Translation with Mamba [[arxiv]](https://arxiv.org/pdf/2408.08087) [[code]](https://github.com/AlexYangxx/ColorMamba/) 160 | - MambaVT: Spatio-Temporal Contextual Modeling for robust RGB-T Tracking [[arxiv]](https://arxiv.org/pdf/2408.07889) 161 | - PoseMamba: Monocular 3D Human Pose Estimation with Bidirectional Global-Local Spatio-Temporal State Space Model [[arxiv]](https://arxiv.org/pdf/2408.03540) 162 | - Pose Magic: Efficient and Temporally Consistent Human Pose Estimation with a Hybrid Mamba-GCN Network [[arxiv]](https://arxiv.org/pdf/2408.02922) 163 | - JambaTalk: Speech-Driven 3D Talking Head Generation Based on Hybrid Transformer-Mamba Language Model [[arxiv]](https://arxiv.org/pdf/2408.01627) 164 | - DeMansia: Mamba Never Forgets Any Tokens [[arxiv]](https://arxiv.org/pdf/2408.01986) [[code]](https://github.com/catalpaaa/DeMansia) 165 | - LaMamba-Diff: Linear-Time High-Fidelity Diffusion Models Based on Local Attention and Mamba [[arxiv]](https://arxiv.org/pdf/2408.02615) 166 | - MambaST: A Plug-and-Play Cross-Spectral Spatial-Temporal Fuser for Efficient Pedestrian Detection [[arxiv]](https://arxiv.org/pdf/2408.01037) [[code]](https://github.com/XiangboGaoBarry/MambaST) 167 | - PhysMamba: Leveraging Dual-Stream Cross-Attention SSD for Remote Physiological Measurement [[arxiv]](https://arxiv.org/pdf/2408.01077) 168 | - WaveMamba: Spatial-Spectral Wavelet Mamba for Hyperspectral Image Classification [[arxiv]](https://arxiv.org/pdf/2408.01231) [[code]](https://github.com/mahmad00) 169 | - Wave-Mamba: Wavelet State Space Model for Ultra-High-Definition Low-Light Image Enhancement [[arxiv]](https://arxiv.org/pdf/2408.01276) [[code]](https://github.com/AlexZou14/Wave-Mamba) 170 | - Spatial-Spectral Morphological Mamba for Hyperspectral Image Classification [[arxiv]](https://arxiv.org/pdf/2408.01372) [[code]](https://github.com/MHassaanButt/MorpMamba) 171 | 
- MonoMM: A Multi-scale Mamba-Enhanced Network for Real-time Monocular 3D Object Detection [[arxiv]](https://arxiv.org/pdf/2408.00438) 172 | - RainMamba: Enhanced Locality Learning with State Space Models for Video Deraining [[arxiv]](https://arxiv.org/pdf/2407.21773) [[code]](https://github.com/TonyHongtaoWu/RainMamba) 173 | - ML-Mamba: Efficient Multi-Modal Large Language Model Utilizing Mamba-2 [[arxiv]](https://arxiv.org/pdf/2407.19832) [[code]](https://wenjunhuang94.github.io/ML-Mamba/) 174 | - VSSD: Vision Mamba with Non-Causal State Space Duality [[arxiv]](https://arxiv.org/pdf/2407.18559) [[code]](https://github.com/YuHengsss/VSSD) 175 | - LION: Linear Group RNN for 3D Object Detection in Point Clouds [[arxiv]](https://arxiv.org/pdf/2407.18232) [[code]](https://happinesslz.github.io/projects/LION/) 176 | - ALMRR: Anomaly Localization Mamba on Industrial Textured Surface with Feature Reconstruction and Refinement [[arxiv]](https://arxiv.org/pdf/2407.17705) [[code]](https://github.com/qsc1103/ALMRR) 177 | - Mamba meets crack segmentation [[arxiv]](https://arxiv.org/pdf/2407.15714) 178 | - Serialized Point Mamba: A Serialized Point Cloud Mamba Segmentation Model [[arxiv]](https://arxiv.org/pdf/2407.12319) 179 | - GroupMamba: Parameter-Efficient and Accurate Group Visual State Space Model [[arxiv]](https://arxiv.org/pdf/2407.13772) [[code]](https://github.com/Amshaker/GroupMamba) 180 | - InfiniMotion: Mamba Boosts Memory in Transformer for Arbitrary Long Motion Generation [[arxiv]](https://arxiv.org/pdf/2407.10061) [[code]](https://steve-zeyu-zhang.github.io/InfiniMotion/) 181 | - OPa-Ma: Text Guided Mamba for 360-degree Image Out-painting [[arxiv]](https://arxiv.org/pdf/2407.10923) 182 | - A Mamba-based Siamese Network for Remote Sensing Change Detection [[arxiv]](https://arxiv.org/pdf/2407.06839) [[code]](https://github.com/JayParanjape/M-CD) 183 | - HTD-Mamba: Efficient Hyperspectral Target Detection with Pyramid State Space Model 
[[arxiv]](https://arxiv.org/pdf/2407.06841) [[code]](https://github.com/shendb2022/HTD-Mamba) 184 | - MambaVision: A Hybrid Mamba-Transformer Vision Backbone [[arxiv]](https://arxiv.org/pdf/2407.08083) [[code]](https://github.com/NVlabs/MambaVision) 185 | - DMM: Disparity-guided Multispectral Mamba for Oriented Object Detection in Remote Sensing [[arxiv]](https://arxiv.org/pdf/2407.08132) [[code]](https://github.com/Another-0/DMM) 186 | - GraphMamba: An Efficient Graph Structure Learning Vision Mamba for Hyperspectral Image Classification [[arxiv]](https://arxiv.org/pdf/2407.08255) [[code]](https://github.com/ahappyyang/GraphMamba) 187 | - VideoMamba: Spatio-Temporal Selective State Space Model [[arxiv]](https://arxiv.org/pdf/2407.08476) 188 | - Mamba-FSCIL: Dynamic Adaptation with Selective State Space Model for Few-Shot Class-Incremental Learning [[arxiv]](https://arxiv.org/pdf/2407.06136) [[code]](https://github.com/xiaojieli0903/Mamba-FSCIL) 189 | - QueryMamba: A Mamba-Based Encoder-Decoder Architecture with a Statistical Verb-Noun Interaction Module for Video Action Forecasting @ Ego4D Long-Term Action Anticipation Challenge 2024 [[arxiv]](https://arxiv.org/pdf/2407.04184) 190 | - MTMamba: Enhancing Multi-Task Dense Scene Understanding by Mamba-Based Decoders [[arxiv]](https://arxiv.org/pdf/2407.02228) [[code]](https://github.com/EnVision-Research/MTMamba) 191 | - VFIMamba: Video Frame Interpolation with State Space Models [[arxiv]](https://arxiv.org/pdf/2407.02315) [[code]](https://github.com/MCG-NJU/VFIMamba) 192 | - Mamba or RWKV: Exploring High-Quality and High-Efficiency Segment Anything Model [[arxiv]](https://arxiv.org/pdf/2406.19369) [[code]](https://github.com/HarborYuan/ovsam) 193 | - VideoMambaPro: A Leap Forward for Mamba in Video Understanding [[arxiv]](https://arxiv.org/pdf/2406.19006) [[code]](https://github.com/hotfinda/VideoMambaPro) 194 | - Mamba24/8D: Enhancing Global Interaction in Point Clouds via State Space Model 
[[arxiv]](https://arxiv.org/pdf/2406.17442) 195 | - SUM: Saliency Unification through Mamba for Visual Attention Modeling [[arxiv]](https://arxiv.org/pdf/2406.17815v1) [[code]](https://github.com/Arhosseini77/SUM) 196 | - Vision Mamba-based autonomous crack segmentation on concrete, asphalt, and masonry surfaces [[arxiv]](https://arxiv.org/pdf/2406.16518) 197 | - LFMamba: Light Field Image Super-Resolution with State Space Model [[arxiv]](https://arxiv.org/pdf/2406.12463) 198 | - Voxel Mamba: Group-Free State Space Models for Point Cloud based 3D Object Detection [[arxiv]](https://arxiv.org/pdf/2406.10700) [[code]](https://github.com/gwenzhang/Voxel-Mamba) 199 | - PyramidMamba: Rethinking Pyramid Feature Fusion with Selective Space State Model for Semantic Segmentation of Remote Sensing Imagery [[arxiv]](https://arxiv.org/pdf/2406.10828) [[code]](https://github.com/WangLibo1995/GeoSeg) 200 | - Q-Mamba: On First Exploration of Vision Mamba for Image Quality Assessment [[arxiv]](https://arxiv.org/pdf/2406.09546) 201 | - PixMamba: Leveraging State Space Models in a Dual-Level Architecture for Underwater Image Enhancement [[arxiv]](https://arxiv.org/pdf/2406.08444) [[code]](https://github.com/weitunglin/pixmamba) 202 | - Towards Evaluating the Robustness of Visual State Space Models [[arxiv]](https://arxiv.org/pdf/2406.09407) [[code]](https://github.com/HashmatShadab/MambaRobustness) 203 | - DualMamba: A Lightweight Spectral-Spatial Mamba-Convolution Network for Hyperspectral Image Classification [[arxiv]](https://arxiv.org/pdf/2406.07050) 204 | - Autoregressive Pretraining with Mamba in Vision [[arxiv]](https://arxiv.org/pdf/2406.07537) [[code]](https://github.com/OliverRensu/ARM) 205 | - MambaDepth: Enhancing Long-range Dependency for Self-Supervised Fine-Structured Monocular Depth Estimation [[arxiv]](https://arxiv.org/pdf/2406.04532) [[code]](https://github.com/ionut-grigore99/MambaDepth) 206 | - Efficient 3D Shape Generation via Diffusion Mamba with Bidirectional 
SSMs [[arxiv]](https://arxiv.org/pdf/2406.05038) 207 | - MHS-VM: Multi-Head Scanning in Parallel Subspaces for Vision Mamba [[arxiv]](https://arxiv.org/pdf/2406.05992) [[code]](https://github.com/PixDeep/MHS-VM) 208 | - HDMba: Hyperspectral Remote Sensing Imagery Dehazing with State Space Model [[arxiv]](https://arxiv.org/pdf/2406.05700) [[code]](https://github.com/RsAI-lab/HDMba) 209 | - Mamba YOLO: SSMs-Based YOLO For Object Detection [[arxiv]](https://arxiv.org/pdf/2406.05835) [[code]](https://github.com/HZAI-ZJNU/Mamba-YOLO) 210 | - MVGamba: Unify 3D Content Generation as State Space Sequence Modeling [[arxiv]](https://arxiv.org/pdf/2406.06367) 211 | - RoboMamba: Multimodal State Space Model for Efficient Robot Reasoning and Manipulation [[arxiv]](https://arxiv.org/pdf/2406.04339) [[code]](https://sites.google.com/view/robomamba-web) 212 | - GrootVL: Tree Topology is All You Need in State Space Model [[arxiv]](https://arxiv.org/pdf/2406.02395) [[code]](https://github.com/EasonXiao-888/GrootVL) 213 | - CDMamba: Remote Sensing Image Change Detection with Mamba [[arxiv]](https://arxiv.org/pdf/2406.04207) [[code]](https://github.com/zmoka-zht/CDMamba) 214 | - LLEMamba: Low-Light Enhancement via Relighting-Guided Mamba with Deep Unfolding Network [[arxiv]](https://arxiv.org/pdf/2406.01059) 215 | - Dimba: Transformer-Mamba Diffusion Models [[arxiv]](https://arxiv.org/pdf/2406.01159) [[code]](https://dimba-project.github.io/) 216 | - S4Fusion: Saliency-aware Selective State Space Model for Infrared Visible Image Fusion [[arxiv]](https://arxiv.org/pdf/2405.20881) 217 | - DeMamba: AI-Generated Video Detection on Million-Scale GenVideo Benchmark [[arxiv]](https://arxiv.org/pdf/2405.19707) [[code]](https://github.com/chenhaoxing/DeMamba) 218 | - FourierMamba: Fourier Learning Integration with State Space Models for Image Deraining [[arxiv]](https://arxiv.org/pdf/2405.19450) 219 | - Vim-F: Visual State Space Model Benefiting from Learning in the Frequency Domain 
[[arxiv]](https://arxiv.org/pdf/2405.18679) [[code]](https://github.com/yws-wxs/Vim-F)
- MambaLLIE: Implicit Retinex-Aware Low Light Enhancement with Global-then-Local State Space [[arxiv]](https://arxiv.org/pdf/2405.16105) [[code]](https://mamballie.github.io/anon/)
- Image Deraining with Frequency-Enhanced State Space Model [[arxiv]](https://arxiv.org/pdf/2405.16470)
- Demystify Mamba in Vision: A Linear Attention Perspective [[arxiv]](https://arxiv.org/pdf/2405.16605) [[code]](https://github.com/LeapLabTHU/MLLA)
- MambaVC: Learned Visual Compression with Selective State Spaces [[arxiv]](https://arxiv.org/pdf/2405.15413)
- PoinTramba: A Hybrid Transformer-Mamba Framework for Point Cloud Analysis [[arxiv]](https://arxiv.org/pdf/2405.15463) [[code]](https://github.com/xiaoyao3302/PoinTramba)
- Meteor: Mamba-based Traversal of Rationale for Large Language and Vision Models [[arxiv]](https://arxiv.org/pdf/2405.15574) [[code]](https://github.com/ByungKwanLee/Meteor)
- Multi-Scale VMamba: Hierarchy in Hierarchy Visual State Space Model [[arxiv]](https://arxiv.org/pdf/2405.14174) [[code]](https://github.com/YuHengsss/MSVMamba)
- DiM: Diffusion Mamba for Efficient High-Resolution Image Synthesis [[arxiv]](https://arxiv.org/pdf/2405.14224)
- MAMBA4D: Efficient Long-Sequence Point Cloud Video Understanding with Disentangled Spatial-Temporal State Space Models [[arxiv]](https://arxiv.org/pdf/2405.14338)
- Scalable Visual State Space Model with Fractal Scanning [[arxiv]](https://arxiv.org/pdf/2405.14480)
- Efficient Visual State Space Model for Image Deblurring [[arxiv]](https://arxiv.org/pdf/2405.14343)
- Mamba®: Vision Mamba ALSO Needs Registers [[arxiv]](https://arxiv.org/pdf/2405.14858) [[code]](https://wangf3014.github.io/mambar-page/)
- 3DSS-Mamba: 3D-Spectral-Spatial Mamba for Hyperspectral Image Classification [[arxiv]](https://arxiv.org/pdf/2405.12487)
- Mamba-in-Mamba: Centralized
Mamba-Cross-Scan in Tokenized Mamba Model for Hyperspectral Image Classification [[arxiv]](https://arxiv.org/pdf/2405.12003) [[code]](https://github.com/zhouweilian1904/Mamba-in-Mamba)
- CM-UNet: Hybrid CNN-Mamba UNet for Remote Sensing Image Semantic Segmentation [[arxiv]](https://arxiv.org/pdf/2405.10530) [[code]](https://github.com/XiaoBuL/CM-UNet)
- IRSRMamba: Infrared Image Super-Resolution via Mamba-based Wavelet Transform Feature Modulation Model [[arxiv]](https://arxiv.org/pdf/2405.09873) [[code]](https://github.com/yongsongH/IRSRMamba)
- RSDehamba: Lightweight Vision Mamba for Remote Sensing Satellite Image Dehazing [[arxiv]](https://arxiv.org/pdf/2405.10030)
- WaterMamba: Visual State Space Model for Underwater Image Enhancement [[arxiv]](https://arxiv.org/pdf/2405.08419)
- Rethinking Scanning Strategies with Vision Mamba in Semantic Segmentation of Remote Sensing Imagery: An Experimental Study [[arxiv]](https://arxiv.org/pdf/2405.08493)
- MambaOut: Do We Really Need Mamba for Vision?
[[arxiv]](https://arxiv.org/pdf/2405.07992) [[code]](https://github.com/yuweihao/MambaOut)
- OverlapMamba: Novel Shift State Space Model for LiDAR-based Place Recognition [[arxiv]](https://arxiv.org/pdf/2405.07966)
- Rethinking Efficient and Effective Point-based Networks for Event Camera Classification and Regression: EventMamba [[arxiv]](https://arxiv.org/pdf/2405.06116)
- Frequency-Assisted Mamba for Remote Sensing Image Super-Resolution [[arxiv]](https://arxiv.org/pdf/2405.04964)
- StyleMamba: State Space Model for Efficient Text-driven Image Style Transfer [[arxiv]](https://arxiv.org/pdf/2405.05027)
- VMambaCC: A Visual State Space Model for Crowd Counting [[arxiv]](https://arxiv.org/pdf/2405.03978)
- DVMSR: Distillated Vision Mamba for Efficient Super-Resolution [[arxiv]](https://arxiv.org/pdf/2405.03008) [[code]](https://github.com/nathan66666/DVMSR)
- SMCD: High Realism Motion Style Transfer via Mamba-based Diffusion [[arxiv]](https://arxiv.org/pdf/2405.02844)
- Matten: Video Generation with Mamba-Attention [[arxiv]](https://arxiv.org/pdf/2405.03025)
- Retinexmamba: Retinex-based Mamba for Low-light Image Enhancement [[arxiv]](https://arxiv.org/pdf/2405.03349) [[code]](https://github.com/YhuoyuH/RetinexMamba)
- MemoryMamba: Memory-Augmented State Space Model for Defect Recognition [[arxiv]](https://arxiv.org/pdf/2405.03673)
- SSUMamba: Spatial-Spectral Selective State Space Model for Hyperspectral Image Denoising [[arxiv]](https://arxiv.org/pdf/2405.01726) [[code]](https://github.com/lronkitty/SSUMamba)
- FER-YOLO-Mamba: Facial Expression Detection and Classification Based on Selective State Space [[arxiv]](https://arxiv.org/pdf/2405.01828) [[code]](https://github.com/SwjtuMa/FER-YOLO-Mamba)
- CLIP-Mamba: CLIP Pretrained Mamba Models with OOD and Hessian Evaluation [[arxiv]](https://arxiv.org/pdf/2404.19394) [[code]](https://github.com/raytrun/mamba-clip)
- Mamba-FETrack:
Frame-Event Tracking via State Space Model [[arxiv]](https://arxiv.org/pdf/2404.18174) [[code]](https://github.com/Event-AHU/Mamba_FETrack)
- S2Mamba: A Spatial-spectral State Space Model for Hyperspectral Image Classification [[arxiv]](https://arxiv.org/pdf/2404.18213) [[code]](https://github.com/PURE-melo/S2Mamba)
- Spectral-Spatial Mamba for Hyperspectral Image Classification [[arxiv]](https://arxiv.org/pdf/2404.18401)
- RSCaMa: Remote Sensing Image Change Captioning with State Space Model [[arxiv]](https://arxiv.org/pdf/2404.18895) [[code]](https://github.com/Chen-Yang-Liu/RSCaMa)
- Sparse Reconstruction of Optical Doppler Tomography Based on State Space Model [[arxiv]](https://arxiv.org/pdf/2404.17484)
- CFMW: Cross-modality Fusion Mamba for Multispectral Object Detection under Adverse Weather Conditions [[arxiv]](https://arxiv.org/pdf/2404.16302) [[code]](https://github.com/lhy-zjut/CFMW)
- Mamba3D: Enhancing Local Features for 3D Point Cloud Analysis via State Space Model [[arxiv]](https://arxiv.org/pdf/2404.14966.pdf)
- MambaUIE: Unraveling the Ocean's Secrets with Only 2.8 FLOPs [[arxiv]](https://arxiv.org/pdf/2404.13884.pdf) [[code]](https://github.com/1024AILab/MambaUIE)
- MambaMOS: LiDAR-based 3D Moving Object Segmentation with Motion-aware State Space Model [[arxiv]](https://arxiv.org/pdf/2404.12794.pdf) [[code]](https://github.com/Terminal-K/MambaMOS)
- CU-Mamba: Selective State Space Models with Channel Learning for Image Restoration [[arxiv]](https://arxiv.org/pdf/2404.11778.pdf)
- MambaPupil: Bidirectional Selective Recurrent model for Event-based Eye tracking [[arxiv]](https://arxiv.org/pdf/2404.12083.pdf)
- Text-controlled Motion Mamba: Text-Instructed Temporal Grounding of Human Motion [[arxiv]](https://arxiv.org/pdf/2404.11375.pdf)
- A Novel State Space Model with Local Enhancement and State Sharing for Image Fusion [[arxiv]](https://arxiv.org/pdf/2404.09293.pdf)
- Fusion-Mamba for Cross-modality Object Detection [[arxiv]](https://arxiv.org/pdf/2404.09146.pdf)
- FreqMamba: Viewing Mamba from a Frequency Perspective for Image Deraining [[arxiv]](https://arxiv.org/pdf/2404.09476.pdf)
- HSIDMamba: Exploring Bidirectional State-Space Models for Hyperspectral Denoising [[arxiv]](https://arxiv.org/pdf/2404.09697.pdf)
- MambaDFuse: A Mamba-based Dual-phase Model for Multi-modality Image Fusion [[arxiv]](https://arxiv.org/pdf/2404.08406.pdf)
- SpectralMamba: Efficient Mamba for Hyperspectral Image Classification [[arxiv]](https://arxiv.org/pdf/2404.08489.pdf) [[code]](https://github.com/danfenghong/SpectralMamba)
- Simba: Mamba augmented U-ShiftGCN for Skeletal Action Recognition in Videos [[arxiv]](https://arxiv.org/pdf/2404.07645.pdf)
- DGMamba: Domain Generalization via Generalized State Space Model [[arxiv]](https://arxiv.org/pdf/2404.07794.pdf) [[code]](https://github.com/longshaocong/DGMamba)
- FusionMamba: Efficient Image Fusion with State Space Model [[arxiv]](https://arxiv.org/pdf/2404.07932.pdf)
- MambaAD: Exploring State Space Models for Multi-class Unsupervised Anomaly Detection [[arxiv]](https://arxiv.org/pdf/2404.06564.pdf) [[code]](https://lewandofskee.github.io/projects/MambaAD/)
- 3DMambaComplete: Exploring Structured State Space Model for Point Cloud Completion [[arxiv]](https://arxiv.org/pdf/2404.07106.pdf)
- RhythmMamba: Fast Remote Physiological Measurement with Arbitrary Length Videos [[arxiv]](https://arxiv.org/pdf/2404.06483.pdf) [[code]](https://github.com/zizheng-guo/RhythmMamba)
- Sigma: Siamese Mamba Network for Multi-Modal Semantic Segmentation [[arxiv]](https://arxiv.org/pdf/2404.04256.pdf) [[code]](https://github.com/zifuwan/Sigma)
- ChangeMamba: Remote Sensing Change Detection with Spatio-Temporal State Space Model [[arxiv]](https://arxiv.org/pdf/2404.03425.pdf) [[code]](https://github.com/ChenHongruixuan/MambaCD)
- InsectMamba: Insect Pest Classification with State Space Model [[arxiv]](https://arxiv.org/pdf/2404.03611.pdf)
- RS3Mamba: Visual State Space Model for Remote Sensing Images Semantic Segmentation [[arxiv]](https://arxiv.org/pdf/2404.02457.pdf) [[code]](https://github.com/sstary/SSRS)
- RS-Mamba for Large Remote Sensing Image Dense Prediction [[arxiv]](https://arxiv.org/pdf/2404.02668.pdf) [[code]](https://github.com/walking-shadow/Official_Remote_Sensing_Mamba)
- Samba: Semantic Segmentation of Remotely Sensed Images with State Space Model [[arxiv]](https://arxiv.org/pdf/2404.01705.pdf) [[code]](https://github.com/zhuqinfeng1999/Samba)
- HSIMamba: Hyperpsectral Imaging Efficient Feature Learning with Bidirectional State Space for Classification [[arxiv]](https://arxiv.org/pdf/2404.00272.pdf)
- SpikeMba: Multi-Modal Spiking Saliency Mamba for Temporal Video Grounding [[arxiv]](https://arxiv.org/pdf/2404.01174.pdf)
- MambaMixer: Efficient Selective State Space Models with Dual Token and Channel Selection [[arxiv]](https://arxiv.org/pdf/2403.19888.pdf) [[code]](https://mambamixer.github.io/)
- Aggregating Local and Global Features via Selective State Spaces Model for Efficient Image Deblurring [[arxiv]](https://arxiv.org/pdf/2403.20106.pdf)
- HARMamba: Efficient Wearable Sensor Human Activity Recognition Based on Bidirectional Selective SSM [[arxiv]](https://arxiv.org/pdf/2403.20183.pdf)
- RSMamba: Remote Sensing Image Classification with State Space Model [[arxiv]](https://arxiv.org/pdf/2403.19654.pdf) [[code]](https://github.com/KyanChen/RSMamba)
- Gamba: Marry Gaussian Splatting with Mamba for single view 3D reconstruction [[arxiv]](https://arxiv.org/pdf/2403.18795.pdf)
- Integrating Mamba Sequence Model and Hierarchical Upsampling Network for Accurate Semantic Segmentation of Multiple Sclerosis Legion [[arxiv]](https://arxiv.org/pdf/2403.17432.pdf)
- PlainMamba: Improving Non-Hierarchical
Mamba in Visual Recognition [[arxiv]](https://arxiv.org/pdf/2403.17695.pdf) [[code]](https://github.com/ChenhongyiYang/PlainMamba)
- ReMamber: Referring Image Segmentation with Mamba Twister [[arxiv]](https://arxiv.org/pdf/2403.17839.pdf)
- VMRNN: Integrating Vision Mamba and LSTM for Efficient and Accurate Spatiotemporal Forecasting [[arxiv]](https://arxiv.org/pdf/2403.16536.pdf) [[code]](https://github.com/yyyujintang/VMRNN-PyTorch)
- SiMBA: Simplified Mamba-based Architecture for Vision and Multivariate Time series [[arxiv]](https://arxiv.org/pdf/2403.15360.pdf) [[code]](https://github.com/badripatro/Simba)
- Cobra: Extending Mamba to Multi-Modal Large Language Model for Efficient Inference [[arxiv]](https://arxiv.org/pdf/2403.14520.pdf) [[code]](https://sites.google.com/view/cobravlm)
- VL-Mamba: Exploring State Space Models for Multimodal Learning [[arxiv]](https://arxiv.org/pdf/2403.13600.pdf)
- ZigMa: Zigzag Mamba Diffusion Model [[arxiv]](https://arxiv.org/pdf/2403.13802.pdf) [[code]](https://taohu.me/zigma/)
- VmambaIR: Visual State Space Model for Image Restoration [[arxiv]](https://arxiv.org/pdf/2403.11423.pdf) [[code]](https://github.com/AlphacatPlus/VmambaIR)
- LocalMamba: Visual State Space Model with Windowed Selective Scan [[arxiv]](https://arxiv.org/pdf/2403.09338.pdf) [[code]](https://github.com/hunto/LocalMamba)
- MambaTalk: Efficient Holistic Gesture Synthesis with Selective State Space Models [[arxiv]](https://arxiv.org/pdf/2403.09471.pdf)
- Video Mamba Suite: State Space Model as a Versatile Alternative for Video Understanding [[arxiv]](https://arxiv.org/pdf/2403.09626.pdf) [[code]](https://github.com/OpenGVLab/video-mamba-suite)
- Motion Mamba: Efficient and Long Sequence Motion Generation with Hierarchical and Bidirectional Selective SSM [[arxiv]](https://arxiv.org/pdf/2403.07487.pdf) [[code]](https://steve-zeyu-zhang.github.io/MotionMamba/)
- Point Mamba: A Novel Point
Cloud Backbone Based on State Space Model with Octree-Based Ordering Strategy [[arxiv]](https://arxiv.org/pdf/2403.06467.pdf) [[code]](https://github.com/IRMVLab/Point-Mamba)
- VideoMamba: State Space Model for Efficient Video Understanding [[arxiv]](https://arxiv.org/pdf/2403.06977.pdf) [[code]](https://github.com/OpenGVLab/VideoMamba)
- MiM-ISTD: Mamba-in-Mamba for Efficient Infrared Small Target Detection [[arxiv]](https://arxiv.org/pdf/2403.02148.pdf) [[code]](https://github.com/txchen-USTC/MiM-ISTD)
- Point Could Mamba: Point Cloud Learning via State Space Model [[arxiv]](https://arxiv.org/pdf/2403.00762.pdf) [[code]](https://github.com/zhang-tao-whu/PCM)
- Res-VMamba: Fine-Grained Food Category Visual Classification Using Selective State Space Models with Deep Residual Learning [[arxiv]](https://arxiv.org/pdf/2402.15761.pdf) [[code]](https://github.com/ChiShengChen/ResVMamba)
- MambaIR: A Simple Baseline for Image Restoration with State-Space Model [[arxiv]](https://arxiv.org/abs/2402.15648) [[code]](https://github.com/csguoh/MambaIR)
- Pan-Mamba: Effective pan-sharpening with State Space Model [[arxiv]](https://arxiv.org/pdf/2402.12192.pdf) [[code]](https://github.com/alexhe101/Pan-Mamba)
- PointMamba: A Simple State Space Model for Point Cloud Analysis [[arxiv]](https://arxiv.org/pdf/2402.10739.pdf) [[code]](https://github.com/LMD0311/PointMamba)
- Scalable Diffusion Models with State Space Backbone [[arxiv]](https://arxiv.org/abs/2402.05608) [[code]](https://github.com/feizc/DiS)
- Mamba-ND: Selective State Space Modeling for Multi-Dimensional Data [[arxiv]](https://arxiv.org/pdf/2402.05892.pdf)
- Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model [[arxiv]](https://arxiv.org/abs/2401.09417) [[code]](https://github.com/kyegomez/VisionMamba)
- VMamba: Visual State Space Model [[arxiv]](https://arxiv.org/abs/2401.10166)
[[code]](https://github.com/MzeroMiko/VMamba)
- U-shaped Vision Mamba for Single Image Dehazing [[arxiv]](https://arxiv.org/pdf/2402.04139.pdf) [[code]](https://github.com/zzr-idam/UVM-Net)

## Medical Imaging
- EndoMamba: An Efficient Foundation Model for Endoscopic Videos via Hierarchical Pre-training [[arxiv]](https://arxiv.org/pdf/2502.19090)
- UD-Mamba: A pixel-level uncertainty-driven Mamba model for medical image segmentation [[arxiv]](https://arxiv.org/pdf/2502.02024) [[code]](https://github.com/piooip/UD-Mamba)
- DM-Mamba: Dual-domain Multi-scale Mamba for MRI reconstruction [[arxiv]](https://arxiv.org/pdf/2501.08163) [[code]](https://github.com/XiaoMengLiLiLi/DM-Mamba)
- MSV-Mamba: A Multiscale Vision Mamba Network for Echocardiography Segmentation [[arxiv]](https://arxiv.org/pdf/2501.07120)
- Merging Context Clustering with Visual State Space Models for Medical Image Segmentation [[arxiv]](https://arxiv.org/pdf/2501.01618) [[code]](https://github.com/zymissy/CCViM)
- HCMA-UNet: A Hybrid CNN-Mamba UNet with Inter-Slice Self-Attention for Efficient Breast Cancer Segmentation [[arxiv]](https://arxiv.org/pdf/2501.00751) [[code]](https://anonymous.4open.science/r/ICME2025_HCMA-UNet/README.md)
- S3-Mamba: Small-Size-Sensitive Mamba for Lesion Segmentation [[arxiv]](https://arxiv.org/pdf/2412.14546) [[code]](https://github.com/ErinWang2023/S3-Mamba)
- SAM-Mamba: Mamba Guided SAM Architecture for Generalized Zero-Shot Polyp Segmentation [[arxiv]](https://arxiv.org/pdf/2412.08482) [[code]](https://github.com/TapasKumarDutta1/SAM_Mamba_2025)
- 2DMamba: Efficient State Space Model for Image Representation with Applications on Giga-Pixel Whole Slide Image Classification [[arxiv]](https://arxiv.org/pdf/2412.00678) [[code]](https://github.com/AtlasAnalyticsLab/2DMamba)
- MambaU-Lite: A Lightweight Model based on Mamba and Integrated Channel-Spatial Attention for Skin Lesion Segmentation
[[arxiv]](https://arxiv.org/pdf/2412.01405)
- KAN-Mamba FusionNet: Redefining Medical Image Segmentation with Non-Linear Modeling [[arxiv]](https://arxiv.org/pdf/2411.11926)
- Optical Flow Representation Alignment Mamba Diffusion Model for Medical Video Generation [[arxiv]](https://arxiv.org/pdf/2411.01647) [[code]](https://wongzbb.github.io/MedSora/)
- MLLA-UNet: Mamba-like Linear Attention in an Efficient U-Shape Model for Medical Image Segmentation [[arxiv]](https://arxiv.org/pdf/2410.23738) [[code]](https://github.com/csyfjiang/MLLA-UNet)
- R2Gen-Mamba: A Selective State Space Model for Radiology Report Generation [[arxiv]](https://arxiv.org/pdf/2410.18135) [[code]](https://github.com/YonghengSun1997/R2Gen-Mamba)
- Taming Mambas for Voxel Level 3D Medical Image Segmentation [[arxiv]](https://arxiv.org/pdf/2410.15496) [[code]](https://anonymous.4open.science/r/WACV2025-TamingMamba/README.md)
- UMambaAdj: Advancing GTV Segmentation for Head and Neck Cancer in MRI-Guided RT with UMamba and nnU-Net ResEnc Planner [[arxiv]](https://arxiv.org/pdf/2410.12940)
- MambaEviScrib: Mamba and Evidence-Guided Consistency Make CNN Work Robustly for Scribble-Based Weakly Supervised Ultrasound Image Segmentation [[arxiv]](https://arxiv.org/pdf/2409.19370) [[code]](https://github.com/GtLinyer/MambaEviScrib)
- DenoMamba: A fused state-space model for low-dose CT denoising [[arxiv]](https://arxiv.org/pdf/2409.13094)
- MambaRecon: MRI Reconstruction with Structured State Space Models [[arxiv]](https://arxiv.org/pdf/2409.12401) [[code]](https://github.com/yilmazkorkmaz1/MambaRecon)
- MambaClinix: Hierarchical Gated Convolution and Mamba-Based U-Net for Enhanced 3D Medical Image Segmentation [[arxiv]](https://arxiv.org/pdf/2409.12533) [[code]](https://github.com/CYB08/MambaClinix-PyTorch)
- SPRMamba: Surgical Phase Recognition for Endoscopic Submucosal Dissection with Mamba [[arxiv]](https://arxiv.org/pdf/2409.12108)
- SkinMamba: A Precision Skin Lesion Segmentation Architecture with Cross-Scale Global State Modeling and Frequency Boundary Guidance [[arxiv]](https://arxiv.org/pdf/2409.10890) [[code]](https://github.com/zs1314/SkinMamba)
- MedSegMamba: 3D CNN-Mamba Hybrid Architecture for Brain Segmentation [[arxiv]](https://arxiv.org/pdf/2409.08307)
- Tri-Plane Mamba: Efficiently Adapting Segment Anything Model for 3D Medical Images [[arxiv]](https://arxiv.org/pdf/2409.08492) [[code]](https://github.com/xmed-lab/TP-Mamba)
- OCTAMamba: A State-Space Model Approach for Precision OCTA Vasculature Segmentation [[arxiv]](https://arxiv.org/pdf/2409.08000) [[code]](https://github.com/zs1314/OCTAMamba)
- Microscopic-Mamba: Revealing the Secrets of Microscopic Images with Just 4M Parameters [[arxiv]](https://arxiv.org/pdf/2409.07896) [[code]](https://github.com/zs1314/Microscopic-Mamba)
- MpoxMamba: A Grouped Mamba-based Lightweight Hybrid Network for Mpox Detection [[arxiv]](https://arxiv.org/pdf/2409.04218)
- Serp-Mamba: Advancing High-Resolution Retinal Vessel Segmentation with Selective State-Space Model [[arxiv]](https://arxiv.org/pdf/2409.04356)
- Mamba2MIL: State Space Duality Based Multiple Instance Learning for Computational Pathology [[arxiv]](https://arxiv.org/pdf/2408.15032) [[code]](https://github.com/YuqiZhang-Buaa/Mamba2MIL)
- MSVM-UNet: Multi-Scale Vision Mamba UNet for Medical Image Segmentation [[arxiv]](https://arxiv.org/pdf/2408.13735) [[code]](https://github.com/gndlwch2w/msvm-unet)
- ShapeMamba-EM: Fine-Tuning Foundation Model with Local Shape Descriptors and Mamba Blocks for 3D EM Image Segmentation [[arxiv]](https://arxiv.org/pdf/2408.14114)
- LoG-VMamba: Local-Global Vision Mamba for Medical Image Segmentation [[arxiv]](https://arxiv.org/pdf/2408.14415) [[code]](https://github.com/Oulu-IMEDS/LoG-VMamba)
- MambaMIM: Pre-training Mamba with State Space Token-interpolation
[[arxiv]](https://arxiv.org/pdf/2408.08070) [[code]](https://github.com/FengheTan9/MambaMIM)
- BioMamba: A Pre-trained Biomedical Language Representation Model Leveraging Mamba [[arxiv]](https://arxiv.org/pdf/2408.02600) [[code]](https://github.com/LeoYML/BioMamba)
- Mamba? Catch The Hype Or Rethink What Really Helps for Image Registration [[arxiv]](https://arxiv.org/pdf/2407.19274)
- GFE-Mamba: Mamba-based AD Multi-modal Progression Assessment via Generative Feature Extraction from MCI [[arxiv]](https://arxiv.org/pdf/2407.15719) [[code]](https://github.com/Tinysqua/GFE-Mamba)
- SliceMamba for Medical Image Segmentation [[arxiv]](https://arxiv.org/pdf/2407.08481)
- SR-Mamba: Effective Surgical Phase Recognition with State Space Model [[arxiv]](https://arxiv.org/pdf/2407.08333) [[code]](https://github.com/rcao-hk/SR-Mamba)
- Deform-Mamba Network for MRI Super-Resolution [[arxiv]](https://arxiv.org/pdf/2407.05969)
- Vision Mamba for Classification of Breast Ultrasound Images [[arxiv]](https://arxiv.org/pdf/2407.03552)
- MMR-Mamba: Multi-Contrast MRI Reconstruction with Mamba and Spatial-Frequency Information Fusion [[arxiv]](https://arxiv.org/pdf/2406.18950)
- Soft Masked Mamba Diffusion Model for CT to MRI Conversion [[arxiv]](https://arxiv.org/pdf/2406.15910) [[code]](https://github.com/wongzbb/DiffMa-Diffusion-Mamba)
- SEDMamba: Enhancing Selective State Space Modelling with Bottleneck Mechanism and Fine-to-Coarse Temporal Fusion for Efficient Error Detection in Robot-Assisted Surgery [[arxiv]](https://arxiv.org/pdf/2406.15920)
- Vision Mamba: Cutting-Edge Classification of Alzheimer's Disease with 3D MRI Scans [[arxiv]](https://arxiv.org/pdf/2406.05757)
- Convolution and Attention-Free Mamba-based Cardiac Image Segmentation [[arxiv]](https://arxiv.org/pdf/2406.05786)
- MUCM-Net: A Mamba Powered UCM-Net for Skin Lesion Segmentation [[arxiv]](https://arxiv.org/pdf/2405.15925)
[[code]](https://github.com/chunyuyuan/MUCM-Net)
- I2I-Mamba: Multi-modal medical image synthesis via selective state space modeling [[arxiv]](https://arxiv.org/pdf/2405.14022)
- VM-DDPM: Vision Mamba Diffusion for Medical Image Synthesis [[arxiv]](https://arxiv.org/pdf/2405.05667)
- HC-Mamba: Vision MAMBA with Hybrid Convolutional Techniques for Medical Image Segmentation [[arxiv]](https://arxiv.org/pdf/2405.05007)
- AC-MAMBASEG: An adaptive convolution and Mamba-based architecture for enhanced skin lesion segmentation [[arxiv]](https://arxiv.org/pdf/2405.03011) [[code]](https://github.com/vietthanh2710/AC-MambaSeg)
- Vim4Path: Self-Supervised Vision Mamba for Histopathology Images [[arxiv]](https://arxiv.org/pdf/2404.13222.pdf) [[code]](https://github.com/AtlasAnalyticsLab/Vim4Path)
- FusionMamba: Dynamic Feature Enhancement for Multimodal Image Fusion with Mamba [[arxiv]](https://arxiv.org/pdf/2404.09498.pdf) [[code]](https://github.com/millieXie/FusionMamba)
- ViM-UNet: Vision Mamba for Biomedical Segmentation [[arxiv]](https://arxiv.org/pdf/2404.07705.pdf) [[code]](https://github.com/constantinpape/torch-em/blob/main/vimunet.md)
- VMambaMorph: a Visual Mamba-based Framework with Cross-Scan Module for Deformable 3D Image Registration [[arxiv]](https://arxiv.org/pdf/2404.05105.pdf) [[code]](https://github.com/ziyangwang007/VMambaMorph)
- T-Mamba: Frequency-Enhanced Gated Long-Range Dependency for Tooth 3D CBCT Segmentation [[arxiv]](https://arxiv.org/pdf/2404.01065.pdf) [[code]](https://github.com/isbrycee/T-Mamba)
- Rotate to Scan: UNet-like Mamba with Triplet SSM Module for Medical Image Segmentation [[arxiv]](https://arxiv.org/pdf/2403.17701.pdf)
- H-vmunet: High-order Vision Mamba UNet for Medical Image Segmentation [[arxiv]](https://arxiv.org/pdf/2403.13642.pdf) [[code]](https://github.com/wurenkai/H-vmunet)
- ProMamba: Prompt-Mamba for polyp segmentation
[[arxiv]](https://arxiv.org/pdf/2403.13660.pdf)
- VM-UNET-V2 Rethinking Vision Mamba UNet for Medical Image Segmentation [[arxiv]](https://arxiv.org/pdf/2403.09157.pdf) [[code]](https://github.com/nobodyplayer1/VM-UNetV2)
- MD-Dose: A diffusion model based on the Mamba for radiation dose prediction [[arxiv]](https://arxiv.org/pdf/2403.08479.pdf) [[code]](https://github.com/LinjieFu-U/mamba_dose)
- Large Window-based Mamba UNet for Medical Image Segmentation: Beyond Convolution and Self-attention [[arxiv]](https://arxiv.org/pdf/2403.07332.pdf) [[code]](https://github.com/wjh892521292/LMa-UNet)
- MambaMIL: Enhancing Long Sequence Modeling with Sequence Reordering in Computational Pathology [[arxiv]](https://arxiv.org/pdf/2403.06800.pdf) [[code]](https://github.com/isyangshu/MambaMIL)
- LightM-UNet: Mamba Assists in Lightweight UNet for Medical Image Segmentation [[arxiv]](https://arxiv.org/pdf/2403.05246.pdf) [[code]](https://github.com/MrBlankness/LightM-UNet)
- MamMIL: Multiple Instance Learning for Whole Slide Images with State Space Models [[arxiv]](https://arxiv.org/pdf/2403.05160.pdf)
- MedMamba: Vision Mamba for Medical Image Classification [[arxiv]](https://arxiv.org/pdf/2403.03849.pdf) [[code]](https://github.com/YubiaoYue/MedMamba)
- MambaMIR: An Arbitrary-Masked Mamba for Joint Medical Image Reconstruction and Uncertainty Estimation [[arxiv]](https://arxiv.org/pdf/2402.18451.pdf) [[code]](https://github.com/ayanglab/MambaMIR)
- Weak-Mamba-UNet: Visual Mamba Makes CNN and ViT Work Better for Scribble-based Medical Image Segmentation [[arxiv]](https://arxiv.org/pdf/2402.10887.pdf) [[code]](https://github.com/ziyangwang007/Mamba-UNet)
- P-Mamba: Marrying Perona Malik Diffusion with Mamba for Efficient Pediatric Echocardiographic Left Ventricular Segmentation [[arxiv]](https://arxiv.org/pdf/2402.08506.pdf)
- Semi-Mamba-UNet: Pixel-Level Contrastive Cross-Supervised Visual Mamba-based UNet for
Semi-Supervised Medical Image Segmentation [[arxiv]](https://arxiv.org/pdf/2402.07245.pdf) [[code]](https://github.com/ziyangwang007/Mamba-UNet)
- FD-Vision Mamba for Endoscopic Exposure Correction [[arxiv]](https://arxiv.org/pdf/2402.06378.pdf) [[code]](https://github.com/zzr-idam/FDVM-Net)
- MambaMorph: a Mamba-based Backbone with Contrastive Feature Learning for Deformable MR-CT Registration [[arxiv]](https://arxiv.org/abs/2401.13934) [[code]](https://github.com/Guo-Stone/MambaMorph)
- Vivim: a Video Vision Mamba for Medical Video Object Segmentation [[arxiv]](https://arxiv.org/abs/2401.14168) [[code]](https://github.com/scott-yjyang/Vivim)
- U-Mamba: Enhancing Long-range Dependency for Biomedical Image Segmentation [[arxiv]](https://arxiv.org/pdf/2401.04722.pdf) [[code]](https://github.com/bowang-lab/U-Mamba)
- Swin-UMamba: Mamba-based UNet with ImageNet-based pretraining [[arxiv]](https://arxiv.org/pdf/2402.03302.pdf) [[code]](https://github.com/JiarunLiu/Swin-UMamba)
- nnMamba: 3D Biomedical Image Segmentation, Classification and Landmark Detection with State Space Model [[arxiv]](https://arxiv.org/pdf/2402.03526.pdf) [[code]](https://github.com/lhaof/nnMamba)
- SegMamba: Long-range Sequential Modeling Mamba For 3D Medical Image Segmentation [[arxiv]](https://arxiv.org/abs/2401.13560) [[code]](https://github.com/ge-xing/SegMamba)
- VM-UNet: Vision Mamba UNet for Medical Image Segmentation [[arxiv]](https://arxiv.org/abs/2402.02491) [[code]](https://github.com/JCruan519/VM-UNet)
- Mamba-UNet: UNet-Like Pure Visual Mamba for Medical Image Segmentation [[arxiv]](https://arxiv.org/pdf/2402.05079.pdf) [[code]](https://github.com/ziyangwang007/Mamba-UNet)