BayJarvis: Blogs on state-space-model

paper Mamba: Linear-Time Sequence Modeling with Selective State Spaces - 2023-12-30

The landscape of deep learning is continually evolving, and a recent groundbreaking development comes from the world of sequence modeling. A paper titled "Mamba: Linear-Time Sequence Modeling with Selective State Spaces" introduces a novel approach that challenges the current dominance of Transformer-based models. Let's delve into this innovation. …

network-architecture The Annotated S4: Understanding Structured State Spaces in Sequence Modeling - 2023-12-22

The Annotated S4 website delves into the Structured State Space (S4) architecture, revolutionizing long-range sequence modeling in various domains, including vision, language, and audio. It distinctly moves away from Transformer models, handling over 16,000 sequence elements effectively. …