Commit

Merge remote-tracking branch 'origin/main' into mathias
mlech26l committed Jul 12, 2022
2 parents aa0c9e2 + 99ef4c1 commit afbe5de
Showing 5 changed files with 3 additions and 281 deletions.
56 changes: 3 additions & 53 deletions README.md
@@ -1,8 +1,6 @@
-# Structured State Spaces for Sequence Modeling
+# Liquid State Space Models

-This repository provides implementations and experiments for the following papers.
-
-## Liquid S4
+This repository provides implementations and experiments for Liquid S4

```bash
python3 -m train wandb=null experiment=s4-lra-cifar-new # plain S4
@@ -11,34 +9,6 @@ python3 -m train wandb=null experiment=s4-lra-cifar-new model.layer.liquid=2 # l
python3 -m train wandb=null experiment=s4-lra-cifar-new model.layer.liquid=N # liquid S4 with N terms u(i)*u(j)...
```

-## SaShiMi (arXiv)
-
-![SaShiMi](assets/sashimi.png "SaShiMi Architecture")
-> **It's Raw! Audio Generation with State-Space Models**\
-> Karan Goel, Albert Gu, Chris Donahue, Christopher Ré\
-> Paper: https://arxiv.org/abs/2202.09729
-## S4 (ICLR 2022 - Outstanding Paper HM)
-
-![Structured State Spaces](assets/properties.png "Properties of Structured State Spaces")
-> **Efficiently Modeling Long Sequences with Structured State Spaces**\
-> Albert Gu, Karan Goel, Christopher Ré\
-> Paper: https://arxiv.org/abs/2111.00396
-## LSSL (NeurIPS 2021)
-
-![Linear State Space Layer](assets/splash.png "Properties of Sequential State Spaces")
-> **Combining Recurrent, Convolutional, and Continuous-time Models with the Linear State Space Layer**\
-> Albert Gu, Isys Johnson, Karan Goel, Khaled Saab, Tri Dao, Atri Rudra, Christopher Ré\
-> Paper: https://arxiv.org/abs/2110.13985
-## HiPPO (NeurIPS 2020 - Spotlight)
-![HiPPO Framework](assets/hippo.png "HiPPO Framework")
-> **HiPPO: Recurrent Memory with Optimal Polynomial Projections**\
-> Albert Gu*, Tri Dao*, Stefano Ermon, Atri Rudra, Christopher Ré\
-> Paper: https://arxiv.org/abs/2008.07669
-
## Table of Contents
- [Repository Setup](#setup)
- S4
@@ -49,26 +19,6 @@ python3 -m train wandb=null experiment=s4-lra-cifar-new model.layer.liquid=N # l
- [Repository Structure](#overall-repository-structure)
- [Citation](#citation)

-## Changelog
-
-### 2022-05-01 - [V2.1]
-- Minor updates to S4 modules
-- New S4D (S4-diagonal) standalone model found at `src/models/sequence/ss/standalone/s4d.py`. Simple variant using diagonal SSMs that recovers S4's performance on most tasks. Can be run with any existing experiment config with the additional flag `model/layer=s4d` on the command line.
-- New [LRA configs](#long-range-arena-lra) for updated S4 code, with an average score of ~86
-
-### 2022-04-03
-By default, S4 no longer requires installing Pykeops or a custom CUDA kernel.
-
-### 2022-02-27 - [V2]
-Code release for SaShiMi audio model.
-
-### 2022-01-29
-Added configs for time series datasets from the Informer paper.
-
-### 2021-11-18 - [V1]
-First release of this repository containing the S4 module and configs to reproduce sCIFAR, Speech Commands, Long Range Arena, and WikiText-103 results.
-
-
## Setup

### Requirements
@@ -372,4 +322,4 @@ If you use this codebase, or otherwise found our work valuable, please cite:
volume={33},
year={2020}
}
-```
+```
73 changes: 0 additions & 73 deletions deploy.py

This file was deleted.

9 changes: 0 additions & 9 deletions run_local.sh

This file was deleted.

100 changes: 0 additions & 100 deletions run_sl1.sl

This file was deleted.

46 changes: 0 additions & 46 deletions run_sl2.sl

This file was deleted.
