Using Mamba's recurrent mode during evaluation #1

JVP15 · 2024-05-30T23:19:56Z

So, coincidentally, I've also done some experiments using Mamba instead of a Transformer for Decision Mamba (most recent repo can be found here: https://github.com/lmco/DecisionMamba). Have you tried using Mamba's inference_params during evals (essentially, the 'recurrent' mode for Mamba) instead of the parallel mode (with the restricted context length)? It's something I was testing, but I didn't get very far with it.

The text was updated successfully, but these errors were encountered:

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Using Mamba's recurrent mode during evaluation #1

Using Mamba's recurrent mode during evaluation #1

JVP15 commented May 30, 2024

Using Mamba's recurrent mode during evaluation #1

Using Mamba's recurrent mode during evaluation #1

Comments

JVP15 commented May 30, 2024