LESSWRONG
LW

364
Chakshu Mira
0040
Message
Dialogue
Subscribe

Posts

Sorted by New

Wikitag Contributions

Comments

Sorted by
Newest
No posts to display.
No wikitag contributions to display.
Ophiology (or, how the Mamba architecture works)
Chakshu Mira1y10

## Discretize B ## # [B,N] [E->N] [B,E] B = layer.W_B(x[b,l]) # no bias

Shouldn't this be x[:,l] instead of x[b,l]?

Reply
Ophiology (or, how the Mamba architecture works)
Chakshu Mira1y10

y_t=[N]C[E,N]h_t+[E]    <this one> E    [E]xt

Shouldn't this be 'D'?

Reply
Ophiology (or, how the Mamba architecture works)
Chakshu Mira1y10

E

Did you mean 'D' here? (2nd equation of the structured SSM)

Reply
Ophiology (or, how the Mamba architecture works)
Chakshu Mira1y10

Is this a typo? (Δtvt+1)xt−1

Reply