Possibility of using Mamba SSM #390
Labels
enhancement
New feature or request
help wanted
Extra attention is needed
local
Issue related to local generation
Mamba SSM architecture has some improvements over the standard transformer based LLMs it seems. Mamba Paper. There are people working on implementations with llama.cpp, mentioned in this GitHub thread towards the end
The text was updated successfully, but these errors were encountered: