view post Post 2378 Native tensor parallel has landed in transformers!!! https://github.com/huggingface/transformers/pull/34184 thanks a lot to the torch team for their support! Contributions are welcome to support more models! π₯
view post Post mamba is now available in transformers. Thanks to @tridao and @albertgu for this brilliant model! π and the amazing mamba-ssm kernels powering this!Checkout the collection here: state-spaces/transformers-compatible-mamba-65e7b40ab87e5297e45ae406
Mamba Mamba checkpoints compatible with transformers ArthurZ/mamba-2.8b Text Generation β’ Updated Mar 4 β’ 14 β’ 1 ArthurZ/mamba-2.8b-slimpj Text Generation β’ Updated Feb 19 β’ 28 ArthurZ/mamba-1.4b Text Generation β’ Updated Feb 29 β’ 15 ArthurZ/mamba-790m Text Generation β’ Updated Feb 29 β’ 20