jondurbin/airoboros-jamba-3-3

jondurbin committed on Apr 9, 2024

Commit 4dba3bd · verified · 1 Parent(s): 47ee4e4

Update README.md

Files changed (1)

README.md +7 -0

@@ -21,6 +21,13 @@ Another experimental model, using mostly synthetic data generated by [airoboros](
 This fine-tune is on [jamba-v0.1](https://huggingface.co/ai21labs/Jamba-v0.1) using QLoRA.
+Be sure to install the dependencies for jamba:
+```bash
+pip install "transformers>=4.39.0"
+pip install mamba-ssm "causal-conv1d>=1.2.0"
+```
 #### Highlights
 The base model, jamba-v0.1, supposedly has a 256k context length, but this finetune was done with 8192 so anything beyond that will likely have undefined results.
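
To confirm that an environment satisfies the minimum versions named in the added install lines, a small check along these lines could be run after installation. This is only a sketch: `numeric_prefix`, `meets_minimum`, and `check_jamba_deps` are hypothetical helper names, not part of the model card or of any library, and the comparison looks only at the leading numeric components of each version string, so a pre-release such as `4.39.0.dev0` is treated the same as `4.39.0`.

```python
import re
from importlib import metadata

# Minimums copied from the install commands in the diff.
# None means the diff does not pin a minimum for that package.
REQUIRED = {
    "transformers": "4.39.0",
    "mamba-ssm": None,
    "causal-conv1d": "1.2.0",
}


def numeric_prefix(version):
    """Return the leading dotted-numeric part of a version as a tuple of ints."""
    match = re.match(r"\d+(?:\.\d+)*", version)
    if match is None:
        return ()
    return tuple(int(part) for part in match.group(0).split("."))


def meets_minimum(installed, minimum):
    """True if `installed` is at least `minimum` (numeric components only)."""
    if minimum is None:
        return True
    have = numeric_prefix(installed)
    need = numeric_prefix(minimum)
    # Pad with zeros so that "4.39" compares equal to "4.39.0".
    width = max(len(have), len(need))
    have += (0,) * (width - len(have))
    need += (0,) * (width - len(need))
    return have >= need


def check_jamba_deps(required=REQUIRED):
    """Map each distribution name to 'ok', 'missing', or a 'too old' message."""
    report = {}
    for name, minimum in required.items():
        try:
            installed = metadata.version(name)
        except metadata.PackageNotFoundError:
            report[name] = "missing"
            continue
        if meets_minimum(installed, minimum):
            report[name] = "ok"
        else:
            report[name] = f"too old ({installed} < {minimum})"
    return report


if __name__ == "__main__":
    for name, status in check_jamba_deps().items():
        print(f"{name}: {status}")
```

Running the script prints one line per dependency; anything other than `ok` suggests rerunning the install commands from the diff.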