Robotics
Transformers
Safetensors
English
openvla
feature-extraction
vla
image-text-to-text
multimodal
pretraining
custom_code
Instructions to use openvla/openvla-7b with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Transformers
How to use openvla/openvla-7b with Transformers:
# Load model directly from transformers import AutoModelForVision2Seq model = AutoModelForVision2Seq.from_pretrained("openvla/openvla-7b", trust_remote_code=True, dtype="auto") - Notebooks
- Google Colab
- Kaggle
Update modeling_prismatic.py to account for the case where `input_ids` is `None `
#5
by eliotj - opened
Input Ids and Input Embeds are both marked Optional[torch.LongTensor] = None, however failing to pass in input_ids into the forward() method results in an error in the first block, since the code automatically checks if input_ids.shape[1] == 1 without first checking to see if input_ids is not None.
This pull request updates the logic to allow for this case in Generation with Cache and Multimodal Forward.