HiF-VLA: Hindsight, Insight and Foresight through Motion Representation for Vision-Language-Action Models
Paper
•
2512.09928
•
Published
•
11
HiF-VLA: Hindsight, Insight and Foresight through Motion Representation for Vision-Language-Action Models.
@misc{lin2025hifvlahindsightinsightforesight,
title={HiF-VLA: Hindsight, Insight and Foresight through Motion Representation for Vision-Language-Action Models},
author={Minghui Lin and Pengxiang Ding and Shu Wang and Zifeng Zhuang and Yang Liu and Xinyang Tong and Wenxuan Song and Shangke Lyu and Siteng Huang and Donglin Wang},
year={2025},
eprint={2512.09928},
archivePrefix={arXiv},
primaryClass={cs.RO},
url={https://arxiv.org/abs/2512.09928},
}
Base model
openvla/openvla-7b