Answer questions about images with text prompts
Convert models to Safetensors and open a PR
VLMEvalKit Evaluation Results Collection