Evaluate AI model predictions with correctness scores
Answer questions about your images
Generate text responses from images and text input
Meta Llama3 8b with Llava Multimodal capabilities
Submit video model evaluation results to a public benchmark