Abhay Puri

abhaypuri

AI & ML interests

LLMs, vision, diffusion models

Organizations

None yet

authored a paper 12 months ago

AlignVLM: Bridging Vision and Language Latent Spaces for Multimodal Understanding

Paper • 2502.01341 • Published Feb 3, 2025 • 39

authored a paper about 1 year ago

BigDocs: An Open and Permissively-Licensed Dataset for Training Multimodal Models on Document and Code Tasks

Paper • 2412.04626 • Published Dec 5, 2024 • 13