PaliGemma
Overview
PaliGemma is Google’s open vision-language model that accepts images plus text and outputs text for captioning, visual question answering, OCR-style tasks, and detection.
About Google DeepMind
View Company ProfileTools using PaliGemma
No tools found for this model yet.
