TAAFT
Free mode
100% free
Freemium
Free Trial
Deals

PaliGemma 2

Model family: Gemma
PaliGemma 2 combines the SigLIP-So400m vision encoder with Gemma 2 language models and is available in 3B, 10B, and 28B sizes. It is designed as a lightweight open VLM for transfer and fine-tuning across image-text tasks such as captioning, VQA, OCR-style understanding, and detection.
Text Gen 3
Released: December 5, 2024

Overview

PaliGemma 2 is Google’s upgraded open vision-language model family based on Gemma 2, available in 3B, 10B, and 28B sizes.

About Google DeepMind

Company Size: 6000
Location: London, England, GB
View Company Profile

Tools using PaliGemma 2

No tools found for this model yet.

Last updated: June 2, 2026
0 AIs selected
Clear selection
#
Name
Task