TAAFT
Free mode
100% free
Freemium
Free Trial
Deals

Groq Llama

By Groq
New Text Gen 7
Released: April 10, 2024

Overview

Groq Llama is Meta’s Llama family served on Groq hardware, giving Llama-quality outputs with very low latency.

Description

Like Groq Gemma, it uses Groq’s specialized chips and compiler to deliver extremely fast token generation while preserving the original Llama behavior. This setup suits applications that already rely on Llama prompts or fine-tunes but need much faster responses, such as code assistants, interactive tools, or large fleets of customer-facing bots.

About Groq

Established in 2016 for inference, Groq is literally built different. It’s the only custom-built inference chip that fuels developers with the performance they need at a cost that doesn’t hold them back.

Website: groq.com
View Company Profile

Related Models

Last updated: November 25, 2025