Back to Directory
Groq logo

Groq

New

Get LLM responses in milliseconds, not seconds. Drop-in OpenAI-compatible API backed by custom LPU hardware built for inference speed.

Developer Tools
4.6(6,100 reviews)freemium

Overview

Groq provides extremely low-latency LLM inference powered by its custom LPU hardware, offering an OpenAI-compatible API to run popular open models at industry-leading speeds.

Key Features

  • Very low-latency inference
  • OpenAI-compatible API
  • Popular open models hosted
  • Generous free tier
Pros
  • Blazing fast responses
  • Easy drop-in API
  • Cost-effective
Cons
  • Limited model selection
  • Capacity constraints at peak
Advertisement