Groq
Profile Overview
Groq is an artificial intelligence chip design and cloud inference startup founded in 2016 by Jonathan Ross, a former core member of the Google team that designed the Tensor Processing Unit (TPU). Headquartered in Mountain View, California, Groq is the pioneer of the Language Processing Unit (LPU), a new class of processor specifically engineered for sequential, low-latency computing tasks like running large language models. Unlike traditional GPUs, which process data in parallel and suffer from latency overheads, Groq's LPU architecture delivers deterministic, ultra-fast token generation, allowing models like Llama to run at hundreds of tokens per second. The company offers its hardware through GroqCloud, a developer API platform that has gained massive popularity for real-time AI applications such as voice assistants, search engines, and conversational bots. Groq also sells LPU hardware cards directly to enterprise data centers and government entities looking to accelerate their AI infrastructure. In August 2024, Groq raised $640 million in a Series D funding round led by BlackRock Private Equity Partners, bringing its total funding to $1 billion at a valuation of $2.8 billion. Groq's unique chip architecture represents a critical challenge to Nvidia's dominance in the AI inference market.
Last Financing Round
Flagship Offerings
Vetted AI Models
No models currently cataloged under Groq.