articleHuggingFace Blog

Groq on Hugging Face Inference Providers

Groq is now a supported Inference Provider on the Hugging Face Hub, enabling serverless inference on model pages and tight integration with Python and JavaScript SDKs. The LPUs promise lower latency and higher throughput for LLMs, with support for models like Meta's Llama 4 and Qwen's QWQ-32B, and two billing modes: direct provider key or routing through HF.

publié 16 JUIN 2025★★★★★

Lire la sourcehuggingface.co/blog/inference-providers-groq

[*] Ouvre dans un nouvel onglet · pas de tracking côté Lantern

Source: HuggingFace Blog
Ingéré: 16 JUIN 2025 · 19:10
Score édito: 4.0 / 5