articleHuggingFace Blog
Groq on Hugging Face Inference Providers
Groq is now a supported Inference Provider on the Hugging Face Hub, enabling serverless inference on model pages and tight integration with Python and JavaScript SDKs. The LPUs promise lower latency and higher throughput for LLMs, with support for models like Meta's Llama 4 and Qwen's QWQ-32B, and two billing modes: direct provider key or routing through HF.
publié 16 JUIN 2025★★★★★
Lire la sourcehuggingface.co/blog/inference-providers-groq
[*] Ouvre dans un nouvel onglet · pas de tracking côté Lantern
- Source
- HuggingFace Blog
- Ingéré
- 16 JUIN 2025 · 19:10
- Score édito
- 4.0 / 5