Veille dev & IA

Le meilleur du dev et de l'IA, scoré chaque jour par un agent. Filtré, résumé, classé. Aucune couleur, aucun bruit — juste la matière.

Issue: No. 138
Date: 18 MAI 2026
Édition: FR · DAILY
Sources: 14 actives
Articles: 0 aujourd'hui

§ Feed·Vol. 02·No. 138

Last ingest·10:00 UTC+2·Next·08:00

Filtres

Reference PanelA.1

01. Type— 5

02. Période— 3

03. Source— 7

04. Score— min.

0 actifs

$⌘K

Articles / jour0

7-jour moy.47

Lun → Dim-100%

Feed · 805 articles

trier parscore·DESC ↓

60119 JUIN00:00

articleHuggingFace Blog·il y a 11 m.

(LoRA) Fine-Tuning FLUX.1-dev on Consumer Hardware

New post shows efficient fine-tuning of FLUX.1-dev on consumer hardware via QLoRA with the diffusers library, targeting peak VRAM under ~10 GB on a single GPU. It explains loading a quantized 4-bit base model, training FP16/BF16 LoRA adapters, uses an 8-bit AdamW optimizer, and discusses options to load or merge LoRA adapters with results demonstrated on an RTX 4090.

★★★★★·HuggingFace Blog

60216 JUIN00:00

articleHuggingFace Blog·il y a 11 m.

Groq on Hugging Face Inference Providers

Groq is now a supported Inference Provider on the Hugging Face Hub, enabling serverless inference on model pages and tight integration with Python and JavaScript SDKs. The LPUs promise lower latency and higher throughput for LLMs, with support for models like Meta's Llama 4 and Qwen's QWQ-32B, and two billing modes: direct provider key or routing through HF.

★★★★★·HuggingFace Blog

60312 JUIN08:00

articleHuggingFace Blog·il y a 11 m.

How Long Prompts Block Other Requests - Optimizing LLM Performance

The article analyzes how long prefill prompts can block the prefill queue in a multi-request setting, and explains that decoding steps are light but must be sequential. It discusses two patterns - chunked prefill and request-parallel prefills - and why long prompts undermine throughput, with implications for vLLM scheduling.

★★★★★·HuggingFace Blog

60412 JUIN00:00

articleHuggingFace Blog·il y a 11 m.

Featherless AI on Hugging Face Inference Providers

Featherless AI est désormais supporté comme Inference Provider sur Hugging Face Hub, permettant l'inférence serverless directement sur les pages des modèles et accessible via les SDK JS et Python. Il prend en charge un large éventail de modèles open-source et offre deux modes d’appel (clé personnalisée ou routée par HF) avec une tarification directe sur le compte utilisateur. Des exemples Python et JS montrent comment l'utiliser avec Featherless AI.

★★★★★·HuggingFace Blog

60512 JUIN00:00

articleHuggingFace Blog·il y a 11 m.

Learn the Hugging Face Kernel Hub in 5 Minutes

Hugging Face's Kernel Hub lets Python apps load pre-compiled, optimized kernels directly from the Hub, avoiding local builds. It includes a quick code example to fetch a kernel (e.g., activation) and apply it, and discusses integrating kernels into models like RMSNorm and FlashAttention. The article also covers performance benchmarking and real-world use cases.

★★★★★·HuggingFace Blog

60611 JUIN18:27

articleHuggingFace Blog·il y a 11 m.

Post-Training Isaac GR00T N1.5 for LeRobot SO-101 Arm

Cet article présente GR00T N1.5 et explique comment réaliser un post-entraînement sur le bras LeRobot SO-101. Il propose un tutoriel pas-à-pas couvrant l'installation, la préparation du dataset et le fine-tuning, puis l'évaluation et le déploiement. Des commandes et configurations (modality.json, scripts/gr00t_finetune.py) permettent une adaptation rapide du modèle à votre robot.

★★★★★·HuggingFace Blog

60711 JUIN00:00

articleHuggingFace Blog·il y a 11 m.

Introducing Training Cluster as a Service - a new collaboration with NVIDIA

Hugging Face and NVIDIA announce Training Cluster as a Service to give researchers easy access to large GPU clusters for training foundational models, paying only for training durations. The solution combines NVIDIA DGX Cloud Lepton, regional capacity, and Hugging Face tooling to provision, price, and manage clusters, with user requests via hf.co/training-cluster.

★★★★★·HuggingFace Blog

60806 JUIN00:00