[⋯]Chargement

Built solo in Lille, FR·v0.6

Veille dev & IA

Le meilleur du dev et de l'IA, scoré chaque jour par un agent. Filtré, résumé, classé. Aucune couleur, aucun bruit — juste la matière.

Issue: No. 153
Date: 02 JUIN 2026
Édition: FR · DAILY
Sources: 14 actives
Articles: 29 aujourd'hui

§ Feed·Vol. 02·No. 153

Last ingest·10:00 UTC+2·Next·08:00

Filtres

Reference PanelA.1

01. Type— 5

02. Période— 3

03. Source— 7

04. Score— min.

0 actifs

$⌘K

Articles / jour29

7-jour moy.18

Lun → Dim

Feed · 879 articles

trier parscore·DESC ↓

64108 AOÛT00:00

articleHuggingFace Blog·il y a 10 m.

Accelerate ND-Parallel: A guide to Efficient Multi-GPU Training

Accelerate ND-Parallel guides how to combine multiple parallelism strategies (data, fully sharded data, tensor, context) for multi-GPU training. It provides concrete config examples (dp_shard_size, dp_replicate_size, cp_size, tp_size) and an FSDP plugin, plus Axolotl integration and end-to-end training scripts to minimize inter-device communication at scale. The article also discusses how to compose strategies for large models and points to ready configs and docs.

★★★★★·HuggingFace Blog

64208 AOÛT00:00

toolHuggingFace Blog·il y a 10 m.

Introducing AI Sheets: a tool to work with datasets using open AI models!

AI Sheets is a no-code tool for building, transforming, and enriching datasets with open AI models, deployable locally or on the Hugging Face Hub. It uses a spreadsheet-like UI where prompts generate new columns, and you can validate or edit cells to fine-tune prompts with few-shot examples. It supports comparing models, data cleaning, classification, enrichment, and synthetic data generation.

★★★★★·HuggingFace Blog

64307 AOÛT00:00

articleHuggingFace Blog·il y a 10 m.

Vision Language Model Alignment in TRL

TRL expands Vision Language Model alignment with Mixed Preference Optimization (MPO), Group Relative Policy Optimization (GRPO), and Group Sequence Policy Optimization (GSPO), extending beyond pairwise DPO to richer signals. It also adds Reinforce Leave One Out (RLOO) and Online DPO for scalable multimodal alignment, plus native SFT support and reproducible training scripts.

★★★★★·HuggingFace Blog

64405 AOÛT00:00

articleHuggingFace Blog·il y a 10 m.

Welcome GPT OSS, the new open-source model family from OpenAI!

OpenAI releases GPT OSS, an open-weight family with 117B and 21B MoE models using 4-bit MXFP4 quantization for fast inference and low resource use. The 120B fits on an 80GB GPU and the 20B on 16GB, both Apache-2.0 licensed; usable via Hugging Face Inference Providers for local or on-device deployments.

★★★★★·HuggingFace Blog

64504 AOÛT19:51

articleHuggingFace Blog·il y a 10 m.

Measuring Open-Source Llama Nemotron Models on DeepResearch Bench

AI-Q combines Llama 3.3-70B Instruct and Llama-3.3-Nemotron-Super-49B-v1.5 to enable long-context retrieval, agentic reasoning, and tool use in open-source stacks. NVIDIA details model lineage, post-training, and transparent evaluation metrics (hallucination detection, multi-source synthesis, citation trust, RAGAS), plus a 49B Nemotron running on a single H100. DeepResearch Bench ranks AI-Q top among fully open stacks with a score of 40.52 in LLM with Search (Aug 2025).

★★★★★·HuggingFace Blog

64601 AOÛT14:25

articleHuggingFace Blog·il y a 10 m.

3LM: A Benchmark for Arabic LLMs in STEM and Code

3LM est un benchmark multidomaine pour évaluer les LLM arabes en STEM et en code, déployant trois jeux de données (Native STEM MCQs, Synthetic STEM et Arabic Code Benchmarks) et des métriques comme pass@1 via EvalPlus. Le pipeline combine OCR, génération par LLM et vérifications humaines, et propose l'accès aux jeux sur HuggingFace et le code sur GitHub.

★★★★★·HuggingFace Blog

64731 JUIL00:00

articleHuggingFace Blog·il y a 10 m.

Implementing MCP Servers in Python: An AI Shopping Assistant with Gradio

The article demonstrates how to implement an MCP server with Gradio to plug an LLM into Hugging Face models. It covers automatic conversion of Python functions into MCP tools, real-time progress notifications, and automatic file uploads, illustrated by an AI shopping assistant using IDM-VTON for virtual try-on.

★★★★★·HuggingFace Blog

64829 JUIL00:00

articleHuggingFace Blog·il y a 10 m.

Introducing Trackio: A Lightweight Experiment Tracking Library from Hugging Face

Trackio est une bibliothèque légère d’expérimentation open-source qui remplace wandb et offre un tableau de bord local Gradio, avec synchronisation vers Hugging Face Spaces pour partager les métriques. Elle vise simplicité, traçabilité des métriques et énergie GPU via nvidia-smi, et peut être utilisée comme drop-in wandb.

★★★★★·HuggingFace Blog

64925 JUIL00:00

articleHuggingFace Blog·il y a 10 m.

Parquet Content-Defined Chunking

Parquet Content-Defined Chunking (CDC) is now available in PyArrow and Pandas, enabling deduplication of Parquet files on Hugging Face’s Xet storage layer. CDC reduces data transfer and storage costs by uploading or downloading only changed data chunks. The article shows how to enable CDC with use_content_defined_chunking and outlines several deduplication scenarios with code examples.

★★★★★·HuggingFace Blog

65025 JUIL00:00

toolHuggingFace Blog·il y a 10 m.

Say hello to `hf`: a faster, friendlier Hugging Face CLI

HF renames the Hugging Face CLI from huggingface-cli to hf to reorganize commands around a clear hf <resource> <action> pattern, improving ergonomics and discoverability. The migration keeps the legacy CLI functional with warnings and documents how to install (pip install -U huggingface_hub) and verify (hf version) and explore commands (hf --help). It also introduces the hf jobs service to run scripts or Docker images on HF infrastructure with pay-as-you-go billing.

★★★★★·HuggingFace Blog

65123 JUIL00:00

articleHuggingFace Blog·il y a 10 m.

TimeScope: How Long Can Your Video Large Multimodal Model Go?

TimeScope is an open-source benchmark that tests long-video understanding by inserting short needle clips into base videos (1 min to 8 hours). It evaluates localized retrieval, information synthesis, and fine-grained temporal perception, revealing that many SOTA models struggle with true temporal comprehension.

★★★★★·HuggingFace Blog

65223 JUIL00:00

articleHuggingFace Blog·il y a 10 m.

Fast LoRA inference for Flux with Diffusers and PEFT

LoRA adapters enable customizing diffusion models but can slow or complicate inference. This post shows how to speed up LoRA inference for Flux using a recipe based on FA3, FP8 quantization, and torch.compile, while staying hot-swapping–ready. It includes a concise code example that applies quantization, sets the attn processor, and compiles the transformer for faster inference.

★★★★★·HuggingFace Blog

65321 JUIL18:01

articleHuggingFace Blog·il y a 11 m.

Accelerate a World of LLMs on Hugging Face with NVIDIA NIM

NVIDIA NIM offre un conteneur unique pour deployer rapidement une large gamme de LLM via Hugging Face, en automatisant l’adaptation, l’analyse du modele et le choix du backend (TensorRT-LLM, vLLM, SGLang). Il prend en charge Hugging Face, GGUF et TensorRT-LLM et illustre le deployment avec Codestral-22B via une commande Docker et tokens API.

★★★★★·HuggingFace Blog

65418 JUIL00:00

articleHuggingFace Blog·il y a 11 m.

Arc Virtual Cell Challenge: A Primer

Arc Institute lance le Virtual Cell Challenge, qui vise à entraîner un modèle capable de prédire l’effet du silençage d’un gène sur une cellule, même dans des types cellulaires inédits (contexte généralisation). Le jeu de données réunit environ 300k profils RNA‑seq d’une cellule unique et propose un cadre mathématique pour séparer le signal du perturbation, l’hétérogénéité et le bruit technique, avec des architectures comme le State Transition Model et le State Embedding Model.

★★★★★·HuggingFace Blog

65517 JUIL00:00

mcpHuggingFace Blog·il y a 11 m.

Five Big Improvements to Gradio MCP Servers

Gradio has released version 5.38.0 to enhance MCP servers with Seamless Local File Support via a new File Upload endpoint, real-time progress streaming for MCP clients, and a one-line OpenAPI-to-MCP conversion using gr.load_openapi. The update also improves authentication by allowing server arguments to be declared as gr.Header and surfaced in docs.

★★★★★·HuggingFace Blog

65617 JUIL00:00

articleHuggingFace Blog·il y a 11 m.

Consilium: When Multiple LLMs Collaborate

Consilium est une plateforme qui fait débattre et faire consensus entre plusieurs LLMs via des discussions structurées, avec des modes comme consensus, vote majoritaire ou classement par choix. Déployée comme interface Gradio et serveur MCP, elle visualise une table ronde et les échanges des experts. L'article relie l'idée au MAI-DxO de Microsoft pour démontrer l'efficacité du multi-LLM.

★★★★★·HuggingFace Blog

65717 JUIL00:00

articleHuggingFace Blog·il y a 11 m.

Back to The Future: Evaluating AI Agents on Predicting Future Events

AI evaluation should shift from recalling facts to forecasting future events. FutureBench uses real-world prediction markets and live news to test agents' ability to reason under uncertainty and synthesize information to predict outcomes.

★★★★★·HuggingFace Blog

65816 JUIL00:00

articleHuggingFace Blog·il y a 11 m.

Ettin Suite: SoTA Paired Encoders and Decoders

Ettin introduces the first SoTA paired encoder-only and decoder-only models (17M–1B params) trained identically for apples-to-apples comparisons. It extends the ModernBERT recipe to both architectures, with encoders beating ModernBERT and decoders beating Llama 3.2 and SmolLM2, while preserving architecture-specific advantages.

★★★★★·HuggingFace Blog

65915 JUIL00:00

articleHuggingFace Blog·il y a 11 m.

Migrating the Hub from Git LFS to Xet

Hugging Face has deployed Xet as the Hub’s storage backend, migrating hundreds of petabytes and millions of repos with minimal disruption. The migration relies on a Git LFS Bridge and background content migrations to support both LFS and Xet, allowing a no-hard-cutover transition. The approach aims to scale storage with AI workloads while preserving existing workflows.

★★★★★·HuggingFace Blog

66010 JUIL12:54

articleHuggingFace Blog·il y a 11 m.

Kimina-Prover: Applying Test-time RL Search on Large Formal Reasoning Models

Kimina-Prover-72B uses a Test-Time Reinforcement Learning (TTRL) search to autonomously discover and reuse lemmas for long-horizon Lean proofs, plus an error-fixing module that interprets Lean messages for targeted corrections. It achieves a state-of-the-art miniF2F performance (92.2% pass rate) and shows pass@1/32/1024 of 63.9/84.0/87.7, with two distilled variants released (8B and 1.7B).

★★★★★·HuggingFace Blog

Page 33 / 44

← Préc.Suiv. →

20 sur 879 affichés

Issue 153 · Digest

Le résumé hebdo, livré dimanche.

20 articles classés par un agent. Aucun bruit, aucune pub. Désabonnement en un clic.

S'abonner →

[top 7 jours]B.1

01.
thunderbolt-ibverbs: We have InfiniBand at home
Lobsters
02.
Beyond LLMs: Why Scalable Enterprise AI Adoption Depends on Agent Logic
HuggingFace Blog
03.
Five Years of Trying to Add Recursion to lychee
Lobsters
04.
ELF Linker Improvements in Zig
Lobsters
05.
UTF8 email with DMA: DragonFly Mail Agent
Lobsters

Colophon · MakerC.1

Quentin Lecocq · @celdama

Dev fullstack · CRO freelance · Lille, FR

Lantern est un side-project — agrégation, scoring IA, digest hebdo. Construit avec Next.js 16, Drizzle, Neon & Claude. Un seul mainteneur.

[X][GitHub][RSS][Site]

RaccourcisC.2

Recherche⌘ K
Article suivantJ
Article précédentK
OuvrirEnter
FavoriF

Veille dev & IA

§Feed · 879 articles

Feed · 879 articles