Feed·Digest·Sources·About·

[⋯]Chargement

© 2026 Lantern·Set in Geist Mono·Sources [52]·Methodology·Privacy

Built solo in Lille, FR·v0.6

Veille dev & IA

Le meilleur du dev et de l'IA, scoré chaque jour par un agent. Filtré, résumé, classé. Aucune couleur, aucun bruit — juste la matière.

Issue: No. 153
Date: 02 JUIN 2026
Édition: FR · DAILY
Sources: 14 actives
Articles: 29 aujourd'hui

§ Feed·Vol. 02·No. 153

Last ingest·10:00 UTC+2·Next·08:00

Filtres

Reference PanelA.1

01. Type— 5

02. Période— 3

03. Source— 7

04. Score— min.

0 actifs

$⌘K

Articles / jour29

7-jour moy.18

Lun → Dim

Feed · 879 articles

trier parscore·DESC ↓

70112 MAI00:00

articleHuggingFace Blog·l’an dernier

Vision Language Models (Better, faster, stronger)

Vision Language Models are getting smaller while becoming more capable, with new architectures enabling any-to-any inputs/outputs, multimodal retrieval and agents. The post surveys models like Chameleon/Lumina-mGPT, Qwen 2.5 Omni (Thinker-Talker), MiniCPM-o 2.6, Janus-Pro-7B, and Kimi-VL-A3B-Thinking, plus MoE decoders, RAG, safety, and new benchmarks (MMT-Bench, MMMU-Pro).

★★★★★·HuggingFace Blog

70211 MAI00:00

articleHuggingFace Blog·l’an dernier

LeRobot Community Datasets: The “ImageNet” of Robotics — When and How?

Cet article présente LeRobot Community Datasets comme une tentative de créer l'équivalent ImageNet pour la robotique, en insistant sur la nécessité de données diversifiées et de pratiques de curation robustes. Il identifie les défis actuels (annotations incohérentes, problèmes de correspondance de caractéristiques, épisodes de faible qualité) et propose des orientations pour améliorer la généralisation via des jeux de données plus ouverts et variés.

★★★★★·HuggingFace Blog

70330 AVR00:00

articleHuggingFace Blog·l’an dernier

How to Build an MCP Server with Gradio

Gradio now exposes Python functions as MCP tools, enabling LLMs to call them via an MCP server. The guide shows a concise 5-line example converting a letter-counting function into a tool, launching the server, and wiring it into MCP clients with a config snippet.

★★★★★·HuggingFace Blog

70430 AVR00:00

articleHuggingFace Blog·l’an dernier

The 4 Things Qwen-3’s Chat Template Teaches Us

Le Qwen-3 introduit un template de chat plus sophistiqué: la pensée peut être activée ou désactivée via enable_thinking, et la gestion du contexte est dynamique grâce à un rolling checkpoint qui conserve les réflexions pertinentes pendant les appels d’outils. Le texte souligne aussi une meilleure sérialisation des arguments des outils et montre comment ces choix influencent les performances et la lisibilité du flux de conversation par rapport à Qwen-2.5 et QwQ.

★★★★★·HuggingFace Blog

70529 AVR00:00

articleHuggingFace Blog·l’an dernier

Introducing AutoRound: Intel’s Advanced Quantization for LLMs and VLMs

AutoRound is Intel's weight-only post-training quantization that uses signed gradient descent to jointly optimize weight rounding and clipping for accurate low-bit quantization (INT2–INT8). It claims up to 2.1x higher relative accuracy at 2-bit, quantizes a 72B model in ~37 minutes on A100, and supports mixed-bit tuning, multiple export formats, and recipes (auto-round-best/light) with small calibration sets.

★★★★★·HuggingFace Blog

70629 AVR00:00

articleHuggingFace Blog·l’an dernier

Welcoming Llama Guard 4 on Hugging Face Hub

Meta lance Llama Guard 4, un modèle dense 12B multimodal pour filtrer les contenus inappropriés, disponible sur Hugging Face. Pruné à partir de Llama 4 Scout (pas MoE), il tourne sur un seul GPU et évalue texte et image avec 14 catégories de risques. La release inclut aussi Llama Prompt Guard 2 et des checkpoints ouverts, accompagnés d’un notebook interactif.

★★★★★·HuggingFace Blog

70725 AVR22:37

articleHuggingFace Blog·l’an dernier

PipelineRL

PipelineRL introduit des mises à jour de poids en vol pendant l'entraînement RL des LLM, permettant un débit d'inférence élevé tout en restant proche de l'on-policy. L'étude montre des résultats compétitifs sur 7B et 32B par rapport à Open-Reasoner-Zero sur AIME 2024 et MATH 500, avec une implémentation plus simple (pas de fonction valeur et sans pénalité KL).

★★★★★·HuggingFace Blog

70825 AVR00:00

articleHuggingFace Blog·l’an dernier

Tiny Agents: an MCP-powered agent in 50 lines of code

Cet article présente Tiny Agents, un agent MCP-powered en 50 lignes de code. Il explique que l’agent se résume à une boucle while sur un MCP client et détaille comment lancer des serveurs MCP locaux et exécuter des prompts (ex: recherche web, manipulation de fichiers) via un exemple en TypeScript.

★★★★★·HuggingFace Blog

70922 AVR18:33

articleHuggingFace Blog·l’an dernier

Finetuning olmOCR to be a faithful OCR-Engine

Researchers fine-tuned olmOCR-7B-0225-preview to preserve header and footer information, making it a more faithful OCR engine for business documents. They created a dataset of 8,000 documents with Qwen2.5-VL-72B-Instruct, trained with 4 gradient accumulation steps on 8xH100 for 2.5 epochs, and evaluated on header/footer-inclusive data using document anchoring. The result is a practical improvement for invoices and other layout-rich texts.

★★★★★·HuggingFace Blog

71016 AVR10:10

articleHuggingFace Blog·l’an dernier

Prefill and Decode for Concurrent Requests - Optimizing LLM Performance

Explains the two stages of token generation for LLMs—prefill, where input tokens are processed in parallel to produce the first token, and decode, where subsequent tokens are generated sequentially using a KV cache. It defines latency metrics (time to first token and time per output token) and analyzes how concurrent requests and batching affect throughput on multi-GPU setups. It also hints at batching patterns like prefill-first and chunked prefill to optimize latency.

★★★★★·HuggingFace Blog

71116 AVR00:00

articleHuggingFace Blog·l’an dernier

Cohere on Hugging Face Inference Providers

Cohere devient fournisseur d'inférence sur Hugging Face Hub, permettant l'inférence serverless via Cohere et Cohere Labs sur une gamme de modèles optimisés pour l'entreprise. Les points forts incluent des contextes longs (256k sur le modèle A-03-2025), une prise en charge multilingue (23 langues) et une RAG avec citations, sécurité et outils d'agentivité. L'article décrit l'usage via l'UI, les SDK clients et un notebook Colab pour tester, avec des exemples Python utilisant huggingface_hub.

★★★★★·HuggingFace Blog

71216 AVR00:00

articleHuggingFace Blog·l’an dernier

17 Reasons Why Gradio Isn't Just Another UI Library

Gradio is framed as a full framework for ML apps, not just a UI library, offering features like universal API access, an Interactive API Recorder, SSR for fast apps, automatic queueing and real-time streaming, and enterprise-grade security. The piece lists 17 capabilities that differentiate it for production ML workflows.

★★★★★·HuggingFace Blog

71316 AVR00:00

articleHuggingFace Blog·l’an dernier

Introducing HELMET: Holistically Evaluating Long-context Language Models

HELMET introduces a comprehensive benchmark for evaluating long-context language models, addressing the shortcomings of perplexity and synthetic tasks by emphasizing diversity, controllability, and reliability. The blog reports evaluation across 59 LCLMs, highlights real-world task gaps, and provides a quickstart guide and links to code, data, and the paper for practical replication.

★★★★★·HuggingFace Blog

71414 AVR00:00

articleHuggingFace Blog·l’an dernier

Hugging Face to sell open-source robots thanks to Pollen Robotics acquisition

Hugging Face acquires Pollen Robotics to push open-source robotics, extending LeRobot with Reachy 2, an open humanoid robot used in labs. Reachy 2 is open-source and VR-compatible, aimed at research and education, and can be ordered for $70,000.

★★★★★·HuggingFace Blog

71514 AVR00:00

articleHuggingFace Blog·l’an dernier

4M Models Scanned: Protect AI + Hugging Face 6 Months In

Protect AI and Hugging Face expanded Guardian's threat detection with four new modules (PAIT-ARV-100, PAIT-JOBLIB-101, PAIT-TF-200, PAIT-LMAFL-300) to cover more formats and obfuscation techniques. The integration emphasizes a zero-trust security stance with inline alerts on Hugging Face and a huntr bug‑bounty program, reporting 4.47M scans and 352k unsafe issues.

★★★★★·HuggingFace Blog

71611 AVR14:21

articleHuggingFace Blog·l’an dernier

Visual Salamandra: Pushing the Boundaries of Multimodal Understanding

Visual Salamandra extends the Salamandra 7B LLM to images and video via Google's SigLIP encoder and late-fusion, enabling vision-language alignment. It uses a four-phase training pipeline (projector pre-training, high-quality vision pretraining, instruction tuning, full multimodal tuning) with 6.1M instructions, prioritizing multilingual European data.

★★★★★·HuggingFace Blog

71709 AVR00:00

articleHuggingFace Blog·l’an dernier

Hugging Face and Cloudflare Partner to Make Real-Time Speech and Video Seamless with FastRTC

Hugging Face and Cloudflare have teamed up to give FastRTC developers instant access to enterprise-grade WebRTC infrastructure via a Hugging Face token, combining FastRTC's low-code real-time streams with Cloudflare's global TURN network. The integration allows free streaming up to 10 GB per month and provides a ready-made path to building low-latency voice and video apps, with a sample voice chat demo using Llama 4.

★★★★★·HuggingFace Blog

71808 AVR00:00

articleHuggingFace Blog·l’an dernier

Arabic Leaderboards: Introducing Arabic Instruction Following, Updating AraGen, and More

Arabic-Leaderboards Space unifies Arabic evaluations, housing AraGen-03-25 and Arabic Instruction Following, with plans to add more modalities. The AraGen-03-25 release expands to 340 QA/Reasoning/Orthography pairs and uses blind testing for fair evaluation, plus sharing Claude-3.5-Sonnet results to invite community review.

★★★★★·HuggingFace Blog

71905 AVR00:00

articleHuggingFace Blog·l’an dernier

Welcome Llama 4 Maverick & Scout on Hugging Face

Meta's Llama 4 Maverick (~400B) and Llama 4 Scout (~109B) are Mixture-of-Experts LLMs with 17B active parameters and native multimodality (text + images). They integrate with Hugging Face transformers and TGI, with Scout accessible on a single GPU via 4-/8-bit quantization and Maverick in BF16/FP8; Instruct variants support context lengths up to 1M tokens. Checkpoints are on the Hugging Face Hub under meta-llama, with Xet storage and the Llama 4 Community License.

★★★★★·HuggingFace Blog

72004 AVR00:00

articleHuggingFace Blog·l’an dernier

Journey to 1 Million Gradio Users!

Gradio evolved from a single high-level interface to a modular Blocks API, enabling flexible app composition. Its growth relied on investing in primitives, embedding virality via share links, and focusing on a growing niche. The key lesson: favor low-level components and rapid iteration to scale OSS tooling.

★★★★★·HuggingFace Blog

Page 36 / 44

← Préc.Suiv. →

20 sur 879 affichés

Issue 153 · Digest

Le résumé hebdo, livré dimanche.

20 articles classés par un agent. Aucun bruit, aucune pub. Désabonnement en un clic.

[top 7 jours]B.1

01.
thunderbolt-ibverbs: We have InfiniBand at home
Lobsters
02.
Beyond LLMs: Why Scalable Enterprise AI Adoption Depends on Agent Logic
HuggingFace Blog
03.
Five Years of Trying to Add Recursion to lychee
Lobsters
04.
ELF Linker Improvements in Zig
Lobsters
05.
UTF8 email with DMA: DragonFly Mail Agent
Lobsters

Colophon · MakerC.1

Quentin Lecocq · @celdama

Dev fullstack · CRO freelance · Lille, FR

Lantern est un side-project — agrégation, scoring IA, digest hebdo. Construit avec Next.js 16, Drizzle, Neon & Claude. Un seul mainteneur.

[X][GitHub][RSS][Site]

RaccourcisC.2

Recherche⌘ K
Article suivantJ
Article précédentK
OuvrirEnter
FavoriF