
© 2026 Lantern·Set in Geist Mono·Sources [52]·Methodology·Privacy

Built solo in Lille, FR·v0.6

Dev & AI Watch

The best of dev and AI, scored every day by an agent. Filtered, summarized, ranked. No color, no noise, just the substance.

Issue: No. 138
Date: 18 MAY 2026
Edition: FR · DAILY
Sources: 14 active
Articles: 35 today

§ Feed·Vol. 02·No. 138

Last ingest·10:00 UTC+2·Next·08:00


Articles / day: 35

7-day avg.: 52

Mon → Sun: -91%

Feed · 822 articles

Sorted by score · DESC ↓

561 · 22 SEP · 00:00

other · HuggingFace Blog · 8 months ago

Gaia2 and ARE: Empowering the community to study agents

Gaia2 is a new agent evaluation benchmark designed to simulate real-world conditions (read/write actions, noise, API failures, time-dependent tasks). It builds on the open-source ARE framework to run, debug, and compare agents on complex human scenarios. Gaia2 and ARE aim to make open-world agents easier to debug and analyze.

★★★★★·HuggingFace Blog

562 · 19 SEP · 00:00

article · HuggingFace Blog · 8 months ago

Scaleway on Hugging Face Inference Providers

Scaleway is now a supported Inference Provider on the Hugging Face Hub, enabling serverless inference directly from model pages and via Python/JS SDKs. It provides access to models like gpt-oss, Qwen3, and Gemma 3, with pricing from €0.20 per million tokens and latency under 200ms for first tokens. The article covers two call modes (custom key vs routed by HF) and includes code examples for Python and JS.

★★★★★·HuggingFace Blog

563 · 18 SEP · 00:00

article · HuggingFace Blog · 8 months ago

Democratizing AI Safety with RiskRubric.ai

RiskRubric.ai introduces a standardized risk assessment for AI models, delivering 0-100 scores and A-F grades across six pillars: transparency, reliability, security, privacy, safety, and reputation. The framework runs 1,000+ reliability tests, 200+ adversarial probes, automated code scans, and data/privacy reviews, enabling deployment filters and minimum score thresholds (e.g., 75).

★★★★★·HuggingFace Blog
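
The minimum-score deployment filter mentioned in the RiskRubric.ai entry above could look roughly like the sketch below. It is only an illustration: the `ModelRisk` record, the `deployable` helper, and the model names are hypothetical, and the only values taken from the summary are the 0-100 score range and the example threshold of 75.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelRisk:
    """Hypothetical record: a model name and its overall 0-100 risk score."""
    name: str
    score: int


def deployable(models, min_score=75):
    """Return the models whose score meets the minimum threshold.

    The default of 75 mirrors the example threshold quoted in the summary.
    """
    for m in models:
        if not 0 <= m.score <= 100:
            raise ValueError(f"score out of range for {m.name}: {m.score}")
    return [m for m in models if m.score >= min_score]


# Made-up catalog, for illustration only.
catalog = [
    ModelRisk("model-a", 82),
    ModelRisk("model-b", 74),
    ModelRisk("model-c", 75),
]
allowed = [m.name for m in deployable(catalog)]
print(allowed)
```

A score exactly at the threshold passes here; whether the real service treats the boundary as inclusive is an assumption.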

564 · 17 SEP · 00:00

article · HuggingFace Blog · 8 months ago

Public AI on Hugging Face Inference Providers

Public AI is now a supported inference provider on Hugging Face Hub, enabling serverless inference with public and sovereign models via an OpenAI-compatible API on vLLM. Integration is offered in the website UI and Python/JS SDKs, allowing custom keys or HF routing. The project is open-source and backed by distributed infrastructure donations.

★★★★★·HuggingFace Blog

565 · 16 SEP · 00:00

article · HuggingFace Blog · 8 months ago

`LeRobotDataset:v3.0`: Bringing large-scale datasets to `lerobot`

LeRobotDataset v3.0 packs multiple episodes per file with relational metadata to fetch individual episodes, and adds native streaming via StreamingLeRobotDataset. It includes a one-liner to convert v2.1 datasets to v3.0 and integrates with lerobot for PyTorch DataLoader usage. The format supports multi-modal robotics data and metadata-driven search on Hugging Face Hub.

★★★★★·HuggingFace Blog

566 · 15 SEP · 00:00

article · HuggingFace Blog · 8 months ago

Visible Watermarking with Gradio

Visible watermarking in Gradio is presented as a simple way to distinguish AI-generated content by adding watermarks to images, videos, and text interfaces. The post shows how to use a single parameter (watermark) in components like gr.Image, gr.Video, and gr.Chatbot to display watermarks, including QR-based options. It also demonstrates attribution for copied AI text and provides practical Space examples.

★★★★★·HuggingFace Blog

567 · 11 SEP · 20:04

article · HuggingFace Blog · 8 months ago

Introducing the Palmyra-mini family: Powerful, lightweight, and ready to reason!

WRITER introduces Palmyra-mini: three lightweight models (1.5–1.7B), two of which are thinking variants optimized for reasoning and math. Chain-of-thought (CoT) mode improves performance, with reported scores of 82.87% on GSM8K and 92.5% on AMC23. GGUF and MLX builds are available, along with inference via vLLM, SGLang, or TGI.

★★★★★·HuggingFace Blog

568 · 11 SEP · 00:00

article · HuggingFace Blog · 8 months ago

Tricks from OpenAI gpt-oss YOU can use with transformers

The article details the technical optimizations shipped with gpt-oss in Hugging Face Transformers: kernels downloadable from the Hub (including RMSNorm and MoE), MXFP4 quantization, tensor and expert parallelism, sliding window, and continuous batching with paged attention. These features improve model loading, inference, and fine-tuning, and they remain applicable to other models in the library.

★★★★★·HuggingFace Blog

569 · 10 SEP · 17:04

article · HuggingFace Blog · 8 months ago

Fine-tune Any LLM from the Hugging Face Hub with Together AI

Together AI now enables fine-tuning of any compatible Hugging Face LLM on its infrastructure, exposing HF Hub models to streamlined customization. The guide shows a 5-minute setup to launch a training job using a base Together template and a Hugging Face model as the custom model, with options for private repos. After training, the fine-tuned model can be pushed back to the Hub automatically.

★★★★★·HuggingFace Blog

570 · 10 SEP · 00:00

article · HuggingFace Blog · 8 months ago

Jupyter Agents: training LLMs to reason with notebooks

The article presents Jupyter Agents, a pipeline for training LLMs to solve data science tasks by executing code in Jupyter notebooks. It covers benchmarking on DABStep, scaffolding improvements, and fine-tuning Qwen3-4B-Thinking to improve the performance of small models.

★★★★★·HuggingFace Blog

571 · 09 SEP · 00:00

article · HuggingFace Blog · 8 months ago

mmBERT: ModernBERT goes Multilingual

mmBERT is a massively multilingual encoder trained on over 3T tokens across 1,800+ languages, achieving state-of-the-art results with faster inference than prior models. It extends ModernBERT with novel components and a three-phase training schedule to improve learning for low-resource languages, using progressively sampled, broader data.

★★★★★·HuggingFace Blog

572 · 04 SEP · 00:00

article · HuggingFace Blog · 9 months ago

Welcome EmbeddingGemma, Google's new efficient embedding model

Google introduces EmbeddingGemma, a 308M-parameter multilingual embedding model optimized for on-device use with a 2K context window. It supports 100+ languages and achieves top scores on the MMTEB/MTEB benchmarks while staying under 200 MB RAM when quantized. The model uses a Gemma3-based encoder with mean pooling and an optional 768-d output that can be truncated to 512/256/128 for speed and memory efficiency, enabling fast on-device retrieval and RAG pipelines.

★★★★★·HuggingFace Blog
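
The truncation described in the EmbeddingGemma entry above (768 dimensions cut down to 512, 256, or 128) can be sketched in plain Python as follows. This is an assumption-laden illustration, not the model's API: `truncate_embedding` is a hypothetical helper, the input vector is synthetic, and re-normalizing after truncation is a common practice that the summary does not confirm.

```python
import math


def truncate_embedding(vec, dim):
    """Keep the first `dim` components and rescale to unit length.

    Re-normalizing is an assumption: it keeps dot products comparable
    to cosine similarity after the vector has been shortened.
    """
    if dim <= 0 or dim > len(vec):
        raise ValueError(f"dim must be in 1..{len(vec)}, got {dim}")
    head = list(vec[:dim])
    norm = math.sqrt(sum(x * x for x in head))
    if norm == 0.0:
        return head  # all-zero prefix: nothing to rescale
    return [x / norm for x in head]


# Synthetic 768-d vector standing in for a real embedding.
full = [float(i % 7 - 3) for i in range(768)]
small = truncate_embedding(full, 256)
print(len(small))
```

Shorter vectors cost less memory and make similarity search faster, at the price of some retrieval quality; the right size depends on the workload.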

573 · 02 SEP · 16:54

article · HuggingFace Blog · 9 months ago

SAIR: Accelerating Pharma R&D with AI-Powered Structural Intelligence

SAIR, published by SandboxAQ on Hugging Face, is the largest open-source dataset of co-folded 3D protein-ligand structures, with 5.24 million complexes and experimental IC50 values. It addresses a long-standing shortage of data for AI in drug discovery.

★★★★★·HuggingFace Blog

574 · 02 SEP · 00:00

article · HuggingFace Blog · 9 months ago

Make your ZeroGPU Spaces go brrr with ahead-of-time compilation

This article presents the integration of ahead-of-time (AoT) compilation into ZeroGPU Spaces to speed up AI demos. It walks through the steps: prepare the inputs, export and compile the model, then use it in the pipeline, with concrete examples and demos. It also covers advanced optimizations such as FP8 quantization, dynamic shapes, and regional compilation.

★★★★★·HuggingFace Blog

575 · 20 AUG · 22:13

article · HuggingFace Blog · 9 months ago

NVIDIA Releases 6 Million Multi-Lingual Reasoning Dataset

NVIDIA releases the 6 Million Multilingual Reasoning Dataset for Nemotron, translating English reasoning data into French, German, Italian, Japanese, and Spanish. The Nemotron Nano 2 9B uses a hybrid Transformer–Mamba architecture with a configurable thinking budget to balance accuracy, throughput, and cost for edge deployments, with weights released under an open license.

★★★★★·HuggingFace Blog

576 · 19 AUG · 00:00

article · HuggingFace Blog · 9 months ago

Generate Images with Claude and Hugging Face

The article shows how to connect Claude to Hugging Face Spaces to generate images with the Flux.1 Krea (realistic output) and Qwen-Image (accurate text rendering) models, using an MCP server and free credits to iterate quickly on prompts.

★★★★★·HuggingFace Blog

577 · 18 AUG · 00:00

mcp · HuggingFace Blog · 9 months ago

MCP for Research: How to Connect AI to Research Tools

The article presents the Model Context Protocol (MCP) as a standard that lets research-oriented models interact with external tools through natural-language requests. It describes three layers of abstraction (manual research, scripts, and MCP) for automating scientific discovery across papers, code, models, and datasets. A quick setup is offered through Hugging Face's MCP settings.

★★★★★·HuggingFace Blog

578 · 18 AUG · 00:00

article · HuggingFace Blog · 9 months ago

From Zero to GPU: A Guide to Building and Scaling Production-Ready CUDA Kernels

This guide explains how to design and deploy production-ready CUDA kernels with kernel-builder. It covers the architecture of a modern kernel, project structure, build.toml and flake.nix for reproducibility, and registering a native PyTorch operator. An RGB→Grayscale example illustrates how to write and test a CUDA kernel.

★★★★★·HuggingFace Blog
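
For the RGB→Grayscale example mentioned in the CUDA kernel entry above, a CPU reference of the kind one might compare a kernel's output against could look like the sketch below. It is not the article's code: `rgb_to_gray` is a hypothetical helper, and the ITU-R BT.601 luma weights (0.299, 0.587, 0.114) are an assumption, since the summary does not say which coefficients the kernel uses.

```python
def rgb_to_gray(pixels, width, height):
    """Convert a flat, interleaved RGB byte buffer to 8-bit grayscale.

    Uses BT.601 luma weights in integer arithmetic (an assumption, see
    the lead-in) so results are deterministic and rounded to nearest.
    """
    if len(pixels) != width * height * 3:
        raise ValueError("buffer size does not match width * height * 3")
    gray = bytearray(width * height)
    for i in range(width * height):
        r, g, b = pixels[3 * i], pixels[3 * i + 1], pixels[3 * i + 2]
        # Weights scaled by 1000; adding 500 rounds to the nearest integer.
        gray[i] = (299 * r + 587 * g + 114 * b + 500) // 1000
    return bytes(gray)


# 2x2 test image: white, black, pure red, pure green.
image = bytes([255, 255, 255, 0, 0, 0, 255, 0, 0, 0, 255, 0])
result = rgb_to_gray(image, 2, 2)
print(list(result))  # → [255, 0, 76, 150]
```

A GPU kernel that computes the weights in floating point may differ from this reference by one gray level on some pixels, so a comparison would usually allow a small tolerance.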

579 · 14 AUG · 12:13

article · HuggingFace Blog · 9 months ago

Kimina-Prover-RL

Kimina-Prover-RL is an open-source training pipeline for formal theorem proving in Lean 4, built on a two-stage structure (reasoning, then Lean). Two open-source models (1.7B and 0.6B) achieve high scores on MiniF2F. The project provides kimina-lean-server, kimina-client, and the Kimina-Prover-Promptset dataset, and is reusable through a fork of Verl.

★★★★★·HuggingFace Blog

580 · 13 AUG · 14:55

article · HuggingFace Blog · 9 months ago

Arm & ExecuTorch 0.7: Bringing Generative AI to the masses

Arm's KleidiAI powers ExecuTorch 0.7, enabling automatic on-device acceleration for GenAI across many Arm CPUs and edge devices. Using SDOT and I8MM, models like Llama 3.2 run efficiently on most Android devices and Raspberry Pi 5, with notable gains in prefill/decode speed.

★★★★★·HuggingFace Blog

Page 29 / 42

20 of 822 shown

Issue 138 · Digest

The weekly summary, delivered on Sunday.

20 articles ranked by an agent. No noise, no ads. One-click unsubscribe.

[top 7 days] B.1

01.
I turned a $80 RK3562 Android tablet into a Debian Linux workstation
Hacker News (100+ pts)
02.
Mullvad exit IPs as a fingerprinting vector
Lobsters
03.
Granite Embedding Multilingual R2: Open Apache 2.0 Multilingual Embeddings with 32K Context — Best Sub-100M Retrieval Quality
HuggingFace Blog
04.
what 262,715 regex questions on stack overflow haven't answered
Lobsters
05.
int a = 5; a = a++ + ++a; a = ? (2011)
Lobsters

Colophon · Maker C.1

Quentin Lecocq · @celdama

Full-stack dev · Freelance CRO · Lille, FR

Lantern is a side project: aggregation, AI scoring, weekly digest. Built with Next.js 16, Drizzle, Neon & Claude. A single maintainer.
