[⋯]Chargement

Built solo in Lille, FR·v0.6

Veille dev & IA

Le meilleur du dev et de l'IA, scoré chaque jour par un agent. Filtré, résumé, classé. Aucune couleur, aucun bruit — juste la matière.

Issue: No. 141
Date: 21 MAI 2026
Édition: FR · DAILY
Sources: 14 actives
Articles: 42 aujourd'hui

§ Feed·Vol. 02·No. 141

Last ingest·10:00 UTC+2·Next·08:00

Filtres

Reference PanelA.1

01. Type— 5

02. Période— 3

03. Source— 7

04. Score— min.

0 actifs

$⌘K

Articles / jour42

7-jour moy.42

Lun → Dim-62%

Feed · 851 articles

trier parscore·DESC ↓

60104 SEPT00:00

articleHuggingFace Blog·il y a 9 m.

Welcome EmbeddingGemma, Google's new efficient embedding model

Google introduces EmbeddingGemma, a 308M-parameter multilingual embedding model optimized for on-device use with a 2K context window. It supports 100+ languages and achieves top scores on the MMTEB/MTEB benchmarks while staying under 200 MB RAM when quantized. The model uses a Gemma3-based encoder with mean pooling and an optional 768-d output that can be truncated to 512/256/128 for speed and memory efficiency, enabling fast on-device retrieval and RAG pipelines.

★★★★★·HuggingFace Blog

60202 SEPT16:54

articleHuggingFace Blog·il y a 9 m.

SAIR: Accelerating Pharma R&D with AI-Powered Structural Intelligence

SAIR, publié par SandboxAQ sur Hugging Face, est le plus grand jeu de données open source de structures protéine-ligand 3D co-pliées, avec 5,24 millions de complexes et des IC50 expérimentaux, adressant une pénurie historique de données pour l’IA en découverte de médicaments.

★★★★★·HuggingFace Blog

60302 SEPT00:00

articleHuggingFace Blog·il y a 9 m.

Make your ZeroGPU Spaces go brrr with ahead-of-time compilation

Cet article présente l’intégration de la compilation ahead-of-time (AoT) dans ZeroGPU Spaces pour accélérer les démos IA. Il décrit les étapes: préparer les entrées, exporter et compiler le modèle, puis l’utiliser dans le pipeline, avec des exemples concrets et des démos. Il aborde aussi des optimisations avancées comme la quantification FP8, les shapes dynamiques et la compilation régionale.

★★★★★·HuggingFace Blog

60420 AOÛT22:13

articleHuggingFace Blog·il y a 9 m.

NVIDIA Releases 6 Million Multi-Lingual Reasoning Dataset

NVIDIA releases the 6 Million Multilingual Reasoning Dataset for Nemotron, translating English reasoning data into French, German, Italian, Japanese, and Spanish. The Nemotron Nano 2 9B uses a hybrid Transformer–Mamba architecture with a configurable thinking budget to balance accuracy, throughput, and cost for edge deployments, with weights released under an open license.

★★★★★·HuggingFace Blog

60519 AOÛT00:00

articleHuggingFace Blog·il y a 9 m.

Generate Images with Claude and Hugging Face

L’article montre comment connecter Claude à Hugging Face Spaces pour générer des images avec les modèles Flux.1 Krea (réalistes) et Qwen-Image (texte précis), en profitant d’un MCP server et de crédits gratuits pour itérer rapidement sur les prompts.

★★★★★·HuggingFace Blog

60618 AOÛT00:00

articleHuggingFace Blog·il y a 9 m.

From Zero to GPU: A Guide to Building and Scaling Production-Ready CUDA Kernels

Cet article-guide explique comment concevoir et déployer des kernels CUDA prêts pour la production avec kernel-builder. Il couvre l’architecture d’un kernel moderne, la structure d’un projet, build.toml et flake.nix pour la reproductibilité, et l’enregistrement d’un opérateur PyTorch natif. Exemple RGB→Grayscale illustre l’écriture et le test d’un kernel CUDA.

★★★★★·HuggingFace Blog

60718 AOÛT00:00

mcpHuggingFace Blog·il y a 9 m.

MCP for Research: How to Connect AI to Research Tools

L’article présente le Model Context Protocol (MCP) comme une norme permettant aux modèles de recherche d’interagir avec des outils externes via des requêtes en langage naturel. Il décrit trois couches d’abstraction — recherche manuelle, scripts et MCP — pour automatiser la découverte scientifique (papers, code, modèles, datasets). Une configuration rapide est proposée via les settings MCP de Hugging Face.

★★★★★·HuggingFace Blog

60814 AOÛT12:13

articleHuggingFace Blog·il y a 9 m.

Kimina-Prover-RL

Kimina-Prover-RL est une pipeline open-source d’entraînement pour la démonstration formelle en Lean 4, basée sur une structure pensée en deux étapes (raisonnement puis Lean). Deux modèles open-source (1.7B et 0.6B) atteignent des scores élevés sur MiniF2F. Le projet fournit kimina-lean-server, kimina-client et le dataset Kimina-Prover-Promptset, et se réutilise via un fork Verl.

★★★★★·HuggingFace Blog

60913 AOÛT14:55

articleHuggingFace Blog·il y a 9 m.

Arm & ExecuTorch 0.7: Bringing Generative AI to the masses

Arm's KleidiAI powers ExecuTorch 0.7, enabling automatic on-device acceleration for GenAI across many Arm CPUs and edge devices. Using SDOT and I8MM, models like Llama 3.2 run efficiently on most Android devices and Raspberry Pi 5, with notable gains in prefill/decode speed.

★★★★★·HuggingFace Blog

61012 AOÛT14:52

articleHuggingFace Blog·il y a 9 m.

Neural Super Sampling is here!

Neural Super Sampling (NSS) is Arm's real-time upscaling model for mobile GPUs with Neural Accelerators, enabling high‑resolution rendering at lower compute cost for mobile gaming and XR. A demo shows a 50% GPU workload reduction, upscaling from 540p to 1080p in 4 ms, with NSS integrated into Unreal Engine via plugins and Vulkan ML extensions; supporting resources include a technical blog, a white paper, and quickstart guides.

★★★★★·HuggingFace Blog

61112 AOÛT00:00

articleHuggingFace Blog·il y a 9 m.

FilBench - Can LLMs Understand and Generate Filipino?

FilBench is a comprehensive evaluation suite to assess LLM capabilities for Tagalog, Filipino, and Cebuano, focusing on fluency, linguistic tasks, translation, and cultural knowledge. It organizes tasks into four categories (Cultural Knowledge, Classical NLP, Reading Comprehension, Generation) and reports a FilBench Score across 20+ models.

★★★★★·HuggingFace Blog

61212 AOÛT00:00

articleHuggingFace Blog·il y a 9 m.

TextQuests: How Good are LLMs at Text-Based Video Games?

TextQuests introduces a benchmark to evaluate LLM-based agents in text-based interactive games, emphasizing long-context reasoning and learning through exploration. It uses 25 classic Infocom games with two modes (With Clues / No Clues) and measures Game Progress and Harm over up to 500 steps, preserving full history for long-context evaluation.

★★★★★·HuggingFace Blog

61308 AOÛT00:00

toolHuggingFace Blog·il y a 10 m.

Introducing AI Sheets: a tool to work with datasets using open AI models!

AI Sheets is a no-code tool for building, transforming, and enriching datasets with open AI models, deployable locally or on the Hugging Face Hub. It uses a spreadsheet-like UI where prompts generate new columns, and you can validate or edit cells to fine-tune prompts with few-shot examples. It supports comparing models, data cleaning, classification, enrichment, and synthetic data generation.

★★★★★·HuggingFace Blog

61408 AOÛT00:00

articleHuggingFace Blog·il y a 10 m.

Accelerate ND-Parallel: A guide to Efficient Multi-GPU Training

Accelerate ND-Parallel guides how to combine multiple parallelism strategies (data, fully sharded data, tensor, context) for multi-GPU training. It provides concrete config examples (dp_shard_size, dp_replicate_size, cp_size, tp_size) and an FSDP plugin, plus Axolotl integration and end-to-end training scripts to minimize inter-device communication at scale. The article also discusses how to compose strategies for large models and points to ready configs and docs.

★★★★★·HuggingFace Blog

61507 AOÛT00:00

articleHuggingFace Blog·il y a 10 m.

Vision Language Model Alignment in TRL

TRL expands Vision Language Model alignment with Mixed Preference Optimization (MPO), Group Relative Policy Optimization (GRPO), and Group Sequence Policy Optimization (GSPO), extending beyond pairwise DPO to richer signals. It also adds Reinforce Leave One Out (RLOO) and Online DPO for scalable multimodal alignment, plus native SFT support and reproducible training scripts.

★★★★★·HuggingFace Blog

61605 AOÛT00:00

articleHuggingFace Blog·il y a 10 m.

Welcome GPT OSS, the new open-source model family from OpenAI!

OpenAI releases GPT OSS, an open-weight family with 117B and 21B MoE models using 4-bit MXFP4 quantization for fast inference and low resource use. The 120B fits on an 80GB GPU and the 20B on 16GB, both Apache-2.0 licensed; usable via Hugging Face Inference Providers for local or on-device deployments.

★★★★★·HuggingFace Blog

61704 AOÛT19:51

articleHuggingFace Blog·il y a 10 m.

Measuring Open-Source Llama Nemotron Models on DeepResearch Bench

AI-Q combines Llama 3.3-70B Instruct and Llama-3.3-Nemotron-Super-49B-v1.5 to enable long-context retrieval, agentic reasoning, and tool use in open-source stacks. NVIDIA details model lineage, post-training, and transparent evaluation metrics (hallucination detection, multi-source synthesis, citation trust, RAGAS), plus a 49B Nemotron running on a single H100. DeepResearch Bench ranks AI-Q top among fully open stacks with a score of 40.52 in LLM with Search (Aug 2025).

★★★★★·HuggingFace Blog

61801 AOÛT14:25

articleHuggingFace Blog·il y a 10 m.

3LM: A Benchmark for Arabic LLMs in STEM and Code

3LM est un benchmark multidomaine pour évaluer les LLM arabes en STEM et en code, déployant trois jeux de données (Native STEM MCQs, Synthetic STEM et Arabic Code Benchmarks) et des métriques comme pass@1 via EvalPlus. Le pipeline combine OCR, génération par LLM et vérifications humaines, et propose l'accès aux jeux sur HuggingFace et le code sur GitHub.

★★★★★·HuggingFace Blog

61931 JUIL00:00

articleHuggingFace Blog·il y a 10 m.

Implementing MCP Servers in Python: An AI Shopping Assistant with Gradio

The article demonstrates how to implement an MCP server with Gradio to plug an LLM into Hugging Face models. It covers automatic conversion of Python functions into MCP tools, real-time progress notifications, and automatic file uploads, illustrated by an AI shopping assistant using IDM-VTON for virtual try-on.

★★★★★·HuggingFace Blog

62029 JUIL00:00

articleHuggingFace Blog·il y a 10 m.

Introducing Trackio: A Lightweight Experiment Tracking Library from Hugging Face

Trackio est une bibliothèque légère d’expérimentation open-source qui remplace wandb et offre un tableau de bord local Gradio, avec synchronisation vers Hugging Face Spaces pour partager les métriques. Elle vise simplicité, traçabilité des métriques et énergie GPU via nvidia-smi, et peut être utilisée comme drop-in wandb.

★★★★★·HuggingFace Blog

Page 31 / 43

← Préc.Suiv. →

20 sur 851 affichés

Issue 141 · Digest

Le résumé hebdo, livré dimanche.

20 articles classés par un agent. Aucun bruit, aucune pub. Désabonnement en un clic.

S'abonner →

[top 7 jours]B.1

01.
Chasing down why installing the kernel segfaulted
Lobsters
02.
I turned a $80 RK3562 Android tablet into a Debian Linux workstation
Hacker News (100+ pts)
03.
Mullvad exit IPs as a fingerprinting vector
Lobsters
04.
Granite Embedding Multilingual R2: Open Apache 2.0 Multilingual Embeddings with 32K Context — Best Sub-100M Retrieval Quality
HuggingFace Blog
05.
int a = 5; a = a++ + ++a; a = ? (2011)
Lobsters

Colophon · MakerC.1

Quentin Lecocq · @celdama

Dev fullstack · CRO freelance · Lille, FR

Lantern est un side-project — agrégation, scoring IA, digest hebdo. Construit avec Next.js 16, Drizzle, Neon & Claude. Un seul mainteneur.

[X][GitHub][RSS][Site]

RaccourcisC.2

Recherche⌘ K
Article suivantJ
Article précédentK
OuvrirEnter
FavoriF

Veille dev & IA

§Feed · 851 articles

Feed · 851 articles