[⋯]Chargement

Built solo in Lille, FR·v0.6

Veille dev & IA

Le meilleur du dev et de l'IA, scoré chaque jour par un agent. Filtré, résumé, classé. Aucune couleur, aucun bruit — juste la matière.

Issue: No. 141
Date: 21 MAI 2026
Édition: FR · DAILY
Sources: 14 actives
Articles: 42 aujourd'hui

§ Feed·Vol. 02·No. 141

Last ingest·10:00 UTC+2·Next·08:00

Filtres

Reference PanelA.1

01. Type— 5

02. Période— 3

03. Source— 7

04. Score— min.

0 actifs

$⌘K

Articles / jour42

7-jour moy.42

Lun → Dim-62%

Feed · 851 articles

trier parscore·DESC ↓

58107 OCT09:37

articleHuggingFace Blog·il y a 8 m.

BigCodeArena: Judging code generations end to end with code executions

BigCodeArena is a human-in-the-loop platform that evaluates AI code generation by executing code in sandboxed environments across multiple languages and frameworks. It enables interactive testing, multi-turn conversations, and community voting to rank models, addressing key evaluation gaps in code generation. The platform has gathered over 14,000 conversations since February 2025.

★★★★★·HuggingFace Blog

58202 OCT00:00

articleHuggingFace Blog·il y a 8 m.

SOTA OCR with Core ML and dots.ocr

L’article décrit la conversion du modèle OCR 3B paramètres dots.ocr pour fonctionnement on-device via Core ML et MLX. Il montre comment capturer le graphe PyTorch puis le compiler en .mlpackage, en simplifiant le modèle (une seule image, vision encoder en Core ML, LM en MLX). Résultat : OCR performant sans API ni réseau, avec un gain d’efficacité puissance notable sur le Neural Engine.

★★★★★·HuggingFace Blog

58301 OCT00:00

articleHuggingFace Blog·il y a 8 m.

Introducing RTEB: A New Standard for Retrieval Evaluation

L’article présente RTEB, un nouveau benchmark de retrieval conçu pour mesurer fidèlement la généralisation des modèles d’embeddings en mixant données publiques et privées. Il critique les benchmarks existants, trop corrompus par le « teaching to the test » et un décalage avec les cas d’usage réels. RTEB vise à offrir une évaluation transparente et reproductible pour orienter le développement de modèles robustes.

★★★★★·HuggingFace Blog

58429 SEPT00:00

articleHuggingFace Blog·il y a 8 m.

VibeGame: Exploring Vibe Coding Games

L'article explore les limites du « vibe coding » pour le développement de jeux, où les modèles d'IA peinent à gérer la croissance du contexte et la complexité des plateformes. Il analyse Roblox MCP, Unity MCP et le web, concluant à l'importance d'un bon context management et d'abstractions de haut niveau. Pour y remédier, l'auteur propose Shallot, un système de gestion de contexte léger et générique.

★★★★★·HuggingFace Blog

58529 SEPT00:00

articleHuggingFace Blog·il y a 8 m.

Accelerating Qwen3-8B Agent on Intel® Core™ Ultra with Depth-Pruned Draft Models

L’article présente l’accélération du Qwen3-8B (modèle agentique) sur Intel Core Ultra via la speculative decoding avec un modèle Qwen3-0.6B comme brouillon, puis un affinage par pruning de profondeur (6/28 couches) et finetuning, atteignant jusqu’à ~1,4× de speedup. Ces optimisent sont intégrables via OpenVINO.GenAI et compatibles avec smolagents pour déployer un agent local performant.

★★★★★·HuggingFace Blog

58626 SEPT06:25

articleHuggingFace Blog·il y a 8 m.

Nemotron-Personas-Japan: ソブリン AI のための合成データセット

Nemotron-Personas-Japan は、日本の公的統計に沿った合成ペルソナデータセットで、総計600万件の日本語ペルソナを含みます。データは CC BY 4.0 で公開され、NeMo Data Designer を用いた合成データ生成パイプラインと複数の生成バックエンドで、日本語AIのファインチューニングやソブリンAI開発を支援します。個人を特定できる情報は含まず、教育・職業・地域・文化背景などの統計属性を自然言語で表現します。

★★★★★·HuggingFace Blog

58726 SEPT00:00

articleHuggingFace Blog·il y a 8 m.

Swift Transformers Reaches 1.0 – and Looks to the Future

Swift Transformers 1.0 apporte une stabilité et une maturité accrues pour les développeurs Apple intégrant modèles locaux. La release rend les modules Tokenizers et Hub en modules de premier ordre et s’appuie sur swift-jinja pour les templates de chat complexes. L’objectif est de renforcer les cas d’usage MLX et agentic à venir.

★★★★★·HuggingFace Blog

58823 SEPT00:00

articleHuggingFace Blog·il y a 8 m.

Smol2Operator: Post-Training GUI Agents for Computer Use

L’article présente Smol2Operator, une méthode de post-entraînement qui donne à un VLM léger (SmolVLM2-2.2B-Instruct) des capacités de compréhension et d’interaction avec les interfaces graphiques. En deux phases — d’abord l’ancrage perçu, puis la cognition/agenticité — les auteurs transforment des données hétérogènes en un espace d’actions unifié et open source. Ils libèrent modèles, données, outils et recettes pour reproduire et étendre la recherche.

★★★★★·HuggingFace Blog

58922 SEPT06:45

articleHuggingFace Blog·il y a 8 m.

SyGra: The One-Stop Framework for Building Data for LLMs and SLMs

SyGra est un framework low‑code/no‑code qui simplifie la création, la transformation et l’alignement de données pour LLM et SLM. Il propose une librairie Python, supporte plusieurs backends d’inférence et couvre de nombreux scénarios (Q&A, DPO, raisonnement, cross‑langue, filtrage de qualité). L’outil vise à accélérer l’alignement et réduire le travail manuel de curatage de données.

★★★★★·HuggingFace Blog

59022 SEPT00:00

otherHuggingFace Blog·il y a 8 m.

Gaia2 and ARE: Empowering the community to study agents

Gaia2 est un nouveau benchmark d’évaluation d’agents conçu pour simuler des conditions du monde réel (lecture/écriture, bruit, échecs d’API, tâches temporelles). Il s’appuie sur le framework open-source ARE pour exécuter, déboguer et comparer des agents sur des scénarios humains complexes. Gaia2 et ARE visent à faciliter le debug et l’analyse des agents open‑world.

★★★★★·HuggingFace Blog

59119 SEPT00:00

articleHuggingFace Blog·il y a 8 m.

Scaleway on Hugging Face Inference Providers

Scaleway is now a supported Inference Provider on the Hugging Face Hub, enabling serverless inference directly from model pages and via Python/JS SDKs. It provides access to models like gpt-oss, Qwen3, and Gemma 3, with pricing from €0.20 per million tokens and latency under 200ms for first tokens. The article covers two call modes (custom key vs routed by HF) and includes code examples for Python and JS.

★★★★★·HuggingFace Blog

59218 SEPT00:00

articleHuggingFace Blog·il y a 8 m.

Democratizing AI Safety with RiskRubric.ai

RiskRubric.ai introduces a standardized risk assessment for AI models, delivering 0-100 scores and A-F grades across six pillars: transparency, reliability, security, privacy, safety, and reputation. The framework runs 1,000+ reliability tests, 200+ adversarial probes, automated code scans, and data/privacy reviews, enabling deployment filters and minimum score thresholds (e.g., 75).

★★★★★·HuggingFace Blog

59317 SEPT00:00

articleHuggingFace Blog·il y a 8 m.

Public AI on Hugging Face Inference Providers

Public AI is now a supported inference provider on Hugging Face Hub, enabling serverless inference with public and sovereign models via an OpenAI-compatible API on vLLM. Integration is offered in the website UI and Python/JS SDKs, allowing custom keys or HF routing. The project is open-source and backed by distributed infrastructure donations.

★★★★★·HuggingFace Blog

59416 SEPT00:00

articleHuggingFace Blog·il y a 8 m.

`LeRobotDataset:v3.0`: Bringing large-scale datasets to `lerobot`

LeRobotDataset v3.0 packs multiple episodes per file with relational metadata to fetch individual episodes, and adds native streaming via StreamingLeRobotDataset. It includes a one-liner to convert v2.1 datasets to v3.0 and integrates with lerobot for PyTorch DataLoader usage. The format supports multi-modal robotics data and metadata-driven search on Hugging Face Hub.

★★★★★·HuggingFace Blog

59515 SEPT00:00

articleHuggingFace Blog·il y a 8 m.

Visible Watermarking with Gradio

Visible watermarking in Gradio is presented as a simple way to distinguish AI-generated content by adding watermarks to images, videos, and text interfaces. The post shows how to use a single parameter (watermark) in components like gr.Image, gr.Video, and gr.Chatbot to display watermarks, including QR-based options. It also demonstrates attribution for copied AI text and provides practical Space examples.

★★★★★·HuggingFace Blog

59611 SEPT20:04

articleHuggingFace Blog·il y a 8 m.

Introducing the Palmyra-mini family: Powerful, lightweight, and ready to reason!

WRITER présente Palmyra-mini: trois modèles légers (1.5–1.7B) dont deux variantes thinking optimisées pour raisonnement et calcul; le mode CoT permet de meilleures performances, affichant 82.87% GSM8K et 92.5% AMC23. Des options GGUF/MLX et l'inférence via vLLM/SGLang/TGI.

★★★★★·HuggingFace Blog

59711 SEPT00:00

articleHuggingFace Blog·il y a 8 m.

Tricks from OpenAI gpt-oss YOU can use with transformers

L'article détaille les optimisations techniques livrées avec GPT-OSS dans Hugging Face Transformers : kernels téléchargeables sur le Hub (dont RMSNorm et MoE), MXFP4 quantization, tensor/p expert parallelism, sliding window et continuous batching / Paged Attention. Ces features améliorent le chargement, l'inférence et le fine‑tuning des modèles tout en restant applicables aux autres modèles de la librairie.

★★★★★·HuggingFace Blog

59810 SEPT17:04

articleHuggingFace Blog·il y a 8 m.

Fine-tune Any LLM from the Hugging Face Hub with Together AI

Together AI now enables fine-tuning of any compatible Hugging Face LLM on its infrastructure, exposing HF Hub models to streamlined customization. The guide shows a 5-minute setup to launch a training job using a base Together template and a Hugging Face model as the custom model, with options for private repos. After training, the fine-tuned model can be pushed back to the Hub automatically.

★★★★★·HuggingFace Blog

59910 SEPT00:00

articleHuggingFace Blog·il y a 8 m.

Jupyter Agents: training LLMs to reason with notebooks

L’article présente Jupyter Agents, une pipeline pour entraîner des LLMs à résoudre des tâches de data science en exécutant du code dans des notebooks Jupyter. Il détaille le benchmarking DABStep, les améliorations de scaffolding, et le fine-tuning de Qwen3-4B-Thinking pour accroître les performances des modèles petits.

★★★★★·HuggingFace Blog

60009 SEPT00:00

articleHuggingFace Blog·il y a 8 m.

mmBERT: ModernBERT goes Multilingual

mmBERT is a massively multilingual encoder trained on over 3T tokens across 1,800+ languages, achieving state-of-the-art results with faster inference than prior models. It extends ModernBERT with novel components and a three-phase training schedule to improve learning for low-resource languages, using progressively sampled, broader data.

★★★★★·HuggingFace Blog

Page 30 / 43

← Préc.Suiv. →

20 sur 851 affichés

Issue 141 · Digest

Le résumé hebdo, livré dimanche.

20 articles classés par un agent. Aucun bruit, aucune pub. Désabonnement en un clic.

S'abonner →

[top 7 jours]B.1

01.
Chasing down why installing the kernel segfaulted
Lobsters
02.
I turned a $80 RK3562 Android tablet into a Debian Linux workstation
Hacker News (100+ pts)
03.
Mullvad exit IPs as a fingerprinting vector
Lobsters
04.
Granite Embedding Multilingual R2: Open Apache 2.0 Multilingual Embeddings with 32K Context — Best Sub-100M Retrieval Quality
HuggingFace Blog
05.
int a = 5; a = a++ + ++a; a = ? (2011)
Lobsters

Colophon · MakerC.1

Quentin Lecocq · @celdama

Dev fullstack · CRO freelance · Lille, FR

Lantern est un side-project — agrégation, scoring IA, digest hebdo. Construit avec Next.js 16, Drizzle, Neon & Claude. Un seul mainteneur.

[X][GitHub][RSS][Site]

RaccourcisC.2

Recherche⌘ K
Article suivantJ
Article précédentK
OuvrirEnter
FavoriF

Veille dev & IA

§Feed · 851 articles

Feed · 851 articles