
Built solo in Lille, FR·v0.6

Dev & AI feed

The best of dev and AI, scored every day by an agent. Filtered, summarized, ranked. No color, no noise — just the substance.

Issue: No. 139
Date: MAY 19, 2026
Edition: EN · DAILY
Sources: 14 active
Articles: 31 today

§ Feed·Vol. 02·No. 139

Last ingest·08:00 UTC+0·Next·08:00


Articles / day: 31

7-day avg.: 48

Mon → Sun: -83%

Feed · 834 articles

Sort by: score · DESC ↓

561 · OCT 16 · 00:00

article · HuggingFace Blog · 7 mo. ago

Google Cloud C4 Brings a 70% TCO improvement on GPT OSS with Intel and Hugging Face

Intel and Hugging Face demonstrate real-world gains running GPT OSS on Google Cloud C4 VMs with Granite Rapids, achieving up to 1.7x lower TCO and 1.4–1.7x higher TPOT throughput per vCPU/dollar versus C3. The work optimizes MoE expert execution to avoid redundant computation, improving utilization on large models.

★★★★★·HuggingFace Blog

562 · OCT 15 · 00:00

article · HuggingFace Blog · 7 mo. ago

Get your VLM running in 3 simple steps on Intel CPUs

This article presents a 3-step procedure for running a Vision Language Model locally on an Intel CPU, using Optimum Intel and OpenVINO with SmolVLM. It details the conversion to OpenVINO IR and two quantization approaches (weight-only quantization, WOQ, and static quantization) to reduce memory use and speed up inference.

★★★★★·HuggingFace Blog

563 · OCT 13 · 23:00

article · HuggingFace Blog · 7 mo. ago

Nemotron-Personas-India: Synthesized Data for Sovereign AI

NVIDIA releases Nemotron-Personas-India, an open-source dataset of 21M synthetic personas aligned with India's demographic, geographic, and cultural distributions, in English and Hindi (Devanagari/Latin scripts), built with NeMo Data Designer to support the development of multilingual LLMs and culturally aware copilots.

★★★★★·HuggingFace Blog

564 · OCT 07 · 09:37

article · HuggingFace Blog · 7 mo. ago

BigCodeArena: Judging code generations end to end with code executions

BigCodeArena is a human-in-the-loop platform that evaluates AI code generation by executing code in sandboxed environments across multiple languages and frameworks. It enables interactive testing, multi-turn conversations, and community voting to rank models, addressing key evaluation gaps in code generation. The platform has gathered over 14,000 conversations since February 2025.

★★★★★·HuggingFace Blog

565 · OCT 02 · 00:00

article · HuggingFace Blog · 8 mo. ago

SOTA OCR with Core ML and dots.ocr

The article describes converting the 3B-parameter OCR model dots.ocr to run on-device via Core ML and MLX. It shows how to capture the PyTorch graph and compile it into an .mlpackage while simplifying the model (a single image, the vision encoder in Core ML, the LM in MLX). The result: capable OCR with no API or network, with a notable power-efficiency gain on the Neural Engine.

★★★★★·HuggingFace Blog

566 · OCT 01 · 00:00

article · HuggingFace Blog · 8 mo. ago

Introducing RTEB: A New Standard for Retrieval Evaluation

The article introduces RTEB, a new retrieval benchmark designed to faithfully measure how well embedding models generalize by mixing public and private datasets. It criticizes existing benchmarks as too skewed by "teaching to the test" and as mismatched with real-world use cases. RTEB aims to offer transparent, reproducible evaluation to guide the development of robust models.

★★★★★·HuggingFace Blog

567 · SEP 29 · 00:00

article · HuggingFace Blog · 8 mo. ago

VibeGame: Exploring Vibe Coding Games

The article explores the limits of "vibe coding" for game development, where AI models struggle to handle context growth and platform complexity. It examines Roblox MCP, Unity MCP, and the web, concluding that good context management and high-level abstractions matter. To address this, the author proposes Shallot, a lightweight, general-purpose context management system.

★★★★★·HuggingFace Blog

568 · SEP 29 · 00:00

article · HuggingFace Blog · 8 mo. ago

Accelerating Qwen3-8B Agent on Intel® Core™ Ultra with Depth-Pruned Draft Models

The article presents the acceleration of Qwen3-8B (an agentic model) on Intel Core Ultra through speculative decoding with a Qwen3-0.6B draft model, then a refinement via depth pruning (6/28 layers) and fine-tuning, reaching up to ~1.4× speedup. These optimizations can be integrated via OpenVINO.GenAI and are compatible with smolagents for deploying a performant local agent.

★★★★★·HuggingFace Blog

569 · SEP 26 · 06:25

article · HuggingFace Blog · 8 mo. ago

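Nemotron-Personas-Japan: A Synthetic Dataset for Sovereign AI

Nemotron-Personas-Japan is a synthetic persona dataset aligned with Japan's official statistics, containing 6 million Japanese-language personas in total. The data is released under CC BY 4.0, and with a synthetic data generation pipeline built on NeMo Data Designer and multiple generation backends, it supports fine-tuning of Japanese-language AI and sovereign AI development. It contains no personally identifiable information and expresses statistical attributes such as education, occupation, region, and cultural background in natural language.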

★★★★★·HuggingFace Blog

570 · SEP 26 · 00:00

article · HuggingFace Blog · 8 mo. ago

Swift Transformers Reaches 1.0 – and Looks to the Future

Swift Transformers 1.0 brings greater stability and maturity for Apple developers integrating local models. The release promotes the Tokenizers and Hub modules to first-class modules and relies on swift-jinja for complex chat templates. The goal is to strengthen upcoming MLX and agentic use cases.

★★★★★·HuggingFace Blog

571 · SEP 23 · 00:00

article · HuggingFace Blog · 8 mo. ago

Smol2Operator: Post-Training GUI Agents for Computer Use

The article presents Smol2Operator, a post-training method that gives a lightweight VLM (SmolVLM2-2.2B-Instruct) the ability to understand and interact with graphical interfaces. In two phases, first perceptual grounding, then cognition and agentic behavior, the authors convert heterogeneous data into a unified, open-source action space. They release models, data, tools, and recipes to reproduce and extend the research.

★★★★★·HuggingFace Blog

572 · SEP 22 · 06:45

article · HuggingFace Blog · 8 mo. ago

SyGra: The One-Stop Framework for Building Data for LLMs and SLMs

SyGra is a low-code/no-code framework that simplifies creating, transforming, and aligning data for LLMs and SLMs. It provides a Python library, supports several inference backends, and covers many scenarios (Q&A, DPO, reasoning, cross-lingual, quality filtering). The tool aims to speed up alignment and reduce manual data curation work.

★★★★★·HuggingFace Blog

573 · SEP 22 · 00:00

other · HuggingFace Blog · 8 mo. ago

Gaia2 and ARE: Empowering the community to study agents

Gaia2 is a new agent evaluation benchmark designed to simulate real-world conditions (read/write operations, noise, API failures, time-dependent tasks). It builds on the open-source ARE framework to run, debug, and compare agents on complex human scenarios. Gaia2 and ARE aim to make open-world agents easier to debug and analyze.

★★★★★·HuggingFace Blog

574 · SEP 19 · 00:00

article · HuggingFace Blog · 8 mo. ago

Scaleway on Hugging Face Inference Providers

Scaleway is now a supported Inference Provider on the Hugging Face Hub, enabling serverless inference directly from model pages and via Python/JS SDKs. It provides access to models like gpt-oss, Qwen3, and Gemma 3, with pricing from €0.20 per million tokens and latency under 200ms for first tokens. The article covers two call modes (custom key vs routed by HF) and includes code examples for Python and JS.

★★★★★·HuggingFace Blog

575 · SEP 18 · 00:00

article · HuggingFace Blog · 8 mo. ago

Democratizing AI Safety with RiskRubric.ai

RiskRubric.ai introduces a standardized risk assessment for AI models, delivering 0-100 scores and A-F grades across six pillars: transparency, reliability, security, privacy, safety, and reputation. The framework runs 1,000+ reliability tests, 200+ adversarial probes, automated code scans, and data/privacy reviews, enabling deployment filters and minimum score thresholds (e.g., 75).

★★★★★·HuggingFace Blog

576 · SEP 17 · 00:00

article · HuggingFace Blog · 8 mo. ago

Public AI on Hugging Face Inference Providers

Public AI is now a supported inference provider on Hugging Face Hub, enabling serverless inference with public and sovereign models via an OpenAI-compatible API on vLLM. Integration is offered in the website UI and Python/JS SDKs, allowing custom keys or HF routing. The project is open-source and backed by distributed infrastructure donations.

★★★★★·HuggingFace Blog

577 · SEP 16 · 00:00

article · HuggingFace Blog · 8 mo. ago

`LeRobotDataset:v3.0`: Bringing large-scale datasets to `lerobot`

LeRobotDataset v3.0 packs multiple episodes per file with relational metadata to fetch individual episodes, and adds native streaming via StreamingLeRobotDataset. It includes a one-liner to convert v2.1 datasets to v3.0 and integrates with lerobot for PyTorch DataLoader usage. The format supports multi-modal robotics data and metadata-driven search on Hugging Face Hub.

★★★★★·HuggingFace Blog

578 · SEP 15 · 00:00

article · HuggingFace Blog · 8 mo. ago

Visible Watermarking with Gradio

Visible watermarking in Gradio is presented as a simple way to distinguish AI-generated content by adding watermarks to images, videos, and text interfaces. The post shows how to use a single parameter (watermark) in components like gr.Image, gr.Video, and gr.Chatbot to display watermarks, including QR-based options. It also demonstrates attribution for copied AI text and provides practical Space examples.

★★★★★·HuggingFace Blog

579 · SEP 11 · 20:04

article · HuggingFace Blog · 8 mo. ago

Introducing the Palmyra-mini family: Powerful, lightweight, and ready to reason!

WRITER introduces Palmyra-mini: three lightweight models (1.5–1.7B), including two thinking variants optimized for reasoning and math; CoT mode yields better performance, scoring 82.87% on GSM8K and 92.5% on AMC23. GGUF/MLX options are available, along with inference via vLLM/SGLang/TGI.

★★★★★·HuggingFace Blog

580 · SEP 11 · 00:00

article · HuggingFace Blog · 8 mo. ago

Tricks from OpenAI gpt-oss YOU can use with transformers

The article details the technical optimizations shipped with GPT-OSS in Hugging Face Transformers: kernels downloadable from the Hub (including RMSNorm and MoE), MXFP4 quantization, tensor and expert parallelism, sliding window, and continuous batching with Paged Attention. These features improve model loading, inference, and fine-tuning while remaining applicable to other models in the library.

★★★★★·HuggingFace Blog

Page 29 / 42


20 of 834 shown


Top 7 days · B.1

01. I turned a $80 RK3562 Android tablet into a Debian Linux workstation · Hacker News (100+ pts)
02. Mullvad exit IPs as a fingerprinting vector · Lobsters
03. Granite Embedding Multilingual R2: Open Apache 2.0 Multilingual Embeddings with 32K Context, Best Sub-100M Retrieval Quality · HuggingFace Blog
04. what 262,715 regex questions on stack overflow haven't answered · Lobsters
05. int a = 5; a = a++ + ++a; a = ? (2011) · Lobsters

Colophon · Maker · C.1

Quentin Lecocq · @celdama

Fullstack dev · CRO freelance · Lille, FR

Lantern is a side-project — aggregation, AI scoring, weekly digest. Built with Next.js 16, Drizzle, Neon & Claude. One maintainer.
