
Built solo in Lille, FR·v0.6

Dev & AI feed

The best of dev and AI, scored every day by an agent. Filtered, summarized, ranked. No color, no noise — just the substance.

Issue: No. 138
Date: MAY 18, 2026
Edition: EN · DAILY
Sources: 14 active
Articles: 35 today

§ Feed·Vol. 02·No. 138

Last ingest·08:00 UTC+0·Next·08:00


Articles / day: 35

7-day avg.: 52

Mon → Sun: -91%

Feed · 822 articles


681 · MAR 04 · 00:00

article · HuggingFace Blog · last yr.

A Deepdive into Aya Vision: Advancing the Frontier of Multilingual Multimodality

Cohere's Aya Vision unveils 8B and 32B multilingual vision-language models for 23 languages, with strong results on AyaVisionBench and mWildVision. It uses synthetic annotations, multilingual data expansion, and model merging, and handles high-res images via dynamic tiling and Pixel Shuffle downsampling before the vision-language connector and LLM. Training is in two stages: vision-language alignment with frozen encoders, then supervised fine-tuning across multilingual tasks.

★★★★★·HuggingFace Blog
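
A minimal sketch of the Pixel Shuffle downsampling step mentioned in the Aya Vision summary above, assuming the common space-to-depth formulation: neighbouring patch features are concatenated, so the number of image tokens drops while each token gets wider. The function name, the 2x2 factor, and the list-of-lists layout are illustrative assumptions, not Cohere's implementation.

```python
def pixel_shuffle_downsample(grid, factor=2):
    """Merge each factor x factor neighbourhood of patch features into one token.

    grid is an H x W list of feature vectors (lists of floats).
    Returns an (H/factor) x (W/factor) grid whose vectors are factor**2 times longer.
    """
    height, width = len(grid), len(grid[0])
    if height % factor or width % factor:
        raise ValueError("grid dimensions must be divisible by factor")

    out = []
    for r in range(0, height, factor):
        row = []
        for c in range(0, width, factor):
            merged = []
            # Concatenate the features of the neighbourhood in row-major order.
            for dr in range(factor):
                for dc in range(factor):
                    merged.extend(grid[r + dr][c + dc])
            row.append(merged)
        out.append(row)
    return out


# Hypothetical 4 x 4 grid of 1-dimensional patch features.
patches = [[[float(r * 4 + c)] for c in range(4)] for r in range(4)]
tokens = pixel_shuffle_downsample(patches)
```

With a factor of 2, a 4 x 4 grid of 16 tokens becomes a 2 x 2 grid of 4 tokens, each carrying the features of four original patches.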

682 · FEB 28 · 00:00

article · HuggingFace Blog · last yr.

Trace & Evaluate your Agent with Arize Phoenix

This article introduces Arize Phoenix for tracing, evaluating, and debugging AI agents in real time. It shows how to enable tracing via OpenTelemetry and OpenInference, and gives concrete steps for installing smolagents, configuring an agent, and running tests. Code and usage snippets serve as a practical guide to improving traceability and performance.

★★★★★·HuggingFace Blog

683 · FEB 27 · 00:00

article · HuggingFace Blog · last yr.

HuggingFace, IISc partner to supercharge model building on India's diverse languages

Hugging Face and IISc/ARTPARK partner to enable access to Vaani, India's diverse open-source, multi-modal dataset. Vaani targets 54 languages, 150k hours of speech and 15k hours of transcribed data from about 1 million people across 773 districts, with Phase 1 already open-sourced and Phase 2 expanding reach. This dataset supports ASR, TTS, language identification, and speaker verification, empowering robust Indic-language models and code-switching applications.

★★★★★·HuggingFace Blog

684 · FEB 25 · 00:00

tool · HuggingFace Blog · last yr.

FastRTC: The Real-Time Communication Library for Python

FastRTC is a Python library that simplifies building real-time audio and video AI apps. It offers automatic voice detection and turn-taking, a built-in Gradio UI, WebRTC/WebSocket support, and easy deployment to FastAPI, plus utilities for STT/TTS and phone-based access. The post walks through a hello-world echo example and a level-up LLM chat flow with SambaNova/OpenAI integration.

★★★★★·HuggingFace Blog

685 · FEB 24 · 00:00

article · HuggingFace Blog · last yr.

Remote VAEs for decoding with Inference Endpoints

This article presents offloading the VAE decoder to a remote endpoint to reduce memory use and latency during inference with latent diffusion models. It details the integration with diffusers and remote_decode, and shows use cases (Stable Diffusion v1.5, Flux, HunyuanVideo) through code examples, along with the benefits (no data storage, open source).

★★★★★·HuggingFace Blog

686 · FEB 21 · 00:00

article · HuggingFace Blog · last yr.

SigLIP 2: A better multilingual vision language encoder

SigLIP 2 expands Google's multilingual vision-language encoder family by adding additional training objectives to SigLIP's sigmoid loss, boosting semantic understanding, localization, and dense features. It outperforms SigLIP across scales on zero-shot classification, image-text retrieval, and transfer tasks, and introduces a dynamic resolution (naflex) variant for aspect-ratio-sensitive downstream work. The release catalogs multiple models (Base, Large, So400m, Giant) with varied patch sizes, 2

★★★★★·HuggingFace Blog
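
The sigmoid loss that the SigLIP 2 summary above builds on can be sketched as follows: every image-text pair in a batch is scored independently as a binary classification, with matching pairs labelled +1 and all other pairs labelled -1. This is a plain-Python illustration under stated assumptions: embeddings are taken to be L2-normalized already, and the temperature and bias are fixed floats here, whereas they are learned during real training. The function name and default values are illustrative.

```python
import math


def sigmoid_pairwise_loss(image_emb, text_emb, temperature=10.0, bias=-10.0):
    """Pairwise sigmoid loss over a batch of (assumed L2-normalized) embeddings.

    image_emb[i] and text_emb[i] form the matching pair; all other
    combinations are treated as negatives.
    """
    n = len(image_emb)
    total = 0.0
    for i in range(n):
        for j in range(n):
            dot = sum(a * b for a, b in zip(image_emb[i], text_emb[j]))
            logit = temperature * dot + bias
            label = 1.0 if i == j else -1.0
            x = label * logit
            # -log(sigmoid(x)) == log(1 + exp(-x)), computed in a stable form.
            total += max(-x, 0.0) + math.log1p(math.exp(-abs(x)))
    return total / n


# Hypothetical 2-item batch: aligned pairs versus deliberately swapped texts.
images = [[1.0, 0.0], [0.0, 1.0]]
aligned_loss = sigmoid_pairwise_loss(images, [[1.0, 0.0], [0.0, 1.0]])
swapped_loss = sigmoid_pairwise_loss(images, [[0.0, 1.0], [1.0, 0.0]])
```

Because each pair contributes its own term, the loss needs no softmax normalization across the batch, which is the property that distinguishes it from a contrastive softmax objective.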

687 · FEB 20 · 00:00

article · HuggingFace Blog · last yr.

SmolVLM2: Bringing Video Understanding to Every Device

SmolVLM2 makes video understanding accessible on any device with three sizes (2.2B, 500M, and 256M) and MLX APIs from day one. Compared with the previous generation, it optimizes memory use and excels on Video-MME, with demos and an interactive interface for testing vision and video understanding, even in a free Colab.

★★★★★·HuggingFace Blog

688 · FEB 19 · 00:00

article · HuggingFace Blog · last yr.

PaliGemma 2 Mix - New Instruction Vision Language Models by Google

Google releases PaliGemma 2 Mix, a family of vision-language models fine-tuned on OCR, captioning, and other tasks. Available in 3B/10B/28B sizes and at resolutions up to 896x896, they make it possible to gauge the performance to expect after fine-tuning on downstream tasks. The article details open-ended prompts and prefixes (caption, describe, ocr, answer), as well as prompts for detection and segmentation, with a demo.

★★★★★·HuggingFace Blog

689 · FEB 18 · 00:00

article · HuggingFace Blog · last yr.

Introducing Three New Serverless Inference Providers: Hyperbolic, Nebius AI Studio, and Novita

Hugging Face adds Hyperbolic, Nebius AI Studio, and Novita as serverless inference providers, integrated in the Hub and accessible via JS/Python SDKs. It outlines two usage modes (custom provider keys or routing through HF) with code examples for DeepSeek-R1 and Flux.1, and explains billing depending on the routing mode.

★★★★★·HuggingFace Blog

690 · FEB 14 · 00:00

article · HuggingFace Blog · last yr.

Welcome Fireworks.ai on the Hub

Fireworks.ai is now a supported inference provider on the Hugging Face Hub. The article shows how to run serverless inference through Fireworks.ai from Python, JS, and HTTP, with examples on models such as DeepSeek-R1 and Llama-3, and details billing and PRO credits.

★★★★★·HuggingFace Blog

691 · FEB 14 · 00:00

article · HuggingFace Blog · last yr.

Fixing Open LLM Leaderboard with Math-Verify

Math-Verify overhauled LLM evaluation on the Open LLM Leaderboard, re-evaluating 3,751 models on 1,324 hard math problems. The article explains the flaws of the previous method (answer format, SymPy parsing) and describes the improvements that allow a fairer and more robust comparison of models.

★★★★★·HuggingFace Blog

692 · FEB 13 · 00:00

article · HuggingFace Blog · last yr.

1 Billion Classifications

The piece breaks down how to cost-effectively run 1B+ classifications or embeddings at scale, analyzing model architectures, hardware options, and deployment choices. It offers a framework to estimate cost and latency, plus a practical stack (Inference Endpoints, Hugging Face Hub, Infinity, k6) to benchmark and optimize throughput.

★★★★★·HuggingFace Blog
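
The kind of cost and latency estimate described in the summary above can be sketched as simple arithmetic: total replica-hours follow from the item count and per-replica throughput, cost follows from the hourly price, and wall-clock time shrinks with the number of replicas. The function name and every number below are hypothetical placeholders, not figures from the article.

```python
def estimate_batch_cost(num_items, items_per_second, hourly_price_usd, replicas=1):
    """Return (wall_clock_hours, total_cost_usd) for a batch inference job.

    items_per_second is the sustained throughput of ONE replica;
    hourly_price_usd is the price of ONE replica per hour.
    """
    if items_per_second <= 0 or replicas <= 0:
        raise ValueError("throughput and replica count must be positive")

    # Total compute needed, summed over all replicas.
    replica_hours = num_items / items_per_second / 3600
    # Replicas work in parallel, so elapsed time divides by their count...
    wall_clock_hours = replica_hours / replicas
    # ...but the bill depends only on the total replica-hours consumed.
    total_cost_usd = replica_hours * hourly_price_usd
    return wall_clock_hours, total_cost_usd


# Hypothetical scenario: 1B items, 1,000 items/s per replica, $1.00/hour, 4 replicas.
hours, cost = estimate_batch_cost(1_000_000_000, 1000, 1.00, replicas=4)
```

Under this simple model, adding replicas shortens the job without changing its cost, so the lever that actually lowers the bill is throughput per dollar, which is what benchmarking is meant to measure.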

693 · FEB 12 · 00:00

article · HuggingFace Blog · last yr.

From Chunks to Blocks: Accelerating Uploads and Downloads on the Hub

The article argues that content-defined chunking (CDC) is only a means to speed up data movement, not the ultimate goal of deduplication. To scale, the team moves from per-chunk transfers to aggregation: blocks up to 64MB reduce CAS entries by ~1000x, while shards map files to blocks and detect changes, enabling faster uploads/downloads.

★★★★★·HuggingFace Blog
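
The move from per-chunk transfers to blocks, as summarized above, can be sketched in two steps: split the data at content-defined boundaries, then pack consecutive chunks into blocks with a size cap so far fewer objects have to be tracked. This is a toy illustration: the rolling hash, the tiny size limits, and both function names are assumptions for readability, not the Hub's actual algorithm or parameters.

```python
def content_defined_chunks(data, mask=0x3F, min_size=16, max_size=256):
    """Split bytes where a running hash of recent content matches a pattern."""
    chunks = []
    start = 0
    running = 0
    for i, byte in enumerate(data):
        # Toy rolling hash: older bytes shift out of the masked low bits.
        running = ((running << 1) + byte) & 0xFFFFFFFF
        length = i - start + 1
        at_boundary = length >= min_size and (running & mask) == 0
        if at_boundary or length >= max_size:
            chunks.append(data[start:i + 1])
            start = i + 1
            running = 0
    if start < len(data):
        chunks.append(data[start:])  # trailing partial chunk
    return chunks


def pack_into_blocks(chunks, max_block_size):
    """Group consecutive chunks into blocks no larger than max_block_size."""
    blocks, current, current_size = [], [], 0
    for chunk in chunks:
        if current and current_size + len(chunk) > max_block_size:
            blocks.append(current)
            current, current_size = [], 0
        current.append(chunk)
        current_size += len(chunk)
    if current:
        blocks.append(current)
    return blocks


# Hypothetical 8 KiB payload with a deterministic byte pattern.
payload = bytes((i * 131 + (i >> 3) * 17) % 256 for i in range(8192))
chunks = content_defined_chunks(payload)
blocks = pack_into_blocks(chunks, max_block_size=4096)
```

Because boundaries depend on content rather than fixed offsets, an edit in the middle of a file tends to disturb only nearby chunks, while the block layer keeps the number of stored entries small.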

694 · FEB 12 · 00:00

tool · HuggingFace Blog · last yr.

Build awesome datasets for video generation

The post describes tooling to build video-generation datasets, extending the img2dataset approach to videos. It presents a three-stage pipeline (Acquisition, Pre-processing/Filtering, Processing) using yt-dlp, Video to Scenes, watermark and aesthetic/NSFW checks, motion scoring, and Florence-2-based captions/OCR to filter data for fine-tuning.

★★★★★·HuggingFace Blog

695 · FEB 10 · 16:10

article · HuggingFace Blog · last yr.

Open R1: Update #2

OpenR1-Math-220k is a large-scale math-reasoning dataset built on 512 H100s, with two (often four) solutions per problem and about 800k reasoning traces. It uses automated filtering (Math-Verify) and Llama-3.3-70B-Instruct as a judge, and achieves high throughput with vLLM and SGLang (~300k solutions/day). The update also details distillation results and a pipeline extensible to other domains.

★★★★★·HuggingFace Blog

696 · FEB 10 · 00:00

article · HuggingFace Blog · last yr.

The Open Arabic LLM Leaderboard 2

The Open Arabic LLM Leaderboard (OALL) evolved from fragmented benchmarks into a unified platform. Since May 2024, OALL has hosted 14 benchmarks; later, the Balsam Index added ~1,400 datasets and 50,000 questions, AraGen launched 3C3H with private test cycles, and SEAL introduced a private Arabic leaderboard with human-preference evaluation. The ecosystem drew 46k visitors and 700+ models from 180+ organizations.

★★★★★·HuggingFace Blog

697 · FEB 04 · 00:00

article · HuggingFace Blog · last yr.

Open-source DeepResearch – Freeing our search agents

OpenAI released Deep Research, a system that browses the web to summarize content and answer from that summary. The article presents an open-source reproduction of the agentic framework (CodeAgent) and its results on the GAIA benchmark (≈67% 1-shot, 47.6% on level 3). The authors aim to open-source the framework and outline next steps for reproducibility.

★★★★★·HuggingFace Blog

698 · FEB 04 · 00:00

article · HuggingFace Blog · last yr.

π0 and π0-FAST: Vision-Language-Action Models for General Robot Control

π0 and π0-FAST are Vision-Language-Action models for generalist robot control. They use flow-matching for real-time action trajectories (50 Hz) across seven platforms and 68 tasks, and introduce fast attention techniques (FlashAttention2, FlexAttention) to handle 2D masks and cross-embodiment training. The post also points to Hugging Face LeRobot repos for code and pretrained models.

★★★★★·HuggingFace Blog

699 · FEB 04 · 00:00

article · HuggingFace Blog · last yr.

DABStep: Data Agent Benchmark for Multi-step Reasoning

DABStep is a benchmark of 450+ real-world data analysis tasks for evaluating multi-step reasoning in AI agents. The study finds that current top agents reach only about 16% accuracy, underscoring how far they are from reliably handling real data tasks that mix structured data and unstructured documents.

★★★★★·HuggingFace Blog

700 · FEB 02 · 00:04

article · HuggingFace Blog · last yr.

Open-R1: Update #1

Open-R1: Update #1 summarizes progress on replicating DeepSeek-R1's training pipeline and synthetic data (MATH-500, GRPO in TRL 0.14, DeepSpeed ZeRO, vLLM). The post also discusses the challenges posed by long generations and offers community resources as well as a public leaderboard.

★★★★★·HuggingFace Blog


Issue 138 · Digest

The weekly digest, every Sunday.

20 articles ranked by an agent. No noise, no ads. One-click unsubscribe.


[top 7 days] · B.1

01. I turned a $80 RK3562 Android tablet into a Debian Linux workstation · Hacker News (100+ pts)
02. Mullvad exit IPs as a fingerprinting vector · Lobsters
03. Granite Embedding Multilingual R2: Open Apache 2.0 Multilingual Embeddings with 32K Context — Best Sub-100M Retrieval Quality · HuggingFace Blog
04. what 262,715 regex questions on stack overflow haven't answered · Lobsters
05. int a = 5; a = a++ + ++a; a = ? (2011) · Lobsters

Colophon · Maker · C.1

Quentin Lecocq · @celdama

Fullstack dev · CRO freelance · Lille, FR

Lantern is a side-project — aggregation, AI scoring, weekly digest. Built with Next.js 16, Drizzle, Neon & Claude. One maintainer.

