[⋯]Loading

Built solo in Lille, FR·v0.6

Dev & AI feed

The best of dev and AI, scored every day by an agent. Filtered, summarized, ranked. No color, no noise — just the substance.

Issue: No. 139
Date: MAY 19, 2026
Edition: EN · DAILY
Sources: 14 active
Articles: 31 today

§ Feed·Vol. 02·No. 139

Last ingest·08:00 UTC+0·Next·08:00

Filters

Reference PanelA.1

01. Type— 5

02. Period— 3

03. Source— 7

04. Score— min.

0 active

$⌘K

Articles / day31

7-day avg.48

Mon → Sun-83%

Feed · 834 articles

sort byscore·DESC ↓

601AUG 0114:25

articleHuggingFace Blog·10 mo. ago

3LM: A Benchmark for Arabic LLMs in STEM and Code

3LM est un benchmark multidomaine pour évaluer les LLM arabes en STEM et en code, déployant trois jeux de données (Native STEM MCQs, Synthetic STEM et Arabic Code Benchmarks) et des métriques comme pass@1 via EvalPlus. Le pipeline combine OCR, génération par LLM et vérifications humaines, et propose l'accès aux jeux sur HuggingFace et le code sur GitHub.

★★★★★·HuggingFace Blog

602JUL 3100:00

articleHuggingFace Blog·10 mo. ago

Implementing MCP Servers in Python: An AI Shopping Assistant with Gradio

The article demonstrates how to implement an MCP server with Gradio to plug an LLM into Hugging Face models. It covers automatic conversion of Python functions into MCP tools, real-time progress notifications, and automatic file uploads, illustrated by an AI shopping assistant using IDM-VTON for virtual try-on.

★★★★★·HuggingFace Blog

603JUL 2900:00

articleHuggingFace Blog·10 mo. ago

Introducing Trackio: A Lightweight Experiment Tracking Library from Hugging Face

Trackio est une bibliothèque légère d’expérimentation open-source qui remplace wandb et offre un tableau de bord local Gradio, avec synchronisation vers Hugging Face Spaces pour partager les métriques. Elle vise simplicité, traçabilité des métriques et énergie GPU via nvidia-smi, et peut être utilisée comme drop-in wandb.

★★★★★·HuggingFace Blog

604JUL 2500:00

toolHuggingFace Blog·10 mo. ago

Say hello to `hf`: a faster, friendlier Hugging Face CLI

HF renames the Hugging Face CLI from huggingface-cli to hf to reorganize commands around a clear hf <resource> <action> pattern, improving ergonomics and discoverability. The migration keeps the legacy CLI functional with warnings and documents how to install (pip install -U huggingface_hub) and verify (hf version) and explore commands (hf --help). It also introduces the hf jobs service to run scripts or Docker images on HF infrastructure with pay-as-you-go billing.

★★★★★·HuggingFace Blog

605JUL 2500:00

articleHuggingFace Blog·10 mo. ago

Parquet Content-Defined Chunking

Parquet Content-Defined Chunking (CDC) is now available in PyArrow and Pandas, enabling deduplication of Parquet files on Hugging Face’s Xet storage layer. CDC reduces data transfer and storage costs by uploading or downloading only changed data chunks. The article shows how to enable CDC with use_content_defined_chunking and outlines several deduplication scenarios with code examples.

★★★★★·HuggingFace Blog

606JUL 2300:00

articleHuggingFace Blog·10 mo. ago

TimeScope: How Long Can Your Video Large Multimodal Model Go?

TimeScope is an open-source benchmark that tests long-video understanding by inserting short needle clips into base videos (1 min to 8 hours). It evaluates localized retrieval, information synthesis, and fine-grained temporal perception, revealing that many SOTA models struggle with true temporal comprehension.

★★★★★·HuggingFace Blog

607JUL 2300:00

articleHuggingFace Blog·10 mo. ago

Fast LoRA inference for Flux with Diffusers and PEFT

LoRA adapters enable customizing diffusion models but can slow or complicate inference. This post shows how to speed up LoRA inference for Flux using a recipe based on FA3, FP8 quantization, and torch.compile, while staying hot-swapping–ready. It includes a concise code example that applies quantization, sets the attn processor, and compiles the transformer for faster inference.

★★★★★·HuggingFace Blog

608JUL 2118:01

articleHuggingFace Blog·10 mo. ago

Accelerate a World of LLMs on Hugging Face with NVIDIA NIM

NVIDIA NIM offre un conteneur unique pour deployer rapidement une large gamme de LLM via Hugging Face, en automatisant l’adaptation, l’analyse du modele et le choix du backend (TensorRT-LLM, vLLM, SGLang). Il prend en charge Hugging Face, GGUF et TensorRT-LLM et illustre le deployment avec Codestral-22B via une commande Docker et tokens API.

★★★★★·HuggingFace Blog

609JUL 1800:00

articleHuggingFace Blog·10 mo. ago

Arc Virtual Cell Challenge: A Primer

Arc Institute lance le Virtual Cell Challenge, qui vise à entraîner un modèle capable de prédire l’effet du silençage d’un gène sur une cellule, même dans des types cellulaires inédits (contexte généralisation). Le jeu de données réunit environ 300k profils RNA‑seq d’une cellule unique et propose un cadre mathématique pour séparer le signal du perturbation, l’hétérogénéité et le bruit technique, avec des architectures comme le State Transition Model et le State Embedding Model.

★★★★★·HuggingFace Blog

610JUL 1700:00

articleHuggingFace Blog·10 mo. ago

Consilium: When Multiple LLMs Collaborate

Consilium est une plateforme qui fait débattre et faire consensus entre plusieurs LLMs via des discussions structurées, avec des modes comme consensus, vote majoritaire ou classement par choix. Déployée comme interface Gradio et serveur MCP, elle visualise une table ronde et les échanges des experts. L'article relie l'idée au MAI-DxO de Microsoft pour démontrer l'efficacité du multi-LLM.

★★★★★·HuggingFace Blog

611JUL 1700:00

mcpHuggingFace Blog·10 mo. ago

Five Big Improvements to Gradio MCP Servers

Gradio has released version 5.38.0 to enhance MCP servers with Seamless Local File Support via a new File Upload endpoint, real-time progress streaming for MCP clients, and a one-line OpenAPI-to-MCP conversion using gr.load_openapi. The update also improves authentication by allowing server arguments to be declared as gr.Header and surfaced in docs.

★★★★★·HuggingFace Blog

612JUL 1700:00

articleHuggingFace Blog·10 mo. ago

Back to The Future: Evaluating AI Agents on Predicting Future Events

AI evaluation should shift from recalling facts to forecasting future events. FutureBench uses real-world prediction markets and live news to test agents' ability to reason under uncertainty and synthesize information to predict outcomes.

★★★★★·HuggingFace Blog

613JUL 1600:00

articleHuggingFace Blog·10 mo. ago

Ettin Suite: SoTA Paired Encoders and Decoders

Ettin introduces the first SoTA paired encoder-only and decoder-only models (17M–1B params) trained identically for apples-to-apples comparisons. It extends the ModernBERT recipe to both architectures, with encoders beating ModernBERT and decoders beating Llama 3.2 and SmolLM2, while preserving architecture-specific advantages.

★★★★★·HuggingFace Blog

614JUL 1500:00

articleHuggingFace Blog·10 mo. ago

Migrating the Hub from Git LFS to Xet

Hugging Face has deployed Xet as the Hub’s storage backend, migrating hundreds of petabytes and millions of repos with minimal disruption. The migration relies on a Git LFS Bridge and background content migrations to support both LFS and Xet, allowing a no-hard-cutover transition. The approach aims to scale storage with AI workloads while preserving existing workflows.

★★★★★·HuggingFace Blog

615JUL 1012:54

articleHuggingFace Blog·10 mo. ago

Kimina-Prover: Applying Test-time RL Search on Large Formal Reasoning Models

Kimina-Prover-72B uses a Test-Time Reinforcement Learning (TTRL) search to autonomously discover and reuse lemmas for long-horizon Lean proofs, plus an error-fixing module that interprets Lean messages for targeted corrections. It achieves a state-of-the-art miniF2F performance (92.2% pass rate) and shows pass@1/32/1024 of 63.9/84.0/87.7, with two distilled variants released (8B and 1.7B).

★★★★★·HuggingFace Blog

616JUL 1000:00

articleHuggingFace Blog·10 mo. ago

Asynchronous Robot Inference: Decoupling Action Prediction and Execution

Asynchronous inference decouples action prediction from execution in robotic policies, reducing runtime lag and enabling replanning with action chunks. The article describes a two-component architecture (PolicyServer and RobotClient) using gRPC to achieve ~2× speedups and continuous operation, and explains why sequential inference falls short.

★★★★★·HuggingFace Blog

617JUL 1000:00

mcpHuggingFace Blog·10 mo. ago

Building the Hugging Face MCP Server

Building the Hugging Face MCP Server enables customized AI Assistants to access the Hub and thousands of apps through a single URL. The article compares MCP transports (Streamable HTTP vs SSE) and outlines three patterns: Direct Response, Request Scoped Streams, and Server Push Streams, with their trade-offs and needed connection management. It also covers making the server dynamic and remotely configurable, plus client-side usage hints (TypeScript/Python).

★★★★★·HuggingFace Blog

618JUL 1000:00

articleHuggingFace Blog·10 mo. ago

ScreenEnv: Deploy your full stack Desktop Agent

ScreenEnv is a Python library that runs isolated Ubuntu desktop environments in Docker to test and deploy GUI agents with full desktop control, including window management and file operations. It supports both the Model Context Protocol (MCP) and a Direct Sandbox API, enabling flexible integration with existing backends or AI systems. The article provides setup examples and a quick path to build a custom Desktop Agent using smolagents.

★★★★★·HuggingFace Blog

619JUL 0900:00

articleHuggingFace Blog·10 mo. ago

Creating custom kernels for the AMD MI300

Hugging Face and AMD optimize custom kernels for MI300X to boost Llama 3.1 405B inference in FP8 on 8 GPUs. They combine fused residual/RMS norm/FP8 conversion, SwiGLU, and a Skinny GEMM kernel, with benchmarks and open-source tooling in hf-rocm-kernels.

★★★★★·HuggingFace Blog

620JUL 0900:00

mcpHuggingFace Blog·10 mo. ago

Upskill your LLMs With Gradio MCP Servers

The article introduces the Model Context Protocol (MCP) and explains how MCP servers can extend LLMs with new abilities via an app-store-like ecosystem. It guides readers to find MCP-compatible spaces in Hugging Face Spaces and demonstrates wiring a Flux.1 Kontext[dev] MCP server to an LLM to edit images from text prompts.

★★★★★·HuggingFace Blog

Page 31 / 42

← Prev.Next →

20 of 834 shown

Issue 139 · Digest

The weekly digest, every Sunday.

20 articles ranked by an agent. No noise, no ads. One-click unsubscribe.

Subscribe →

[top 7 days]B.1

01.
I turned a $80 RK3562 Android tablet into a Debian Linux workstation
Hacker News (100+ pts)
02.
Mullvad exit IPs as a fingerprinting vector
Lobsters
03.
Granite Embedding Multilingual R2: Open Apache 2.0 Multilingual Embeddings with 32K Context — Best Sub-100M Retrieval Quality
HuggingFace Blog
04.
what 262,715 regex questions on stack overflow haven't answered
Lobsters
05.
int a = 5; a = a++ + ++a; a = ? (2011)
Lobsters

Colophon · MakerC.1

Quentin Lecocq · @celdama

Fullstack dev · CRO freelance · Lille, FR

Lantern is a side-project — aggregation, AI scoring, weekly digest. Built with Next.js 16, Drizzle, Neon & Claude. One maintainer.

[X][GitHub][RSS][Site]

ShortcutsC.2

Search⌘ K
Next articleJ
Previous articleK
OpenEnter
FavoriteF

Dev & AI feed

§Feed · 834 articles

Feed · 834 articles