articleHuggingFace Blog
LLM Inference on Edge: A Fun and Easy Guide to run LLMs via React Native on your Phone!
This tutorial shows how to run LLMs on-device via a React Native app, downloading GGUF models from Hugging Face and loading them with llama.rn (llama.cpp). It covers model sizing and quantization formats (Q4_0, Q3_K, IQ2_XXS, etc.), and outlines a complete app workflow from environment setup to debugging, all aimed at offline, privacy-preserving mobile AI.
publié 07 MARS 2025★★★★★
Lire la sourcehuggingface.co/blog/llm-inference-on-edge
[*] Ouvre dans un nouvel onglet · pas de tracking côté Lantern
- Source
- HuggingFace Blog
- Ingéré
- 07 MARS 2025 · 19:10
- Score édito
- 4.0 / 5