llama-cpp
Here are 363 public repositories matching this topic...
Maid is a free and open source application for interfacing with llama.cpp models locally, and with Anthropic, DeepSeek, Ollama, Mistral and OpenAI models remotely.
Updated Mar 7, 2026 - TypeScript
Open-source AI camera skills platform, AI NVR, and CCTV surveillance. Local VLM video analysis with Qwen, DeepSeek, SmolVLM, LLaVA, and MiniMax. An LLM-powered agentic security camera that watches, understands, remembers, and guards your home via Telegram, Discord, or Slack. Pluggable AI skills; works with OpenAI, Google, Anthropic, or local AI. Runs on a Mac Mini or AI PC.
Updated Mar 7, 2026 - JavaScript
Run AI models locally on your machine with Node.js bindings for llama.cpp. Enforce a JSON schema on the model output at the generation level.
Updated Mar 6, 2026 - TypeScript
The Swiss Army knife of offline AI. Chat, speak, and generate images with privacy first and zero internet. Download an LLM and use it on your mobile device; no data ever leaves your phone. Supports text-to-text, vision, and text-to-image.
Updated Mar 7, 2026 - TypeScript
Build and run AI agents using Docker Compose. A collection of ready-to-use examples for orchestrating open-source LLMs, tools, and agent runtimes.
Updated Dec 12, 2025 - TypeScript
Rust bindings for llama.cpp.
Updated Jun 27, 2024 - Rust
Run larger LLMs with longer contexts on Apple Silicon by using differentiated precision for KV cache quantization. KVSplit enables 8-bit keys & 4-bit values, reducing memory by 59% with <1% quality loss. Includes benchmarking, visualization, and one-command setup. Optimized for M1/M2/M3 Macs with Metal support.
Updated May 21, 2025 - Python
This repo showcases how to run a model locally and offline, free of OpenAI dependencies.
Updated Jul 12, 2024 - Python
A complete offline AI ecosystem for Android: chat (GGUF LLMs), images (Stable Diffusion 1.5), voice (TTS/STT), and knowledge (RAG data packs). Zero subscriptions, no data harvesting: open-source, privacy-first AI on your terms.
Updated Mar 7, 2026 - Kotlin
Review and check GGUF files, and estimate their memory usage and maximum tokens per second.
Updated Feb 11, 2026 - Go
Local ML voice chat using high-end models.
Updated Mar 5, 2026 - C++