Local LLM Tools

Explore the highest-rated open-source tools for running Large Language Models locally. Discover inference engines, GUI clients, and API wrappers for GGUF and safetensors models. Sorted by GitHub authority and active contributions.

Defilantech / LLMKube

Kubernetes operator for local LLM inference with llama.cpp, vLLM, TGI, and mlx-server — multi-GPU

⭐ 127 Go

View Details

Enescingoz / Colab Llm

This repository provides a ready-to-use Google Colab notebook that turns Colab into a temporary

⭐ 127 Jupyter Notebook

View Details

Daviddaytw / React Native Transformers

Run local LLM from Huggingface in React-Native or Expo using onnxruntime.

⭐ 133 TypeScript

View Details

BodhiSearch / BodhiApp

Run Open Source/Open Weight LLMs locally with OpenAI compatible APIs

⭐ 133 TypeScript

View Details

Askimo Ai / Askimo

Chat, RAG search, multi-step Plans workflows, MCP tools, and Agents integration. Supports OpenAI,

⭐ 136 Kotlin

View Details

Likhithsai2580 / JARVIS

Project Jarvis is a versatile AI assistant that integrates various functionalities.

⭐ 137 Python

View Details

Peva3 / SmarterRouter

SmarterRouter: An intelligent LLM gateway and VRAM-aware router for Ollama, llama.cpp, and OpenAI.

⭐ 137 Python

View Details

Mcourtyard / M Courtyard

M-Courtyard: Local AI Model Fine-tuning Assistant for Apple Silicon. Zero-code, zero-cloud,

⭐ 137 TypeScript

View Details

Ddalcu / Mlx Serve

Native LLM inference server for Apple Silicon. OpenAI + Anthropic API compatible. No Python.

⭐ 141 Zig

View Details

Aestheticsuraj234 / Vibecode Playground

Vibecode Editor is a fullstack, web-based IDE built with Next.js and Monaco Editor. It features

⭐ 142 TypeScript

View Details

Xtekky / Gpt4local

Openai-style, fast & lightweight local language model inference w/ documents

⭐ 143 Python

View Details

Nath1295 / LLMFlex

A python package for developing AI applications with local LLMs.

⭐ 150 Python

View Details

Hanxiao / Dataroom

Give a query, get a dataroom. Pi + self-hosted Qwen3.6 research harness on a single L4.

⭐ 153 Python

View Details

Code Forge Temple / Agentic Signal

🤖 Visual AI agent workflow automation platform with local LLM integration - build intelligent

⭐ 155 TypeScript

View Details

Jegly / OfflineLLM

Private on-device AI chat for Android — runs any GGUF model locally via llama.cpp with

⭐ 169 Kotlin

View Details

Mercurialsolo / Claudectl

Orchestrate a swarm of Claude Code agents with a local brain that learns from you.

⭐ 177 Rust

View Details

UncSoft / Anubis Oss

Local LLM Testing & Benchmarking for Apple Silicon

⭐ 177 Swift

View Details

Techopolis / Afm Server

macOS menu bar app that exposes Apple's on-device Foundation Models as an OpenAI-compatible local

⭐ 184 Swift

View Details

Sfortis / Openai Tts

Custom TTS component for Home Assistant. Utilizes the OpenAI speech engine or any compatible

⭐ 194 Python

View Details

Quelmap Inc / Quelmap

Open Source Local Data Analysis Assistant.

⭐ 200 TypeScript

View Details

Devnen / Qwen3.6 Windows Server

One-click Qwen3.6-27B inference on Windows. 158 tok/s on RTX 5090, 72 tok/s on RTX 3090. Native, no

⭐ 201 Python

View Details

Fiveoutofnine / Whatcanirun

Find the best models and how to run them locally.

⭐ 241 TypeScript

View Details

AstraBert / Everything Ai

Your fully proficient, AI-powered and local chatbot assistant🤖

⭐ 248 Python

View Details

Datacrystals / AIStoryWriter

LLM story writer with a focus on high-quality long output based on a user provided prompt.

⭐ 251 Python

View Details

1 2 3 4