Raketenkater / Llm Server
Description
Auto-tuned launcher for GGUF models on llama.cpp / ik_llama.cpp — OpenAI-compatible server with multi-GPU tensor-split, MoE expert placement, measured flag tuning (AI Tune), hardware-matched HuggingFace downloads, and crash recovery. An Ollama alt...
Technical Specifications
| Core Language | |
| GitHub Authority | ⭐ 223 stars |
| Last Code Push | 2026-06-12 |
| Open Issues / Bugs | 🛠️ 0 bugs listed |
| License Type | Open-Source (Free to use) |
Get Source Code
This project is open-source and hosted on GitHub. Click below to explore the repository, deployment guides, or fork the code.
Go to Repository →