Raketenkater / Llm Server

Tags:

Description

Auto-tuned launcher for GGUF models on llama.cpp / ik_llama.cpp — OpenAI-compatible server with multi-GPU tensor-split, MoE expert placement, measured flag tuning (AI Tune), hardware-matched HuggingFace downloads, and crash recovery. An Ollama alt...

Technical Specifications

Core Language	Go
GitHub Authority	⭐ 223 stars
Last Code Push	2026-06-12
Open Issues / Bugs	🛠️ 0 bugs listed
License Type	Open-Source (Free to use)

Get Source Code

This project is open-source and hosted on GitHub. Click below to explore the repository, deployment guides, or fork the code.

Go to Repository →

Raketenkater / Llm Server

Description

Technical Specifications

Get Source Code

Browse Software Categories