← Back to Home

gguf

Distribute and run LLMs with a single file.

Auto-tuned launcher for GGUF models on llama.cpp / ik_llama.cpp — OpenAI-compatible server with