LLM inference

Self-hosted personalized AI in a mirror.

One-click Qwen3.6-27B inference on Windows. 158 tok/s on RTX 5090, 72 tok/s on RTX 3090. Native, no