KoboldCpp
High-performance llama.cpp-based inference engine with built-in web UI. Popular in the creative writing and roleplay community for its quality and configurability.
✨ Features
- Single-file executable
- Built-in web UI
- KoboldAI-compatible API
- GGUF support
- GPU acceleration (CUDA, Vulkan, CLBlast)
- Story/adventure mode
- Sampler configuration
👍 Pros
- Single executable, no dependencies
- Excellent performance
- Great for creative writing
- Highly configurable sampling
👎 Cons
- Niche community focus
- Less general-purpose UI
- No built-in model download
- Documentation could be better
🎯 Best For
Creative writers and roleplay enthusiasts who want maximum control over text generation