Practical llama.cpp Optimization Guide Covering VRAM, KV Cache, MoE | HACKOBAR_