[r/LocalLLaMA]score: 0.19

Qwen3.6 27B uncensored heretic v2 Native MTP Preserved is Out Now With KLD 0.0021, 6/100 Refusals and the Full 15 MTPs Preserved and Retained, Available in Safetensors, GGUFs and NVFP4s formats.

May 6, 2026

llmfan46 has released Qwen3.6-27B-uncensored-heretic-v2-Native-MTP-Preserved, an abliterated fine-tune of Qwen3 27B achieving KLD of 0.0021 and only 6/100 refusals while retaining all 15 native Multi-Token Prediction heads intact. Prior uncensored variants typically discarded MTP modules during abliteration, sacrificing speculative decoding throughput gains. This release ships in Safetensors, GGUF, and NVFP4 formats, making it immediately deployable across llama.cpp and TensorRT-LLM pipelines. Researchers and developers needing unrestricted instruction-following with preserved inference acceleration should prioritize evaluating this over earlier heretic v1 builds.

new model

SOURCE

https://www.reddit.com/r/LocalLLaMA/comments/1t5yajb/qwen36_27b_uncensored_heretic_v2_native_mtp/

← back to feed