https://huggingface.co/blog/mlabonne/abliteration
In this article, we will explore a technique called "abliteration" that can uncensor any LLM without retraining. This technique effectively removes the model's built-in refusal mechanism, allowing it to respond to all types of prompts.
No comments:
Post a Comment