LUNAR: LLM Unlearning via Neural Activation Redirection
Published in arXiv, 2025
We present LUNAR, a novel unlearning methodology grounded in the Linear Representation Hypothesis.
Recommended citation: William F. Shen, Xinchi Qiu, Meghdad Kurmanji, Alex Iacob, Lorenzo Sani, Yihong Chen, Nicola Cancedda, & Nicholas D. Lane. (2025). LUNAR: LLM Unlearning via Neural Activation Redirection.
Download Paper