BEEAR represents a significant advancement towards practical mitigation of safety backdoors in instruction-tuned LLMs. It offers a generlizable backdoor behavior mitigation method for the LLM ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results