Comparing the effectiveness of top-down and bottom-up activation steering for bypassing refusal on harmful prompts — LessWrong