The better we align AI models with our values, the easier we may make it to realign them with opposing values. The release of GPT-3, and later ChatGPT, catapulted large language models from the ...