Kagan Agras
Personal website
Blog posts
Category: Uncategorized
-
This is a thought piece I wrote after reading papers on representation engineering and adversarial robustness (like circuit breakers, neural chameleons, etc.) It was originally written spontaneously, so it is more exploratory than polished, but I wanted to share it because it captures a line of thinking I found interesting while comparing LLM reasoning and…