About me
I focus on detecting unexpected phenomena, with applications to scientific research and AI safety.
I am now a post-doc at New York University, working with Prof. Chinmay Hegde. Prior to that I received my PhD at the Hebrew University, supervised by Prof. Yedid Hoshen.
You can also check my personal site with some nice riddles and more.
News
My team together with Yuval Lemberg, Hestia, won 1st place (Defense track) and 3st place (Attack track) in the Large Language Model Capture-the-Flag Competition.
Our work on the vulnerability of concept erasure methods will be presented in ICLR 2024. Check out our project website!