About me

I focus on detecting unexpected phenomena, with applications to scientific research and AI safety.

I am now a post-doc at New York University, working with Prof. Chinmay Hegde. Prior to that I received my PhD at the Hebrew University, supervised by Prof. Yedid Hoshen.

You can also check my personal site with some nice riddles and more.

News

  • My team together with Yuval Lemberg, Hestia, won 1st place (Defense track) and 3st place (Attack track) in the Large Language Model Capture-the-Flag Competition.

  • Our work on the vulnerability of concept erasure methods will be presented in ICLR 2024. Check out our project website!