Hadas Orgad
I’m a Research Fellow at the Kempner Institute for the Study of Natural and Artificial Intelligence at Harvard University, where I study the internals of AI. My research explores how interpretability can be used as a strategic tool to improve their robustness, safety, and trustworthiness. I worked on problems related to hallucinations, bias, or unsafe outputs — with the broader goal of creating models that are both powerful and responsible.
Past: I completed my Ph.D. at the Technion – Israel Institute of Technology, supervised by Yonatan Belinkov. Before that, I spent 3.5 years at Microsoft, where I worked on AI solutions for cloud security and on the application of NLP for security-related problems. I hold both my B.Sc. and M.Sc. degrees from the Technion. I was selected as a 2023 Apple Scholar in AI/ML, and previously interned at Apple. During my master’s, I received the 2022 EMEA Generation Google Scholarship.
I’m always happy to connect with others who are excited about AI interpretability — feel free to reach out if you’d like to brainstorm or collaborate.
orgadhadas at gmail dot com