I am an AI researcher and engineer with expertise in fine-tuning large language models, reinforcement learning, and scalable models. Shoot me a message if you have any questions!
Comparing the moral jugdment of VLMs on image-query pairs with that of their underlying LLMs on description-query pairs.
Exploring architectures and methods for paramter efficient theory of mind in many agent settings.
How to abstract and extract higher agency from joint actions taken in games such as Diplomacy.
Exploration of the performance of reinforcement learning (RL) agents on the "tokens task," a decision-making test commonly used in neuroscience.
Gradient-masking with modified regulariation (l1 & l2), from the paper "Learning explanations that are hard to vary".