About Me

Hello! I’m Lucky Susanto, a research assistant at Monash University Indonesia. I aim to use AI for Societal benefit, such as dealing with toxicity, polarization, mis/dis/mal-information, and many more. My current research interest revolves around using concepts from mechanistic interpretability such as concept attribution to understand and create more robust models. Specifically, I aim to answer these questions:

How do LLMs encode higher-level concepts such as harm, morality, and safety?
How can we extract and edit these encodings to align LLMs after the training phase?
How can we enable LLMs to perform on lower-resource languages?

Find Me Online

Email: lucky[dot]susanto[at]monash[dot]edu
LinkedIn: Here
Google Scholar: Here

About Me

Most Recent Work

Current Inspiration Source

Curriculum Vitae

Find Me Online