Lucky Susanto

Experienced in: AI for Society, dataset creation, multilinguality, multimodality. Now interested in: AI Safety and Mechanistic Interpretability.

View My GitHub Profile

About Me

Hello! I’m Lucky Susanto, a research assistant at Monash University Indonesia. I aim to use AI for societal benefit, for example by addressing toxicity, polarization, and mis-, dis-, and malinformation. My current research applies concepts from mechanistic interpretability, such as concept attribution, to understand models and make them more robust. Specifically, I aim to answer these questions:

  1. How do LLMs encode higher-level concepts such as harm, morality, and safety?
  2. How can we extract and edit these encodings to align LLMs after the training phase?
  3. How can we enable LLMs to perform well in lower-resource languages?

Most Recent Work

Predicting LLM Correctness in Prosthodontics Using Metadata and Hallucination Signals


Current Inspiration Source

CLAIM: Mitigating Multilingual Object Hallucination in Large Vision-Language Models with Cross-Lingual Attention Intervention


Curriculum Vitae

Click Here


Find Me Online