Picture for Seongmin Lee

Seongmin Lee

Polo

Interpretation Meets Safety: A Survey on Interpretation Methods and Tools for Improving LLM Safety

Add code
Jun 05, 2025
Viaarxiv icon

Shape it Up! Restoring LLM Safety during Finetuning

Add code
May 22, 2025
Viaarxiv icon

Can LLM Generate Regression Tests for Software Commits?

Add code
Jan 19, 2025
Viaarxiv icon

LLM Hallucination Reasoning with Zero-shot Knowledge Test

Add code
Nov 14, 2024
Viaarxiv icon

Effective Guidance for Model Attention with Simple Yes-no Annotations

Add code
Oct 29, 2024
Figure 1 for Effective Guidance for Model Attention with Simple Yes-no Annotations
Figure 2 for Effective Guidance for Model Attention with Simple Yes-no Annotations
Figure 3 for Effective Guidance for Model Attention with Simple Yes-no Annotations
Figure 4 for Effective Guidance for Model Attention with Simple Yes-no Annotations
Viaarxiv icon

Zero-Shot Multi-Hop Question Answering via Monte-Carlo Tree Search with Large Language Models

Add code
Sep 28, 2024
Viaarxiv icon

Transformer Explainer: Interactive Learning of Text-Generative Models

Add code
Aug 08, 2024
Figure 1 for Transformer Explainer: Interactive Learning of Text-Generative Models
Viaarxiv icon

CanonicalFusion: Generating Drivable 3D Human Avatars from Multiple Images

Add code
Jul 05, 2024
Viaarxiv icon

Interactive Visual Learning for Stable Diffusion

Add code
Apr 22, 2024
Viaarxiv icon

ClickDiffusion: Harnessing LLMs for Interactive Precise Image Editing

Add code
Apr 05, 2024
Viaarxiv icon