Picture for Kwan Ho Ryan Chan

Kwan Ho Ryan Chan

SECA: Semantically Equivalent and Coherent Attacks for Eliciting LLM Hallucinations

Add code
Oct 05, 2025
Viaarxiv icon

IP-CRR: Information Pursuit for Interpretable Classification of Chest Radiology Reports

Add code
Apr 30, 2025
Viaarxiv icon

Concept Lancet: Image Editing with Compositional Representation Transplant

Add code
Apr 03, 2025
Viaarxiv icon

KDA: A Knowledge-Distilled Attacker for Generating Diverse Prompts to Jailbreak LLMs

Add code
Feb 05, 2025
Figure 1 for KDA: A Knowledge-Distilled Attacker for Generating Diverse Prompts to Jailbreak LLMs
Figure 2 for KDA: A Knowledge-Distilled Attacker for Generating Diverse Prompts to Jailbreak LLMs
Figure 3 for KDA: A Knowledge-Distilled Attacker for Generating Diverse Prompts to Jailbreak LLMs
Figure 4 for KDA: A Knowledge-Distilled Attacker for Generating Diverse Prompts to Jailbreak LLMs
Viaarxiv icon

Do LLMs "know" internally when they follow instructions?

Add code
Oct 22, 2024
Figure 1 for Do LLMs "know" internally when they follow instructions?
Figure 2 for Do LLMs "know" internally when they follow instructions?
Figure 3 for Do LLMs "know" internally when they follow instructions?
Figure 4 for Do LLMs "know" internally when they follow instructions?
Viaarxiv icon

PaCE: Parsimonious Concept Engineering for Large Language Models

Add code
Jun 06, 2024
Figure 1 for PaCE: Parsimonious Concept Engineering for Large Language Models
Figure 2 for PaCE: Parsimonious Concept Engineering for Large Language Models
Figure 3 for PaCE: Parsimonious Concept Engineering for Large Language Models
Figure 4 for PaCE: Parsimonious Concept Engineering for Large Language Models
Viaarxiv icon

Learning Interpretable Queries for Explainable Image Classification with Information Pursuit

Add code
Dec 16, 2023
Figure 1 for Learning Interpretable Queries for Explainable Image Classification with Information Pursuit
Figure 2 for Learning Interpretable Queries for Explainable Image Classification with Information Pursuit
Figure 3 for Learning Interpretable Queries for Explainable Image Classification with Information Pursuit
Figure 4 for Learning Interpretable Queries for Explainable Image Classification with Information Pursuit
Viaarxiv icon

Knowledge Pursuit Prompting for Zero-Shot Multimodal Synthesis

Add code
Nov 30, 2023
Figure 1 for Knowledge Pursuit Prompting for Zero-Shot Multimodal Synthesis
Figure 2 for Knowledge Pursuit Prompting for Zero-Shot Multimodal Synthesis
Figure 3 for Knowledge Pursuit Prompting for Zero-Shot Multimodal Synthesis
Figure 4 for Knowledge Pursuit Prompting for Zero-Shot Multimodal Synthesis
Viaarxiv icon

Variational Information Pursuit with Large Language and Multimodal Models for Interpretable Predictions

Add code
Aug 24, 2023
Figure 1 for Variational Information Pursuit with Large Language and Multimodal Models for Interpretable Predictions
Figure 2 for Variational Information Pursuit with Large Language and Multimodal Models for Interpretable Predictions
Figure 3 for Variational Information Pursuit with Large Language and Multimodal Models for Interpretable Predictions
Figure 4 for Variational Information Pursuit with Large Language and Multimodal Models for Interpretable Predictions
Viaarxiv icon

Variational Information Pursuit for Interpretable Predictions

Add code
Feb 16, 2023
Viaarxiv icon