Picture for Huahui Yi

Huahui Yi

SaFeR-ToolKit: Structured Reasoning via Virtual Tool Calling for Multimodal Safety

Add code
Mar 03, 2026
Viaarxiv icon

Improving Medical Visual Reinforcement Fine-Tuning via Perception and Reasoning Augmentation

Add code
Feb 11, 2026
Viaarxiv icon

ClueTracer: Question-to-Vision Clue Tracing for Training-Free Hallucination Suppression in Multimodal Reasoning

Add code
Feb 02, 2026
Viaarxiv icon

RareAlert: Aligning heterogeneous large language model reasoning for early rare disease risk screening

Add code
Jan 26, 2026
Viaarxiv icon

SaFeR-VLM: Toward Safety-aware Fine-grained Reasoning in Multimodal Models

Add code
Oct 08, 2025
Viaarxiv icon

Guiding Medical Vision-Language Models with Explicit Visual Prompts: Framework Design and Comprehensive Exploration of Prompt Variations

Add code
Jan 04, 2025
Figure 1 for Guiding Medical Vision-Language Models with Explicit Visual Prompts: Framework Design and Comprehensive Exploration of Prompt Variations
Figure 2 for Guiding Medical Vision-Language Models with Explicit Visual Prompts: Framework Design and Comprehensive Exploration of Prompt Variations
Figure 3 for Guiding Medical Vision-Language Models with Explicit Visual Prompts: Framework Design and Comprehensive Exploration of Prompt Variations
Figure 4 for Guiding Medical Vision-Language Models with Explicit Visual Prompts: Framework Design and Comprehensive Exploration of Prompt Variations
Viaarxiv icon

Evaluating Hallucination in Text-to-Image Diffusion Models with Scene-Graph based Question-Answering Agent

Add code
Dec 07, 2024
Figure 1 for Evaluating Hallucination in Text-to-Image Diffusion Models with Scene-Graph based Question-Answering Agent
Figure 2 for Evaluating Hallucination in Text-to-Image Diffusion Models with Scene-Graph based Question-Answering Agent
Figure 3 for Evaluating Hallucination in Text-to-Image Diffusion Models with Scene-Graph based Question-Answering Agent
Figure 4 for Evaluating Hallucination in Text-to-Image Diffusion Models with Scene-Graph based Question-Answering Agent
Viaarxiv icon

Unified Modeling Enhanced Multimodal Learning for Precision Neuro-Oncology

Add code
Jun 11, 2024
Figure 1 for Unified Modeling Enhanced Multimodal Learning for Precision Neuro-Oncology
Figure 2 for Unified Modeling Enhanced Multimodal Learning for Precision Neuro-Oncology
Figure 3 for Unified Modeling Enhanced Multimodal Learning for Precision Neuro-Oncology
Figure 4 for Unified Modeling Enhanced Multimodal Learning for Precision Neuro-Oncology
Viaarxiv icon

MLAE: Masked LoRA Experts for Parameter-Efficient Fine-Tuning

Add code
May 29, 2024
Figure 1 for MLAE: Masked LoRA Experts for Parameter-Efficient Fine-Tuning
Figure 2 for MLAE: Masked LoRA Experts for Parameter-Efficient Fine-Tuning
Figure 3 for MLAE: Masked LoRA Experts for Parameter-Efficient Fine-Tuning
Figure 4 for MLAE: Masked LoRA Experts for Parameter-Efficient Fine-Tuning
Viaarxiv icon

ConES: Concept Embedding Search for Parameter Efficient Tuning Large Vision Language Models

Add code
May 30, 2023
Figure 1 for ConES: Concept Embedding Search for Parameter Efficient Tuning Large Vision Language Models
Figure 2 for ConES: Concept Embedding Search for Parameter Efficient Tuning Large Vision Language Models
Figure 3 for ConES: Concept Embedding Search for Parameter Efficient Tuning Large Vision Language Models
Figure 4 for ConES: Concept Embedding Search for Parameter Efficient Tuning Large Vision Language Models
Viaarxiv icon