Picture for Hongcheng Liu

Hongcheng Liu

Bridging the Dynamic Perception Gap: Training-Free Draft Chain-of-Thought for Dynamic Multimodal Spatial Reasoning

Add code
May 22, 2025
Viaarxiv icon

Drawing the Line: Enhancing Trustworthiness of MLLMs Through the Power of Refusal

Add code
Dec 15, 2024
Figure 1 for Drawing the Line: Enhancing Trustworthiness of MLLMs Through the Power of Refusal
Figure 2 for Drawing the Line: Enhancing Trustworthiness of MLLMs Through the Power of Refusal
Figure 3 for Drawing the Line: Enhancing Trustworthiness of MLLMs Through the Power of Refusal
Figure 4 for Drawing the Line: Enhancing Trustworthiness of MLLMs Through the Power of Refusal
Viaarxiv icon

Med-PMC: Medical Personalized Multi-modal Consultation with a Proactive Ask-First-Observe-Next Paradigm

Add code
Aug 16, 2024
Figure 1 for Med-PMC: Medical Personalized Multi-modal Consultation with a Proactive Ask-First-Observe-Next Paradigm
Figure 2 for Med-PMC: Medical Personalized Multi-modal Consultation with a Proactive Ask-First-Observe-Next Paradigm
Figure 3 for Med-PMC: Medical Personalized Multi-modal Consultation with a Proactive Ask-First-Observe-Next Paradigm
Figure 4 for Med-PMC: Medical Personalized Multi-modal Consultation with a Proactive Ask-First-Observe-Next Paradigm
Viaarxiv icon

Decoding Linguistic Representations of Human Brain

Add code
Jul 30, 2024
Figure 1 for Decoding Linguistic Representations of Human Brain
Figure 2 for Decoding Linguistic Representations of Human Brain
Figure 3 for Decoding Linguistic Representations of Human Brain
Figure 4 for Decoding Linguistic Representations of Human Brain
Viaarxiv icon

Stochastic First-Order Methods with Non-smooth and Non-Euclidean Proximal Terms for Nonconvex High-Dimensional Stochastic Optimization

Add code
Jun 27, 2024
Viaarxiv icon

M$^3$AV: A Multimodal, Multigenre, and Multipurpose Audio-Visual Academic Lecture Dataset

Add code
Mar 21, 2024
Viaarxiv icon

Automatic Interactive Evaluation for Large Language Models with State Aware Patient Simulator

Add code
Mar 14, 2024
Figure 1 for Automatic Interactive Evaluation for Large Language Models with State Aware Patient Simulator
Figure 2 for Automatic Interactive Evaluation for Large Language Models with State Aware Patient Simulator
Figure 3 for Automatic Interactive Evaluation for Large Language Models with State Aware Patient Simulator
Figure 4 for Automatic Interactive Evaluation for Large Language Models with State Aware Patient Simulator
Viaarxiv icon

M2K-VDG: Model-Adaptive Multimodal Knowledge Anchor Enhanced Video-grounded Dialogue Generation

Add code
Feb 19, 2024
Figure 1 for M2K-VDG: Model-Adaptive Multimodal Knowledge Anchor Enhanced Video-grounded Dialogue Generation
Figure 2 for M2K-VDG: Model-Adaptive Multimodal Knowledge Anchor Enhanced Video-grounded Dialogue Generation
Figure 3 for M2K-VDG: Model-Adaptive Multimodal Knowledge Anchor Enhanced Video-grounded Dialogue Generation
Figure 4 for M2K-VDG: Model-Adaptive Multimodal Knowledge Anchor Enhanced Video-grounded Dialogue Generation
Viaarxiv icon

MM-SAP: A Comprehensive Benchmark for Assessing Self-Awareness of Multimodal Large Language Models in Perception

Add code
Jan 15, 2024
Viaarxiv icon

New Sample Complexity Bounds for (Regularized) Sample Average Approximation in Several Heavy-Tailed, Non-Lipschitzian, and High-Dimensional Cases

Add code
Jan 01, 2024
Viaarxiv icon