Hongcheng Liu

VocalBench-zh: Decomposing and Benchmarking the Speech Conversational Abilities in Mandarin Context

Nov 17, 2025

Selecting Auxiliary Data via Neural Tangent Kernels for Low-Resource Domains

Nov 10, 2025

Bridging the Dynamic Perception Gap: Training-Free Draft Chain-of-Thought for Dynamic Multimodal Spatial Reasoning

May 22, 2025

Drawing the Line: Enhancing Trustworthiness of MLLMs Through the Power of Refusal

Dec 15, 2024

Med-PMC: Medical Personalized Multi-modal Consultation with a Proactive Ask-First-Observe-Next Paradigm

Aug 16, 2024

Decoding Linguistic Representations of Human Brain

Jul 30, 2024

Stochastic First-Order Methods with Non-smooth and Non-Euclidean Proximal Terms for Nonconvex High-Dimensional Stochastic Optimization

Jun 27, 2024

M$^3$AV: A Multimodal, Multigenre, and Multipurpose Audio-Visual Academic Lecture Dataset

Mar 21, 2024

Automatic Interactive Evaluation for Large Language Models with State Aware Patient Simulator

Mar 14, 2024

M2K-VDG: Model-Adaptive Multimodal Knowledge Anchor Enhanced Video-grounded Dialogue Generation

Feb 19, 2024