Chao Zhang

From a Social Cognitive Perspective: Context-aware Visual Social Relationship Recognition

Jun 12, 2024
Via arXiv

Can Large Language Models Understand Spatial Audio?

Jun 12, 2024
Via arXiv

An Improved Empirical Fisher Approximation for Natural Gradient Descent

Jun 10, 2024
Via arXiv

Spontaneous Speech-Based Suicide Risk Detection Using Whisper and Large Language Models

Jun 06, 2024
Via arXiv

PLaD: Preference-based Large Language Model Distillation with Pseudo-Preference Pairs

Jun 06, 2024
Via arXiv

HYDRA: Model Factorization Framework for Black-Box LLM Personalization

Jun 05, 2024
Via arXiv

GeminiFusion: Efficient Pixel-wise Multimodal Fusion for Vision Transformer

Jun 04, 2024
Via arXiv

Enhancing Zero-shot Text-to-Speech Synthesis with Human Feedback

Jun 02, 2024
Via arXiv

MSSC-BiMamba: Multimodal Sleep Stage Classification and Early Diagnosis of Sleep Disorders with Bidirectional Mamba

May 31, 2024
Via arXiv

NoteLLM-2: Multimodal Large Representation Models for Recommendation

May 27, 2024
Via arXiv