Picture for Jiacheng Pang

Jiacheng Pang

Adaptive Collaboration with Humans: Metacognitive Policy Optimization for Multi-Agent LLMs with Continual Learning

Add code
Mar 09, 2026
Viaarxiv icon

MoD-DPO: Towards Mitigating Cross-modal Hallucinations in Omni LLMs using Modality Decoupled Preference Optimization

Add code
Mar 03, 2026
Viaarxiv icon

AVERE: Improving Audiovisual Emotion Reasoning with Preference Optimization

Add code
Feb 04, 2026
Viaarxiv icon

Maestro: Learning to Collaborate via Conditional Listwise Policy Optimization for Multi-Agent LLMs

Add code
Nov 08, 2025
Viaarxiv icon