Keze Wang

RaCoT: Plug-and-Play Contrastive Example Generation Mechanism for Enhanced LLM Reasoning Reliability

Oct 26, 2025
Via arXiv

Agent-GSPO: Communication-Efficient Multi-Agent Systems via Group Sequence Policy Optimization

Oct 26, 2025
Via arXiv

Guardian: Decoupling Exploration from Safety in Reinforcement Learning

Oct 26, 2025
Via arXiv

Backward-Friendly Optimization: Training Large Language Models with Approximate Gradients under Memory Constraints

Oct 26, 2025
Via arXiv

Top-Down Semantic Refinement for Image Captioning

Oct 25, 2025
Via arXiv

OSC: Cognitive Orchestration through Dynamic Knowledge Alignment in Multi-Agent LLM Collaboration

Sep 05, 2025
Via arXiv

DART: Dual Adaptive Refinement Transfer for Open-Vocabulary Multi-Label Recognition

Aug 07, 2025
Via arXiv

GAM-Agent: Game-Theoretic and Uncertainty-Aware Collaboration for Complex Visual Reasoning

May 29, 2025
Via arXiv

TimeCausality: Evaluating the Causal Ability in Time Dimension for Vision Language Models

May 21, 2025
Via arXiv

Exploiting Temporal Audio-Visual Correlation Embedding for Audio-Driven One-Shot Talking Head Animation

Apr 08, 2025
Via arXiv