Picture for Hao Li

Hao Li

Jack

ZeroGUI: Automating Online GUI Learning at Zero Human Cost

Add code
May 29, 2025
Figure 1 for ZeroGUI: Automating Online GUI Learning at Zero Human Cost
Figure 2 for ZeroGUI: Automating Online GUI Learning at Zero Human Cost
Figure 3 for ZeroGUI: Automating Online GUI Learning at Zero Human Cost
Figure 4 for ZeroGUI: Automating Online GUI Learning at Zero Human Cost
Viaarxiv icon

Wav2Sem: Plug-and-Play Audio Semantic Decoupling for 3D Speech-Driven Facial Animation

Add code
May 29, 2025
Viaarxiv icon

SDPO: Importance-Sampled Direct Preference Optimization for Stable Diffusion Training

Add code
May 28, 2025
Figure 1 for SDPO: Importance-Sampled Direct Preference Optimization for Stable Diffusion Training
Figure 2 for SDPO: Importance-Sampled Direct Preference Optimization for Stable Diffusion Training
Figure 3 for SDPO: Importance-Sampled Direct Preference Optimization for Stable Diffusion Training
Figure 4 for SDPO: Importance-Sampled Direct Preference Optimization for Stable Diffusion Training
Viaarxiv icon

Beyond Chemical QA: Evaluating LLM's Chemical Reasoning with Modular Chemical Operations

Add code
May 27, 2025
Viaarxiv icon

Rethinking Text-based Protein Understanding: Retrieval or LLM?

Add code
May 26, 2025
Viaarxiv icon

The Avengers: A Simple Recipe for Uniting Smaller Language Models to Challenge Proprietary Giants

Add code
May 26, 2025
Figure 1 for The Avengers: A Simple Recipe for Uniting Smaller Language Models to Challenge Proprietary Giants
Figure 2 for The Avengers: A Simple Recipe for Uniting Smaller Language Models to Challenge Proprietary Giants
Figure 3 for The Avengers: A Simple Recipe for Uniting Smaller Language Models to Challenge Proprietary Giants
Figure 4 for The Avengers: A Simple Recipe for Uniting Smaller Language Models to Challenge Proprietary Giants
Viaarxiv icon

Self-Critique Guided Iterative Reasoning for Multi-hop Question Answering

Add code
May 25, 2025
Viaarxiv icon

Learning Adaptive and Temporally Causal Video Tokenization in a 1D Latent Space

Add code
May 22, 2025
Viaarxiv icon

When to Continue Thinking: Adaptive Thinking Mode Switching for Efficient Reasoning

Add code
May 21, 2025
Figure 1 for When to Continue Thinking: Adaptive Thinking Mode Switching for Efficient Reasoning
Figure 2 for When to Continue Thinking: Adaptive Thinking Mode Switching for Efficient Reasoning
Figure 3 for When to Continue Thinking: Adaptive Thinking Mode Switching for Efficient Reasoning
Figure 4 for When to Continue Thinking: Adaptive Thinking Mode Switching for Efficient Reasoning
Viaarxiv icon

CAD: A General Multimodal Framework for Video Deepfake Detection via Cross-Modal Alignment and Distillation

Add code
May 21, 2025
Viaarxiv icon