Picture for Huan Wang

Huan Wang

Stephen

OmniAgent: Audio-Guided Active Perception Agent for Omnimodal Audio-Video Understanding

Add code
Dec 29, 2025
Viaarxiv icon

StreamingAssistant: Efficient Visual Token Pruning for Accelerating Online Video Understanding

Add code
Dec 14, 2025
Viaarxiv icon

OmniZip: Audio-Guided Dynamic Token Compression for Fast Omnimodal Large Language Models

Add code
Nov 18, 2025
Figure 1 for OmniZip: Audio-Guided Dynamic Token Compression for Fast Omnimodal Large Language Models
Figure 2 for OmniZip: Audio-Guided Dynamic Token Compression for Fast Omnimodal Large Language Models
Figure 3 for OmniZip: Audio-Guided Dynamic Token Compression for Fast Omnimodal Large Language Models
Figure 4 for OmniZip: Audio-Guided Dynamic Token Compression for Fast Omnimodal Large Language Models
Viaarxiv icon

LoCoBench-Agent: An Interactive Benchmark for LLM Agents in Long-Context Software Engineering

Add code
Nov 17, 2025
Figure 1 for LoCoBench-Agent: An Interactive Benchmark for LLM Agents in Long-Context Software Engineering
Figure 2 for LoCoBench-Agent: An Interactive Benchmark for LLM Agents in Long-Context Software Engineering
Figure 3 for LoCoBench-Agent: An Interactive Benchmark for LLM Agents in Long-Context Software Engineering
Figure 4 for LoCoBench-Agent: An Interactive Benchmark for LLM Agents in Long-Context Software Engineering
Viaarxiv icon

GeoGNN: Quantifying and Mitigating Semantic Drift in Text-Attributed Graphs

Add code
Nov 12, 2025
Viaarxiv icon

ToolLibGen: Scalable Automatic Tool Creation and Aggregation for LLM Reasoning

Add code
Oct 09, 2025
Figure 1 for ToolLibGen: Scalable Automatic Tool Creation and Aggregation for LLM Reasoning
Figure 2 for ToolLibGen: Scalable Automatic Tool Creation and Aggregation for LLM Reasoning
Figure 3 for ToolLibGen: Scalable Automatic Tool Creation and Aggregation for LLM Reasoning
Figure 4 for ToolLibGen: Scalable Automatic Tool Creation and Aggregation for LLM Reasoning
Viaarxiv icon

RewardMap: Tackling Sparse Rewards in Fine-grained Visual Reasoning via Multi-Stage Reinforcement Learning

Add code
Oct 02, 2025
Figure 1 for RewardMap: Tackling Sparse Rewards in Fine-grained Visual Reasoning via Multi-Stage Reinforcement Learning
Figure 2 for RewardMap: Tackling Sparse Rewards in Fine-grained Visual Reasoning via Multi-Stage Reinforcement Learning
Figure 3 for RewardMap: Tackling Sparse Rewards in Fine-grained Visual Reasoning via Multi-Stage Reinforcement Learning
Figure 4 for RewardMap: Tackling Sparse Rewards in Fine-grained Visual Reasoning via Multi-Stage Reinforcement Learning
Viaarxiv icon

LoCoBench: A Benchmark for Long-Context Large Language Models in Complex Software Engineering

Add code
Sep 11, 2025
Figure 1 for LoCoBench: A Benchmark for Long-Context Large Language Models in Complex Software Engineering
Figure 2 for LoCoBench: A Benchmark for Long-Context Large Language Models in Complex Software Engineering
Figure 3 for LoCoBench: A Benchmark for Long-Context Large Language Models in Complex Software Engineering
Figure 4 for LoCoBench: A Benchmark for Long-Context Large Language Models in Complex Software Engineering
Viaarxiv icon

On-Device Diffusion Transformer Policy for Efficient Robot Manipulation

Add code
Aug 01, 2025
Viaarxiv icon

UserBench: An Interactive Gym Environment for User-Centric Agents

Add code
Jul 29, 2025
Figure 1 for UserBench: An Interactive Gym Environment for User-Centric Agents
Figure 2 for UserBench: An Interactive Gym Environment for User-Centric Agents
Figure 3 for UserBench: An Interactive Gym Environment for User-Centric Agents
Figure 4 for UserBench: An Interactive Gym Environment for User-Centric Agents
Viaarxiv icon