Yunhong Wang

PolySim: Bridging the Sim-to-Real Gap for Humanoid Control via Multi-Simulator Dynamics Randomization

Oct 02, 2025
Via arXiv

RepoDebug: Repository-Level Multi-Task and Multi-Language Debugging Evaluation of Large Language Models

Sep 04, 2025
Via arXiv

FIMA-Q: Post-Training Quantization for Vision Transformers by Fisher Information Matrix Approximation

Jun 13, 2025
Via arXiv

ContextQFormer: A New Context Modeling Method for Multi-Turn Multi-Modal Conversations

May 29, 2025
Via arXiv

TransBench: Breaking Barriers for Transferable Graphical User Interface Agents in Dynamic Digital Environments

May 23, 2025
Via arXiv

ToolSpectrum: Towards Personalized Tool Utilization for Large Language Models

May 19, 2025
Via arXiv

Towards Robust and Controllable Text-to-Motion via Masked Autoregressive Diffusion

May 16, 2025
Via arXiv

DDAE++: Enhancing Diffusion Models Towards Unified Generative and Discriminative Learning

May 16, 2025
Via arXiv

GODBench: A Benchmark for Multimodal Large Language Models in Video Comment Art

May 16, 2025
Via arXiv

SeriesBench: A Benchmark for Narrative-Driven Drama Series Understanding

Apr 30, 2025
Via arXiv