Weinong Wang

Uni-OPD: Unifying On-Policy Distillation with a Dual-Perspective Recipe

May 05, 2026

CF-VLA: Efficient Coarse-to-Fine Action Generation for Vision-Language-Action Policies

Apr 28, 2026

Ego-InBetween: Generating Object State Transitions in Ego-Centric Videos

Apr 20, 2026

Enhancing MLLM Spatial Understanding via Active 3D Scene Exploration for Multi-Perspective Reasoning

Apr 08, 2026

VideoTIR: Accurate Understanding for Long Videos with Efficient Tool-Integrated Reasoning

Mar 26, 2026

MMTIT-Bench: A Multilingual and Multi-Scenario Benchmark with Cognition-Perception-Reasoning Guided Text-Image Machine Translation

Mar 25, 2026

Towards Real-World Document Parsing via Realistic Scene Synthesis and Document-Aware Training

Mar 25, 2026

LiteLong: Resource-Efficient Long-Context Data Synthesis for LLMs

Sep 19, 2025

Omni-DPO: A Dual-Perspective Paradigm for Dynamic Preference Learning of LLMs

Jun 11, 2025

Exploring Implicit Visual Misunderstandings in Multimodal Large Language Models through Attention Analysis

May 15, 2025