Picture for Shihao Wang

Shihao Wang

VideoITG: Multimodal Video Understanding with Instructed Temporal Grounding

Add code
Jul 17, 2025
Viaarxiv icon

Eagle 2.5: Boosting Long-Context Post-Training for Frontier Vision-Language Models

Add code
Apr 21, 2025
Viaarxiv icon

The Tenth NTIRE 2025 Efficient Super-Resolution Challenge Report

Add code
Apr 14, 2025
Viaarxiv icon

OmniDrive: A Holistic Vision-Language Dataset for Autonomous Driving with Counterfactual Reasoning

Add code
Apr 06, 2025
Figure 1 for OmniDrive: A Holistic Vision-Language Dataset for Autonomous Driving with Counterfactual Reasoning
Figure 2 for OmniDrive: A Holistic Vision-Language Dataset for Autonomous Driving with Counterfactual Reasoning
Figure 3 for OmniDrive: A Holistic Vision-Language Dataset for Autonomous Driving with Counterfactual Reasoning
Figure 4 for OmniDrive: A Holistic Vision-Language Dataset for Autonomous Driving with Counterfactual Reasoning
Viaarxiv icon

Slow-Fast Architecture for Video Multi-Modal Large Language Models

Add code
Apr 02, 2025
Viaarxiv icon

InsViE-1M: Effective Instruction-based Video Editing with Elaborate Dataset Construction

Add code
Mar 26, 2025
Viaarxiv icon

Hydra-NeXt: Robust Closed-Loop Driving with Open-Loop Training

Add code
Mar 15, 2025
Viaarxiv icon

L3TC: Leveraging RWKV for Learned Lossless Low-Complexity Text Compression

Add code
Dec 24, 2024
Figure 1 for L3TC: Leveraging RWKV for Learned Lossless Low-Complexity Text Compression
Figure 2 for L3TC: Leveraging RWKV for Learned Lossless Low-Complexity Text Compression
Figure 3 for L3TC: Leveraging RWKV for Learned Lossless Low-Complexity Text Compression
Figure 4 for L3TC: Leveraging RWKV for Learned Lossless Low-Complexity Text Compression
Viaarxiv icon

Transformer-based toxin-protein interaction analysis prioritizes airborne particulate matter components with potential adverse health effects

Add code
Dec 21, 2024
Viaarxiv icon

StreamChat: Chatting with Streaming Video

Add code
Dec 11, 2024
Figure 1 for StreamChat: Chatting with Streaming Video
Figure 2 for StreamChat: Chatting with Streaming Video
Figure 3 for StreamChat: Chatting with Streaming Video
Figure 4 for StreamChat: Chatting with Streaming Video
Viaarxiv icon