Zhiqiang Shen

Attention Is All You Need for KV Cache in Diffusion LLMs

Oct 16, 2025

Prompting Test-Time Scaling Is A Strong LLM Reasoning Data Augmentation

Oct 10, 2025

SynMatch: Rethinking Consistency in Medical Image Segmentation with Sparse Annotations

Aug 10, 2025

ConStyX: Content Style Augmentation for Generalizable Medical Image Segmentation

Jun 12, 2025

Learning Efficient and Generalizable Graph Retriever for Knowledge-Graph Question Answering

Jun 11, 2025

Pruning Spurious Subgraphs for Graph Out-of-Distribution Generalization

Jun 06, 2025

VideoMolmo: Spatio-Temporal Grounding Meets Pointing

Jun 05, 2025

Time Blindness: Why Video-Language Models Can't See What Humans Can?

May 30, 2025

Open CaptchaWorld: A Comprehensive Web-based Platform for Testing and Benchmarking Multimodal LLM Agents

May 30, 2025

Uni4D: A Unified Self-Supervised Learning Framework for Point Cloud Videos

Apr 07, 2025