Picture for Heng Tao Shen

Heng Tao Shen

Unleashing the Potential of Neighbors: Diffusion-based Latent Neighbor Generation for Session-based Recommendation

Add code
Jan 07, 2026
Viaarxiv icon

Fast SAM2 with Text-Driven Token Pruning

Add code
Dec 24, 2025
Viaarxiv icon

MiVLA: Towards Generalizable Vision-Language-Action Model with Human-Robot Mutual Imitation Pre-training

Add code
Dec 19, 2025
Viaarxiv icon

A Semantically Enhanced Generative Foundation Model Improves Pathological Image Synthesis

Add code
Dec 16, 2025
Figure 1 for A Semantically Enhanced Generative Foundation Model Improves Pathological Image Synthesis
Figure 2 for A Semantically Enhanced Generative Foundation Model Improves Pathological Image Synthesis
Figure 3 for A Semantically Enhanced Generative Foundation Model Improves Pathological Image Synthesis
Figure 4 for A Semantically Enhanced Generative Foundation Model Improves Pathological Image Synthesis
Viaarxiv icon

GeoPurify: A Data-Efficient Geometric Distillation Framework for Open-Vocabulary 3D Segmentation

Add code
Oct 02, 2025
Figure 1 for GeoPurify: A Data-Efficient Geometric Distillation Framework for Open-Vocabulary 3D Segmentation
Figure 2 for GeoPurify: A Data-Efficient Geometric Distillation Framework for Open-Vocabulary 3D Segmentation
Figure 3 for GeoPurify: A Data-Efficient Geometric Distillation Framework for Open-Vocabulary 3D Segmentation
Figure 4 for GeoPurify: A Data-Efficient Geometric Distillation Framework for Open-Vocabulary 3D Segmentation
Viaarxiv icon

Explore Briefly, Then Decide: Mitigating LLM Overthinking via Cumulative Entropy Regulation

Add code
Oct 02, 2025
Figure 1 for Explore Briefly, Then Decide: Mitigating LLM Overthinking via Cumulative Entropy Regulation
Figure 2 for Explore Briefly, Then Decide: Mitigating LLM Overthinking via Cumulative Entropy Regulation
Figure 3 for Explore Briefly, Then Decide: Mitigating LLM Overthinking via Cumulative Entropy Regulation
Figure 4 for Explore Briefly, Then Decide: Mitigating LLM Overthinking via Cumulative Entropy Regulation
Viaarxiv icon

Unified modality separation: A vision-language framework for unsupervised domain adaptation

Add code
Aug 07, 2025
Figure 1 for Unified modality separation: A vision-language framework for unsupervised domain adaptation
Figure 2 for Unified modality separation: A vision-language framework for unsupervised domain adaptation
Figure 3 for Unified modality separation: A vision-language framework for unsupervised domain adaptation
Figure 4 for Unified modality separation: A vision-language framework for unsupervised domain adaptation
Viaarxiv icon

Implicit Counterfactual Learning for Audio-Visual Segmentation

Add code
Jul 28, 2025
Figure 1 for Implicit Counterfactual Learning for Audio-Visual Segmentation
Figure 2 for Implicit Counterfactual Learning for Audio-Visual Segmentation
Figure 3 for Implicit Counterfactual Learning for Audio-Visual Segmentation
Figure 4 for Implicit Counterfactual Learning for Audio-Visual Segmentation
Viaarxiv icon

Multimodal Mathematical Reasoning with Diverse Solving Perspective

Add code
Jul 03, 2025
Figure 1 for Multimodal Mathematical Reasoning with Diverse Solving Perspective
Figure 2 for Multimodal Mathematical Reasoning with Diverse Solving Perspective
Figure 3 for Multimodal Mathematical Reasoning with Diverse Solving Perspective
Figure 4 for Multimodal Mathematical Reasoning with Diverse Solving Perspective
Viaarxiv icon

SafePTR: Token-Level Jailbreak Defense in Multimodal LLMs via Prune-then-Restore Mechanism

Add code
Jul 02, 2025
Viaarxiv icon