Yuan Zhang

The Ripple Effect: On Unforeseen Complications of Backdoor Attacks

May 16, 2025

Generative Models in Computational Pathology: A Comprehensive Survey on Methods, Applications, and Challenges

May 16, 2025

Generative Pre-trained Autoregressive Diffusion Transformer

May 15, 2025

STORYANCHORS: Generating Consistent Multi-Scene Story Frames for Long-Form Narratives

May 13, 2025

Evaluating Evaluation Metrics -- The Mirage of Hallucination Detection

Apr 25, 2025

Leave-One-Out Stable Conformal Prediction

Apr 16, 2025

Distilling Knowledge from Heterogeneous Architectures for Semantic Segmentation

Apr 10, 2025

TimeSearch: Hierarchical Video Search with Spotlight and Reflection for Human-like Long Video Understanding

Apr 02, 2025

MoLe-VLA: Dynamic Layer-skipping Vision Language Action Model via Mixture-of-Layers for Efficient Robot Manipulation

Mar 26, 2025

Universal Speech Token Learning via Low-Bitrate Neural Codec and Pretrained Representations

Mar 15, 2025