Yujun Cai

Learning to Generate Cross-Task Unexploitable Examples

Dec 15, 2025
Via arXiv

Spatial Blind Spot: Auditory Motion Perception Deficits in Audio LLMs

Nov 17, 2025
Via arXiv

PAS: A Training-Free Stabilizer for Temporal Encoding in Video LLMs

Nov 14, 2025
Via arXiv

A Survey of Vibe Coding with Large Language Models

Oct 14, 2025
Via arXiv

Detecting and Mitigating Insertion Hallucination in Video-to-Audio Generation

Oct 09, 2025
Via arXiv

ContextNav: Towards Agentic Multimodal In-Context Learning

Oct 06, 2025
Via arXiv

Blockwise SFT for Diffusion Language Models: Reconciling Bidirectional Attention and Autoregressive Decoding

Aug 27, 2025
Via arXiv

VistaWise: Building Cost-Effective Agent with Cross-Modal Knowledge Graph for Minecraft

Aug 26, 2025
Via arXiv

MRFD: Multi-Region Fusion Decoding with Self-Consistency for Mitigating Hallucinations in LVLMs

Aug 14, 2025
Via arXiv

$A^2R^2$: Advancing Img2LaTeX Conversion via Visual Reasoning with Attention-Guided Refinement

Jul 28, 2025
Via arXiv