Information


Visual Contextual Attack: Jailbreaking MLLMs with Image-Driven Context Injection

Add code
Jul 03, 2025
Viaarxiv icon

Point3R: Streaming 3D Reconstruction with Explicit Spatial Pointer Memory

Add code
Jul 03, 2025
Viaarxiv icon

RefTok: Reference-Based Tokenization for Video Generation

Add code
Jul 03, 2025
Viaarxiv icon

Bootstrapping Grounded Chain-of-Thought in Multimodal LLMs for Data-Efficient Model Adaptation

Add code
Jul 03, 2025
Viaarxiv icon

Requirements Elicitation Follow-Up Question Generation

Add code
Jul 03, 2025
Viaarxiv icon

MvHo-IB: Multi-View Higher-Order Information Bottleneck for Brain Disorder Diagnosis

Add code
Jul 03, 2025
Viaarxiv icon

LLM-Driven Treatment Effect Estimation Under Inference Time Text Confounding

Add code
Jul 03, 2025
Viaarxiv icon

Towards Perception-Informed Latent HRTF Representations

Add code
Jul 03, 2025
Viaarxiv icon

LangScene-X: Reconstruct Generalizable 3D Language-Embedded Scenes with TriMap Video Diffusion

Add code
Jul 03, 2025
Viaarxiv icon

Knowledge Protocol Engineering: A New Paradigm for AI in Domain-Specific Knowledge Work

Add code
Jul 03, 2025
Viaarxiv icon