Yu Tian

Rutgers University

TUNA: Comprehensive Fine-grained Temporal Understanding Evaluation on Dense Dynamic Videos

May 26, 2025
Via arXiv

Modality Curation: Building Universal Embeddings for Advanced Multimodal Information Retrieval

May 26, 2025
Via arXiv

ViPlan: A Benchmark for Visual Planning with Symbolic Predicates and Vision-Language Models

May 19, 2025
Via arXiv

From Questions to Clinical Recommendations: Large Language Models Driving Evidence-Based Clinical Decision Making

May 15, 2025
Via arXiv

Seedream 3.0 Technical Report

Apr 16, 2025
Via arXiv

Feature-Aware Malicious Output Detection and Mitigation

Apr 12, 2025
Via arXiv

Open-Qwen2VL: Compute-Efficient Pre-Training of Fully-Open Multimodal LLMs on Academic Resources

Apr 02, 2025
Via arXiv

FakeScope: Large Multimodal Expert Model for Transparent AI-Generated Image Forensics

Mar 31, 2025
Via arXiv

Limits of KV Cache Compression for Tensor Attention based Autoregressive Transformers

Mar 14, 2025
Via arXiv

Theoretical Guarantees for High Order Trajectory Refinement in Generative Flows

Mar 12, 2025
Via arXiv