Picture for Yu Tian

Yu Tian

Rutgers University

MultiMedEdit: A Scenario-Aware Benchmark for Evaluating Knowledge Editing in Medical VQA

Add code
Aug 09, 2025
Viaarxiv icon

Integrating clinical reasoning into large language model-based diagnosis through etiology-aware attention steering

Add code
Aug 01, 2025
Viaarxiv icon

Modality Curation: Building Universal Embeddings for Advanced Multimodal Information Retrieval

Add code
May 26, 2025
Viaarxiv icon

TUNA: Comprehensive Fine-grained Temporal Understanding Evaluation on Dense Dynamic Videos

Add code
May 26, 2025
Viaarxiv icon

ViPlan: A Benchmark for Visual Planning with Symbolic Predicates and Vision-Language Models

Add code
May 19, 2025
Viaarxiv icon

From Questions to Clinical Recommendations: Large Language Models Driving Evidence-Based Clinical Decision Making

Add code
May 15, 2025
Viaarxiv icon

Seedream 3.0 Technical Report

Add code
Apr 16, 2025
Viaarxiv icon

Feature-Aware Malicious Output Detection and Mitigation

Add code
Apr 12, 2025
Figure 1 for Feature-Aware Malicious Output Detection and Mitigation
Figure 2 for Feature-Aware Malicious Output Detection and Mitigation
Figure 3 for Feature-Aware Malicious Output Detection and Mitigation
Figure 4 for Feature-Aware Malicious Output Detection and Mitigation
Viaarxiv icon

Open-Qwen2VL: Compute-Efficient Pre-Training of Fully-Open Multimodal LLMs on Academic Resources

Add code
Apr 02, 2025
Viaarxiv icon

FakeScope: Large Multimodal Expert Model for Transparent AI-Generated Image Forensics

Add code
Mar 31, 2025
Figure 1 for FakeScope: Large Multimodal Expert Model for Transparent AI-Generated Image Forensics
Figure 2 for FakeScope: Large Multimodal Expert Model for Transparent AI-Generated Image Forensics
Figure 3 for FakeScope: Large Multimodal Expert Model for Transparent AI-Generated Image Forensics
Figure 4 for FakeScope: Large Multimodal Expert Model for Transparent AI-Generated Image Forensics
Viaarxiv icon