Yi Wang

NUS

TraveLLaMA: Facilitating Multi-modal Large Language Models to Understand Urban Scenes and Provide Travel Assistance

Apr 23, 2025
Via arXiv

The Athenian Academy: A Seven-Layer Architecture Model for Multi-Agent Systems

Apr 18, 2025
Via arXiv

InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models

Apr 15, 2025
Via arXiv

VideoChat-R1: Enhancing Spatio-Temporal Perception via Reinforcement Fine-Tuning

Apr 10, 2025
Via arXiv

HRMedSeg: Unlocking High-resolution Medical Image segmentation via Memory-efficient Attention Modeling

Apr 08, 2025
Via arXiv

Revealing the Intrinsic Ethical Vulnerability of Aligned Large Language Models

Apr 07, 2025
Via arXiv

Benchmark of Segmentation Techniques for Pelvic Fracture in CT and X-ray: Summary of the PENGWIN 2024 Challenge

Apr 03, 2025
Via arXiv

ANNEXE: Unified Analyzing, Answering, and Pixel Grounding for Egocentric Interaction

Apr 02, 2025
Via arXiv

Make Your Training Flexible: Towards Deployment-Efficient Video Models

Mar 18, 2025
Via arXiv

Towards a Unified Copernicus Foundation Model for Earth Vision

Mar 14, 2025
Via arXiv