Yanfeng Wang

Cooperative Medianet Innovation Center, Shanghai Jiao Tong University, China and Shanghai AI Laboratory, China

VRVVC: Variable-Rate NeRF-Based Volumetric Video Compression

Dec 16, 2024

Drawing the Line: Enhancing Trustworthiness of MLLMs Through the Power of Refusal

Dec 15, 2024

Can Modern LLMs Act as Agent Cores in Radiology Environments?

Dec 12, 2024

MRGen: Diffusion-based Controllable Data Engine for MRI Segmentation towards Unannotated Modalities

Dec 04, 2024

LamRA: Large Multimodal Model as Your Advanced Retrieval Assistant

Dec 02, 2024

Advancing Myopia To Holism: Fully Contrastive Language-Image Pre-training

Nov 30, 2024

MambaTrack: Exploiting Dual-Enhancement for Night UAV Tracking

Nov 24, 2024

AuscultaBase: A Foundational Step Towards AI-Powered Body Sound Diagnostics

Nov 12, 2024

Task-Aware Harmony Multi-Task Decision Transformer for Offline Reinforcement Learning

Nov 02, 2024

ReflecTool: Towards Reflection-Aware Tool-Augmented Clinical Agents

Oct 23, 2024