Picture for Yiming Sun

Yiming Sun

X-MethaneWet: A Cross-scale Global Wetland Methane Emission Benchmark Dataset for Advancing Science Discovery with AI

Add code
May 23, 2025
Viaarxiv icon

From Biometrics to Environmental Control: AI-Enhanced Digital Twins for Personalized Health Interventions in Healing Landscapes

Add code
May 04, 2025
Viaarxiv icon

Multi-Scale Graph Learning for Anti-Sparse Downscaling

Add code
May 03, 2025
Viaarxiv icon

Detect-and-Guide: Self-regulation of Diffusion Models for Safe Text-to-Image Generation via Guideline Token Optimization

Add code
Mar 19, 2025
Viaarxiv icon

Rethinking Multi-modal Object Detection from the Perspective of Mono-Modality Feature Learning

Add code
Mar 14, 2025
Viaarxiv icon

Scalable In-Context Learning on Tabular Data via Retrieval-Augmented Large Language Models

Add code
Feb 05, 2025
Figure 1 for Scalable In-Context Learning on Tabular Data via Retrieval-Augmented Large Language Models
Figure 2 for Scalable In-Context Learning on Tabular Data via Retrieval-Augmented Large Language Models
Figure 3 for Scalable In-Context Learning on Tabular Data via Retrieval-Augmented Large Language Models
Figure 4 for Scalable In-Context Learning on Tabular Data via Retrieval-Augmented Large Language Models
Viaarxiv icon

Integrating Object Detection Modality into Visual Language Model for Enhanced Autonomous Driving Agent

Add code
Nov 08, 2024
Figure 1 for Integrating Object Detection Modality into Visual Language Model for Enhanced Autonomous Driving Agent
Figure 2 for Integrating Object Detection Modality into Visual Language Model for Enhanced Autonomous Driving Agent
Figure 3 for Integrating Object Detection Modality into Visual Language Model for Enhanced Autonomous Driving Agent
Figure 4 for Integrating Object Detection Modality into Visual Language Model for Enhanced Autonomous Driving Agent
Viaarxiv icon

Dynamic Brightness Adaptation for Robust Multi-modal Image Fusion

Add code
Nov 07, 2024
Figure 1 for Dynamic Brightness Adaptation for Robust Multi-modal Image Fusion
Figure 2 for Dynamic Brightness Adaptation for Robust Multi-modal Image Fusion
Figure 3 for Dynamic Brightness Adaptation for Robust Multi-modal Image Fusion
Figure 4 for Dynamic Brightness Adaptation for Robust Multi-modal Image Fusion
Viaarxiv icon

ChatTracker: Enhancing Visual Tracking Performance via Chatting with Multimodal Large Language Model

Add code
Nov 04, 2024
Figure 1 for ChatTracker: Enhancing Visual Tracking Performance via Chatting with Multimodal Large Language Model
Figure 2 for ChatTracker: Enhancing Visual Tracking Performance via Chatting with Multimodal Large Language Model
Figure 3 for ChatTracker: Enhancing Visual Tracking Performance via Chatting with Multimodal Large Language Model
Figure 4 for ChatTracker: Enhancing Visual Tracking Performance via Chatting with Multimodal Large Language Model
Viaarxiv icon

Learning Multimodal Cues of Children's Uncertainty

Add code
Oct 17, 2024
Figure 1 for Learning Multimodal Cues of Children's Uncertainty
Figure 2 for Learning Multimodal Cues of Children's Uncertainty
Figure 3 for Learning Multimodal Cues of Children's Uncertainty
Figure 4 for Learning Multimodal Cues of Children's Uncertainty
Viaarxiv icon