Picture for Yan Yan

Yan Yan

Investigating the Design Space of Visual Grounding in Multimodal Large Language Model

Add code
Aug 11, 2025
Viaarxiv icon

Direct Prediction Set Minimization via Bilevel Conformal Classifier Training

Add code
Jun 07, 2025
Viaarxiv icon

DD-Ranking: Rethinking the Evaluation of Dataset Distillation

Add code
May 19, 2025
Viaarxiv icon

InfantAgent-Next: A Multimodal Generalist Agent for Automated Computer Interaction

Add code
May 16, 2025
Viaarxiv icon

Adaptive Fault-tolerant Control of Underwater Vehicles with Thruster Failures

Add code
Apr 22, 2025
Viaarxiv icon

3DResT: A Strong Baseline for Semi-Supervised 3D Referring Expression Segmentation

Add code
Apr 17, 2025
Viaarxiv icon

Enhancing Dance-to-Music Generation via Negative Conditioning Latent Diffusion Model

Add code
Mar 28, 2025
Viaarxiv icon

Conformal Prediction Sets for Deep Generative Models via Reduction to Conformal Regression

Add code
Mar 13, 2025
Viaarxiv icon

LabelCoRank: Revolutionizing Long Tail Multi-Label Classification with Co-Occurrence Reranking

Add code
Mar 11, 2025
Viaarxiv icon

X-Field: A Physically Grounded Representation for 3D X-ray Reconstruction

Add code
Mar 11, 2025
Viaarxiv icon