Ya Zhang

CoPAD: Multi-source Trajectory Fusion and Cooperative Trajectory Prediction with Anchor-oriented Decoder in V2X Scenarios

Sep 19, 2025
Via arXiv

Visual Programmability: A Guide for Code-as-Thought in Chart Understanding

Sep 11, 2025
Via arXiv

Wide-In, Narrow-Out: Revokable Decoding for Efficient and Effective DLLMs

Jul 24, 2025
Via arXiv

Differential-informed Sample Selection Accelerates Multimodal Contrastive Learning

Jul 17, 2025
Via arXiv

ConText: Driving In-context Learning for Text Removal and Segmentation

Jun 04, 2025
Via arXiv

SpatialScore: Towards Unified Evaluation for Multimodal Spatial Understanding

May 22, 2025
Via arXiv

AutoMedEval: Harnessing Language Models for Automatic Medical Capability Evaluation

May 17, 2025
Via arXiv

Multi-Agent System for Comprehensive Soccer Understanding

May 06, 2025
Via arXiv

Multi-Scale Target-Aware Representation Learning for Fundus Image Enhancement

May 03, 2025
Via arXiv

Combatting Dimensional Collapse in LLM Pre-Training Data via Diversified File Selection

Apr 29, 2025
Via arXiv