Picture for Yutong Lu

Yutong Lu

AIM: Asymmetric Information Masking for Visual Question Answering Continual Learning

Add code
Apr 16, 2026
Viaarxiv icon

GTPBD-MM: A Global Terraced Parcel and Boundary Dataset with Multi-Modality

Add code
Apr 14, 2026
Viaarxiv icon

HM-Bench: A Comprehensive Benchmark for Multimodal Large Language Models in Hyperspectral Remote Sensing

Add code
Apr 10, 2026
Viaarxiv icon

Beyond Few-Step Inference: Accelerating Video Diffusion Transformer Model Serving with Inter-Request Caching Reuse

Add code
Apr 06, 2026
Viaarxiv icon

AGCD: Agent-Guided Cross-Modal Decoding for Weather Forecasting

Add code
Mar 16, 2026
Viaarxiv icon

AgroNVILA: Perception-Reasoning Decoupling for Multi-view Agricultural Multimodal Large Language Models

Add code
Mar 15, 2026
Viaarxiv icon

AscendKernelGen: A Systematic Study of LLM-Based Kernel Generation for Neural Processing Units

Add code
Jan 12, 2026
Viaarxiv icon

Equivariant Diffusion for Crystal Structure Prediction

Add code
Dec 08, 2025
Viaarxiv icon

PolyKAN: Efficient Fused GPU Operators for Polynomial Kolmogorov-Arnold Network Variants

Add code
Nov 18, 2025
Viaarxiv icon

MolGraph-xLSTM: A graph-based dual-level xLSTM framework with multi-head mixture-of-experts for enhanced molecular representation and interpretability

Add code
Jan 30, 2025
Viaarxiv icon