Picture for Zhenyu Yang

Zhenyu Yang

C-MAG: Cascade Multimodal Attributed Graphs for Supply Chain Link Prediction

Add code
Aug 13, 2025
Viaarxiv icon

Efficient Agent: Optimizing Planning Capability for Multimodal Retrieval Augmented Generation

Add code
Aug 12, 2025
Viaarxiv icon

Embedding Radiomics into Vision Transformers for Multimodal Medical Image Classification

Add code
Apr 15, 2025
Viaarxiv icon

Improved Visual-Spatial Reasoning via R1-Zero-Like Training

Add code
Apr 01, 2025
Viaarxiv icon

H2VU-Benchmark: A Comprehensive Benchmark for Hierarchical Holistic Video Understanding

Add code
Mar 31, 2025
Viaarxiv icon

An Explainable Neural Radiomic Sequence Model with Spatiotemporal Continuity for Quantifying 4DCT-based Pulmonary Ventilation

Add code
Mar 31, 2025
Viaarxiv icon

LandMarkSystem Technical Report

Add code
Mar 27, 2025
Viaarxiv icon

Layton: Latent Consistency Tokenizer for 1024-pixel Image Reconstruction and Generation by 256 Tokens

Add code
Mar 12, 2025
Viaarxiv icon

X2I: Seamless Integration of Multimodal Understanding into Diffusion Transformer via Attention Distillation

Add code
Mar 08, 2025
Viaarxiv icon

Large-Scale AI in Telecom: Charting the Roadmap for Innovation, Scalability, and Enhanced Digital Experiences

Add code
Mar 06, 2025
Viaarxiv icon