Hang Yu

From One-to-One to Many-to-Many: Dynamic Cross-Layer Injection for Deep Vision-Language Fusion

Jan 15, 2026
Via arXiv

C2LLM Technical Report: A New Frontier in Code Retrieval via Adaptive Cross-Attention Pooling

Dec 24, 2025
Via arXiv

Point What You Mean: Visually Grounded Instruction Policy

Dec 22, 2025
Via arXiv

SpaceDrive: Infusing Spatial Awareness into VLM-based Autonomous Driving

Dec 11, 2025
Via arXiv

HYPE: Hybrid Planning with Ego Proposal-Conditioned Predictions

Oct 14, 2025
Via arXiv

F2LLM Technical Report: Matching SOTA Embedding Performance with 6 Million Open-Source Data

Oct 02, 2025
Via arXiv

FinZero: Launching Multi-modal Financial Time Series Forecast with Large Reasoning Model

Sep 10, 2025
Via arXiv

MMGraphRAG: Bridging Vision and Language with Interpretable Multimodal Knowledge Graphs

Jul 28, 2025
Via arXiv

ABC: Adaptive BayesNet Structure Learning for Computational Scalable Multi-task Image Compression

Jun 18, 2025
Via arXiv

CHARM: Considering Human Attributes for Reinforcement Modeling

Jun 16, 2025
Via arXiv