Picture for Yifan Zhang

Yifan Zhang

University of Nottingham Ningbo China

Think Hierarchically, Act Dynamically: Hierarchical Multi-modal Fusion and Reasoning for Vision-and-Language Navigation

Add code
Apr 23, 2025
Viaarxiv icon

QuaDMix: Quality-Diversity Balanced Data Selection for Efficient LLM Pretraining

Add code
Apr 23, 2025
Viaarxiv icon

COBRA: Algorithm-Architecture Co-optimized Binary Transformer Accelerator for Edge Inference

Add code
Apr 22, 2025
Viaarxiv icon

TeLLMe: An Energy-Efficient Ternary LLM Accelerator for Prefilling and Decoding on Edge FPGAs

Add code
Apr 22, 2025
Viaarxiv icon

Guiding VLM Agents with Process Rewards at Inference Time for GUI Navigation

Add code
Apr 22, 2025
Viaarxiv icon

A Comprehensive Survey in LLM(-Agent) Full Stack Safety: Data, Training and Deployment

Add code
Apr 22, 2025
Viaarxiv icon

Integrating Artificial Intelligence with Human Expertise: An In-depth Analysis of ChatGPT's Capabilities in Generating Metamorphic Relations

Add code
Mar 28, 2025
Viaarxiv icon

Network-wide Freeway Traffic Estimation Using Sparse Sensor Data: A Dirichlet Graph Auto-Encoder Approach

Add code
Mar 20, 2025
Viaarxiv icon

Enhancing Code LLM Training with Programmer Attention

Add code
Mar 19, 2025
Viaarxiv icon

DriveGEN: Generalized and Robust 3D Detection in Driving via Controllable Text-to-Image Diffusion Generation

Add code
Mar 14, 2025
Viaarxiv icon