Yifei Zhang

PRISM: Prosody-Integrated Multi-Agent Reasoning Framework for Empathetic Spoken Dialogue

Jun 11, 2026, via arXiv

Double Preconditioning (DoPr): Optimization for Test-Time Performance, not Validation Loss

Jun 04, 2026, via arXiv

SAILRec: Steering LLM Attention to Dual-Side Semantically Aligned Collaborative Embeddings for Recommendation

Jun 03, 2026, via arXiv

Beyond Text Following: Repairable Arbitration Reversals in Audio-Language Models

Jun 03, 2026, via arXiv

CR-Seg: Attention-Guided and CoT-Enhanced Coarse-to-Refined Reasoning Segmentation

Jun 03, 2026, via arXiv

GenPT: Beyond Self-Report for Reliable LLM Psychometrics via Generative Projective Testing

May 30, 2026, via arXiv

Masked Next-Scale Prediction for Self-supervised Scene Text Recognition

May 14, 2026, via arXiv

From Parameter Dynamics to Risk Scoring: Quantifying Sample-Level Safety Degradation in LLM Fine-tuning

May 06, 2026, via arXiv

From Concept to Capability: Evaluating 3D Gaussian Splatting for Synthetic Scene Editing in Autonomous Driving

May 03, 2026, via arXiv

Hierarchically Robust Zero-shot Vision-language Models

Apr 20, 2026, via arXiv