Yuhao Zhang

Department of Computer Science, University of Manchester, UK

GeoFM: Enhancing Geometric Reasoning of MLLMs via Synthetic Data Generation through Formal Language

Oct 31, 2025

UNILocPro: Unified Localization Integrating Model-Based Geometry and Channel Charting

Oct 31, 2025

EchoMind: An Interrelated Multi-level Benchmark for Evaluating Empathetic Speech Language Models

Oct 26, 2025

Lost in the Maze: Overcoming Context Limitations in Long-Horizon Agentic Search

Oct 21, 2025

EchoX: Towards Mitigating Acoustic-Semantic Gap via Echo Training for Speech-to-Speech LLMs

Sep 11, 2025

SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning

Sep 11, 2025

SPFT-SQL: Enhancing Large Language Model for Text-to-SQL Parsing by Self-Play Fine-Tuning

Sep 04, 2025

Adaptive Deep Reasoning: Triggering Deep Thinking When Needed

May 26, 2025

Leveraging Unit Language Guidance to Advance Speech Modeling in Textless Speech-to-Speech Translation

May 21, 2025

Anymate: A Dataset and Baselines for Learning 3D Object Rigging

May 09, 2025