Cheng Yang

Silent Speech Interfaces in the Era of Large Language Models: A Comprehensive Taxonomy and Systematic Review

Mar 12, 2026

Graph Tokenization for Bridging Graphs and Transformers

Mar 11, 2026

From Narrow to Panoramic Vision: Attention-Guided Cold-Start Reshapes Multimodal Reasoning

Mar 04, 2026

ATA: Bridging Implicit Reasoning with Attention-Guided and Action-Guided Inference for Vision-Language Action Models

Mar 02, 2026

Fuz-RL: A Fuzzy-Guided Robust Framework for Safe Reinforcement Learning under Uncertainty

Feb 24, 2026

AutoWebWorld: Synthesizing Infinite Verifiable Web Environments via Finite State Machines

Feb 15, 2026

UReason: Benchmarking the Reasoning Paradox in Unified Multimodal Models

Feb 09, 2026

Multimodal Generative Retrieval Model with Staged Pretraining for Food Delivery on Meituan

Feb 06, 2026

Towards a Science of Collective AI: LLM-based Multi-Agent Systems Need a Transition from Blind Trial-and-Error to Rigorous Science

Feb 05, 2026

GLASS: A Generative Recommender for Long-sequence Modeling via SID-Tier and Semantic Search

Feb 05, 2026