Picture for Yang Zhao

Yang Zhao

Frank

LLMs for High-Frequency Decision-Making: Normalized Action Reward-Guided Consistency Policy Optimization

Add code
Mar 03, 2026
Viaarxiv icon

SAGE-LLM: Towards Safe and Generalizable LLM Controller with Fuzzy-CBF Verification and Graph-Structured Knowledge Retrieval for UAV Decision

Add code
Feb 27, 2026
Viaarxiv icon

SceneReVis: A Self-Reflective Vision-Grounded Framework for 3D Indoor Scene Synthesis via Multi-turn RL

Add code
Feb 10, 2026
Viaarxiv icon

Semantic Search At LinkedIn

Add code
Feb 07, 2026
Viaarxiv icon

ShardMemo: Masked MoE Routing for Sharded Agentic LLM Memory

Add code
Jan 29, 2026
Viaarxiv icon

SKANet: A Cognitive Dual-Stream Framework with Adaptive Modality Fusion for Robust Compound GNSS Interference Classification

Add code
Jan 19, 2026
Viaarxiv icon

PhyG-MoE: A Physics-Guided Mixture-of-Experts Framework for Energy-Efficient GNSS Interference Recognition

Add code
Jan 19, 2026
Viaarxiv icon

Consolidation or Adaptation? PRISM: Disentangling SFT and RL Data via Gradient Concentration

Add code
Jan 12, 2026
Viaarxiv icon

MAESTRO: Meta-learning Adaptive Estimation of Scalarization Trade-offs for Reward Optimization

Add code
Jan 12, 2026
Viaarxiv icon

HiSciBench: A Hierarchical Multi-disciplinary Benchmark for Scientific Intelligence from Reading to Discovery

Add code
Dec 28, 2025
Viaarxiv icon