Picture for Sheng Wang

Sheng Wang

Adaptation of Agentic AI

Add code
Dec 22, 2025
Figure 1 for Adaptation of Agentic AI
Figure 2 for Adaptation of Agentic AI
Figure 3 for Adaptation of Agentic AI
Figure 4 for Adaptation of Agentic AI
Viaarxiv icon

Task Vector in TTS: Toward Emotionally Expressive Dialectal Speech Synthesis

Add code
Dec 21, 2025
Viaarxiv icon

FutureX: Enhance End-to-End Autonomous Driving via Latent Chain-of-Thought World Model

Add code
Dec 12, 2025
Figure 1 for FutureX: Enhance End-to-End Autonomous Driving via Latent Chain-of-Thought World Model
Figure 2 for FutureX: Enhance End-to-End Autonomous Driving via Latent Chain-of-Thought World Model
Figure 3 for FutureX: Enhance End-to-End Autonomous Driving via Latent Chain-of-Thought World Model
Figure 4 for FutureX: Enhance End-to-End Autonomous Driving via Latent Chain-of-Thought World Model
Viaarxiv icon

Carbon Price Forecasting with Structural Breaks: A Comparative Study of Deep Learning Models

Add code
Nov 07, 2025
Viaarxiv icon

Pathology-CoT: Learning Visual Chain-of-Thought Agent from Expert Whole Slide Image Diagnosis Behavior

Add code
Oct 06, 2025
Viaarxiv icon

FedAPM: Federated Learning via ADMM with Partial Model Personalization

Add code
Jun 05, 2025
Viaarxiv icon

HAMF: A Hybrid Attention-Mamba Framework for Joint Scene Context Understanding and Future Motion Representation Learning

Add code
May 21, 2025
Figure 1 for HAMF: A Hybrid Attention-Mamba Framework for Joint Scene Context Understanding and Future Motion Representation Learning
Figure 2 for HAMF: A Hybrid Attention-Mamba Framework for Joint Scene Context Understanding and Future Motion Representation Learning
Figure 3 for HAMF: A Hybrid Attention-Mamba Framework for Joint Scene Context Understanding and Future Motion Representation Learning
Figure 4 for HAMF: A Hybrid Attention-Mamba Framework for Joint Scene Context Understanding and Future Motion Representation Learning
Viaarxiv icon

ReactDiff: Latent Diffusion for Facial Reaction Generation

Add code
May 20, 2025
Viaarxiv icon

UniCAD: Efficient and Extendable Architecture for Multi-Task Computer-Aided Diagnosis System

Add code
May 15, 2025
Viaarxiv icon

Enhancing Speech-to-Speech Dialogue Modeling with End-to-End Retrieval-Augmented Generation

Add code
Apr 27, 2025
Figure 1 for Enhancing Speech-to-Speech Dialogue Modeling with End-to-End Retrieval-Augmented Generation
Figure 2 for Enhancing Speech-to-Speech Dialogue Modeling with End-to-End Retrieval-Augmented Generation
Figure 3 for Enhancing Speech-to-Speech Dialogue Modeling with End-to-End Retrieval-Augmented Generation
Viaarxiv icon