Picture for Shuai Wang

Shuai Wang

The Hong Kong University of Science and Technology

VLM-CAD: VLM-Optimized Collaborative Agent Design Workflow for Analog Circuit Sizing

Add code
Jan 12, 2026
Viaarxiv icon

The ICASSP 2026 Automatic Song Aesthetics Evaluation Challenge

Add code
Jan 12, 2026
Viaarxiv icon

NextFlow: Unified Sequential Modeling Activates Multimodal Understanding and Generation

Add code
Jan 05, 2026
Viaarxiv icon

Physically-Grounded Manifold Projection Model for Generalizable Metal Artifact Reduction in Dental CBCT

Add code
Jan 01, 2026
Viaarxiv icon

RepetitionCurse: Measuring and Understanding Router Imbalance in Mixture-of-Experts LLMs under DoS Stress

Add code
Dec 30, 2025
Viaarxiv icon

Physically-Grounded Manifold Projection with Foundation Priors for Metal Artifact Reduction in Dental CBCT

Add code
Dec 30, 2025
Viaarxiv icon

USE: A Unified Model for Universal Sound Separation and Extraction

Add code
Dec 24, 2025
Viaarxiv icon

Seedance 1.5 pro: A Native Audio-Visual Joint Generation Foundation Model

Add code
Dec 23, 2025
Viaarxiv icon

What Does the Speaker Embedding Encode?

Add code
Dec 20, 2025
Figure 1 for What Does the Speaker Embedding Encode?
Figure 2 for What Does the Speaker Embedding Encode?
Figure 3 for What Does the Speaker Embedding Encode?
Figure 4 for What Does the Speaker Embedding Encode?
Viaarxiv icon

Behavior Tokens Speak Louder: Disentangled Explainable Recommendation with Behavior Vocabulary

Add code
Dec 17, 2025
Figure 1 for Behavior Tokens Speak Louder: Disentangled Explainable Recommendation with Behavior Vocabulary
Figure 2 for Behavior Tokens Speak Louder: Disentangled Explainable Recommendation with Behavior Vocabulary
Figure 3 for Behavior Tokens Speak Louder: Disentangled Explainable Recommendation with Behavior Vocabulary
Figure 4 for Behavior Tokens Speak Louder: Disentangled Explainable Recommendation with Behavior Vocabulary
Viaarxiv icon