Picture for Shuai Wang

Shuai Wang

The Hong Kong University of Science and Technology

Behavior Tokens Speak Louder: Disentangled Explainable Recommendation with Behavior Vocabulary

Add code
Dec 17, 2025
Figure 1 for Behavior Tokens Speak Louder: Disentangled Explainable Recommendation with Behavior Vocabulary
Figure 2 for Behavior Tokens Speak Louder: Disentangled Explainable Recommendation with Behavior Vocabulary
Figure 3 for Behavior Tokens Speak Louder: Disentangled Explainable Recommendation with Behavior Vocabulary
Figure 4 for Behavior Tokens Speak Louder: Disentangled Explainable Recommendation with Behavior Vocabulary
Viaarxiv icon

Diffusion Language Model Inference with Monte Carlo Tree Search

Add code
Dec 13, 2025
Viaarxiv icon

Exploring Spatial-Temporal Representation via Star Graph for mmWave Radar-based Human Activity Recognition

Add code
Dec 12, 2025
Figure 1 for Exploring Spatial-Temporal Representation via Star Graph for mmWave Radar-based Human Activity Recognition
Figure 2 for Exploring Spatial-Temporal Representation via Star Graph for mmWave Radar-based Human Activity Recognition
Figure 3 for Exploring Spatial-Temporal Representation via Star Graph for mmWave Radar-based Human Activity Recognition
Figure 4 for Exploring Spatial-Temporal Representation via Star Graph for mmWave Radar-based Human Activity Recognition
Viaarxiv icon

Taxonomy, Evaluation and Exploitation of IPI-Centric LLM Agent Defense Frameworks

Add code
Nov 19, 2025
Viaarxiv icon

Learning Representation and Synergy Invariances: A Povable Framework for Generalized Multimodal Face Anti-Spoofing

Add code
Nov 18, 2025
Figure 1 for Learning Representation and Synergy Invariances: A Povable Framework for Generalized Multimodal Face Anti-Spoofing
Figure 2 for Learning Representation and Synergy Invariances: A Povable Framework for Generalized Multimodal Face Anti-Spoofing
Figure 3 for Learning Representation and Synergy Invariances: A Povable Framework for Generalized Multimodal Face Anti-Spoofing
Figure 4 for Learning Representation and Synergy Invariances: A Povable Framework for Generalized Multimodal Face Anti-Spoofing
Viaarxiv icon

BSO: Binary Spiking Online Optimization Algorithm

Add code
Nov 16, 2025
Viaarxiv icon

Scaling Law Analysis in Federated Learning: How to Select the Optimal Model Size?

Add code
Nov 15, 2025
Viaarxiv icon

Time-Layer Adaptive Alignment for Speaker Similarity in Flow-Matching Based Zero-Shot TTS

Add code
Nov 13, 2025
Figure 1 for Time-Layer Adaptive Alignment for Speaker Similarity in Flow-Matching Based Zero-Shot TTS
Figure 2 for Time-Layer Adaptive Alignment for Speaker Similarity in Flow-Matching Based Zero-Shot TTS
Figure 3 for Time-Layer Adaptive Alignment for Speaker Similarity in Flow-Matching Based Zero-Shot TTS
Figure 4 for Time-Layer Adaptive Alignment for Speaker Similarity in Flow-Matching Based Zero-Shot TTS
Viaarxiv icon

ELEGANCE: Efficient LLM Guidance for Audio-Visual Target Speech Extraction

Add code
Nov 09, 2025
Viaarxiv icon

Planning Oriented Integrated Sensing and Communication

Add code
Oct 27, 2025
Viaarxiv icon