Shuang Yang

DBMIF: a deep balanced multimodal iterative fusion framework for air- and bone-conduction speech enhancement

Mar 03, 2026

MaRI: Accelerating Ranking Model Inference via Structural Re-parameterization in Large Scale Recommendation System

Feb 26, 2026

From Agnostic to Specific: Latent Preference Diffusion for Multi-Behavior Sequential Recommendation

Feb 26, 2026

SARM: LLM-Augmented Semantic Anchor for End-to-End Live-Streaming Ranking

Feb 10, 2026

OneLive: Dynamically Unified Generative Framework for Live-Streaming Recommendation

Feb 09, 2026

QARM V2: Quantitative Alignment Multi-Modal Recommendation for Reasoning User Sequence Modeling

Feb 09, 2026

UniRec: Unified Multimodal Encoding for LLM-Based Recommendations

Jan 27, 2026

ArchPilot: A Proxy-Guided Multi-Agent Approach for Machine Learning Engineering

Nov 06, 2025

SAIL-Embedding Technical Report: Omni-modal Embedding Foundation Model

Oct 14, 2025

GLip: A Global-Local Integrated Progressive Framework for Robust Visual Speech Recognition

Sep 19, 2025