Picture for Shiyao Wang

Shiyao Wang

MISS: Multi-Modal Tree Indexing and Searching with Lifelong Sequential Behavior for Retrieval Recommendation

Add code
Aug 20, 2025
Viaarxiv icon

Kwai Keye-VL Technical Report

Add code
Jul 02, 2025
Viaarxiv icon

Kling-Foley: Multimodal Diffusion Transformer for High-Quality Video-to-Audio Generation

Add code
Jun 24, 2025
Viaarxiv icon

OneRec Technical Report

Add code
Jun 16, 2025
Viaarxiv icon

Chinese-LiPS: A Chinese audio-visual speech recognition dataset with Lip-reading and Presentation Slides

Add code
Apr 21, 2025
Viaarxiv icon

SeniorTalk: A Chinese Conversation Dataset with Rich Annotations for Super-Aged Seniors

Add code
Mar 20, 2025
Viaarxiv icon

CS-Dialogue: A 104-Hour Dataset of Spontaneous Mandarin-English Code-Switching Dialogues for Speech Recognition

Add code
Feb 26, 2025
Viaarxiv icon

OneRec: Unifying Retrieve and Rank with Generative Recommender and Iterative Preference Alignment

Add code
Feb 26, 2025
Viaarxiv icon

Is AI Robust Enough for Scientific Research?

Add code
Dec 19, 2024
Figure 1 for Is AI Robust Enough for Scientific Research?
Figure 2 for Is AI Robust Enough for Scientific Research?
Figure 3 for Is AI Robust Enough for Scientific Research?
Figure 4 for Is AI Robust Enough for Scientific Research?
Viaarxiv icon

QARM: Quantitative Alignment Multi-Modal Recommendation at Kuaishou

Add code
Nov 18, 2024
Figure 1 for QARM: Quantitative Alignment Multi-Modal Recommendation at Kuaishou
Figure 2 for QARM: Quantitative Alignment Multi-Modal Recommendation at Kuaishou
Figure 3 for QARM: Quantitative Alignment Multi-Modal Recommendation at Kuaishou
Figure 4 for QARM: Quantitative Alignment Multi-Modal Recommendation at Kuaishou
Viaarxiv icon