Picture for Siyuan Wang

Siyuan Wang

Fudan University

HardcoreLogic: Challenging Large Reasoning Models with Long-tail Logic Puzzle Games

Add code
Oct 14, 2025
Viaarxiv icon

SMILE: SeMantic Ids Enhanced CoLd Item Representation for Click-through Rate Prediction in E-commerce SEarch

Add code
Oct 14, 2025
Viaarxiv icon

Marine Chlorophyll Prediction and Driver Analysis based on LSTM-RF Hybrid Models

Add code
Aug 07, 2025
Viaarxiv icon

Autoregressive Semantic Visual Reconstruction Helps VLMs Understand Better

Add code
Jun 10, 2025
Viaarxiv icon

OneSug: The Unified End-to-End Generative Framework for E-commerce Query Suggestion

Add code
Jun 07, 2025
Viaarxiv icon

MMGT: Motion Mask Guided Two-Stage Network for Co-Speech Gesture Video Generation

Add code
May 29, 2025
Viaarxiv icon

AutoJudger: An Agent-Driven Framework for Efficient Benchmarking of MLLMs

Add code
May 27, 2025
Viaarxiv icon

Not All Models Suit Expert Offloading: On Local Routing Consistency of Mixture-of-Expert Models

Add code
May 21, 2025
Viaarxiv icon

OViP: Online Vision-Language Preference Learning

Add code
May 21, 2025
Figure 1 for OViP: Online Vision-Language Preference Learning
Figure 2 for OViP: Online Vision-Language Preference Learning
Figure 3 for OViP: Online Vision-Language Preference Learning
Figure 4 for OViP: Online Vision-Language Preference Learning
Viaarxiv icon

VGDFR: Diffusion-based Video Generation with Dynamic Latent Frame Rate

Add code
Apr 16, 2025
Figure 1 for VGDFR: Diffusion-based Video Generation with Dynamic Latent Frame Rate
Figure 2 for VGDFR: Diffusion-based Video Generation with Dynamic Latent Frame Rate
Figure 3 for VGDFR: Diffusion-based Video Generation with Dynamic Latent Frame Rate
Figure 4 for VGDFR: Diffusion-based Video Generation with Dynamic Latent Frame Rate
Viaarxiv icon