Picture for Tingting Gao

Tingting Gao

OneRec-V2 Technical Report

Add code
Aug 28, 2025
Viaarxiv icon

MUSE: Multi-Subject Unified Synthesis via Explicit Layout Semantic Expansion

Add code
Aug 20, 2025
Viaarxiv icon

Kwai Keye-VL Technical Report

Add code
Jul 02, 2025
Viaarxiv icon

OneRec Technical Report

Add code
Jun 16, 2025
Viaarxiv icon

ContextQFormer: A New Context Modeling Method for Multi-Turn Multi-Modal Conversations

Add code
May 29, 2025
Viaarxiv icon

Why Distillation can Outperform Zero-RL: The Role of Flexible Reasoning

Add code
May 27, 2025
Viaarxiv icon

GODBench: A Benchmark for Multimodal Large Language Models in Video Comment Art

Add code
May 16, 2025
Viaarxiv icon

R1-Reward: Training Multimodal Reward Model Through Stable Reinforcement Learning

Add code
May 05, 2025
Viaarxiv icon

SeriesBench: A Benchmark for Narrative-Driven Drama Series Understanding

Add code
Apr 30, 2025
Viaarxiv icon

VLM as Policy: Common-Law Content Moderation Framework for Short Video Platform

Add code
Apr 21, 2025
Viaarxiv icon