Picture for Tingting Gao

Tingting Gao

Compression then Matching: An Efficient Pre-training Paradigm for Multimodal Embedding

Add code
Nov 11, 2025
Viaarxiv icon

LiveStar: Live Streaming Assistant for Real-World Online Video Understanding

Add code
Nov 07, 2025
Viaarxiv icon

OneRec-V2 Technical Report

Add code
Aug 28, 2025
Figure 1 for OneRec-V2 Technical Report
Figure 2 for OneRec-V2 Technical Report
Figure 3 for OneRec-V2 Technical Report
Figure 4 for OneRec-V2 Technical Report
Viaarxiv icon

MUSE: Multi-Subject Unified Synthesis via Explicit Layout Semantic Expansion

Add code
Aug 20, 2025
Viaarxiv icon

Kwai Keye-VL Technical Report

Add code
Jul 02, 2025
Viaarxiv icon

OneRec Technical Report

Add code
Jun 16, 2025
Figure 1 for OneRec Technical Report
Figure 2 for OneRec Technical Report
Figure 3 for OneRec Technical Report
Figure 4 for OneRec Technical Report
Viaarxiv icon

ContextQFormer: A New Context Modeling Method for Multi-Turn Multi-Modal Conversations

Add code
May 29, 2025
Viaarxiv icon

Why Distillation can Outperform Zero-RL: The Role of Flexible Reasoning

Add code
May 27, 2025
Figure 1 for Why Distillation can Outperform Zero-RL: The Role of Flexible Reasoning
Figure 2 for Why Distillation can Outperform Zero-RL: The Role of Flexible Reasoning
Figure 3 for Why Distillation can Outperform Zero-RL: The Role of Flexible Reasoning
Figure 4 for Why Distillation can Outperform Zero-RL: The Role of Flexible Reasoning
Viaarxiv icon

GODBench: A Benchmark for Multimodal Large Language Models in Video Comment Art

Add code
May 16, 2025
Viaarxiv icon

R1-Reward: Training Multimodal Reward Model Through Stable Reinforcement Learning

Add code
May 05, 2025
Viaarxiv icon