Picture for Kaiyu Tang

Kaiyu Tang

VideoTemp-o3: Harmonizing Temporal Grounding and Video Understanding in Agentic Thinking-with-Videos

Add code
Feb 08, 2026
Viaarxiv icon

Joint Reward Modeling: Internalizing Chain-of-Thought for Efficient Visual Reward Models

Add code
Feb 07, 2026
Viaarxiv icon

SpatialReward: Bridging the Perception Gap in Online RL for Image Editing via Explicit Spatial Reasoning

Add code
Feb 07, 2026
Viaarxiv icon

Kwai Keye-VL Technical Report

Add code
Jul 02, 2025
Viaarxiv icon

A Closer Look on Memorization in Tabular Diffusion Model: A Data-Centric Perspective

Add code
May 28, 2025
Viaarxiv icon

R1-Reward: Training Multimodal Reward Model Through Stable Reinforcement Learning

Add code
May 05, 2025
Viaarxiv icon

VLM as Policy: Common-Law Content Moderation Framework for Short Video Platform

Add code
Apr 21, 2025
Viaarxiv icon

Kwai-STaR: Transform LLMs into State-Transition Reasoners

Add code
Nov 07, 2024
Figure 1 for Kwai-STaR: Transform LLMs into State-Transition Reasoners
Figure 2 for Kwai-STaR: Transform LLMs into State-Transition Reasoners
Figure 3 for Kwai-STaR: Transform LLMs into State-Transition Reasoners
Figure 4 for Kwai-STaR: Transform LLMs into State-Transition Reasoners
Viaarxiv icon