Kai Tian

MARS2 2025 Challenge on Multimodal Reasoning: Datasets, Methods, Results, Discussion, and Outlook

Sep 17, 2025
Via arXiv

AdsQA: Towards Advertisement Video Understanding

Sep 10, 2025
Via arXiv

A Survey of Reinforcement Learning for Large Reasoning Models

Sep 10, 2025
Via arXiv

ReviewRL: Towards Automated Scientific Review with RL

Aug 14, 2025
Via arXiv

Code-Vision: Evaluating Multimodal LLMs Logic Understanding and Code Generation Capabilities

Feb 17, 2025
Via arXiv

Generative Regression Based Watch Time Prediction for Video Recommendation: Model and Performance

Dec 28, 2024
Via arXiv

Large Language Models as Biomedical Hypothesis Generators: A Comprehensive Evaluation

Jul 12, 2024
Via arXiv

Intuitive Fine-Tuning: Towards Unifying SFT and RLHF into a Single Process

May 20, 2024
Via arXiv

Large Language Models are Zero Shot Hypothesis Proposers

Nov 10, 2023
Via arXiv

Social Interaction-Aware Dynamical Models and Decision Making for Autonomous Vehicles

Oct 31, 2023
Via arXiv