Chong Luo

ReasonGen-R1: CoT for Autoregressive Image generation models through SFT and RL

May 30, 2025
Via arXiv

ViaRL: Adaptive Temporal Grounding via Visual Iterated Amplification Reinforcement Learning

May 21, 2025
Via arXiv

Efficient RL Training for Reasoning Models via Length-Aware Optimization

May 18, 2025
Via arXiv

JointDiT: Enhancing RGB-Depth Joint Modeling with Diffusion Transformers

May 01, 2025
Via arXiv

Subject-driven Video Generation via Disentangled Identity and Motion

Apr 23, 2025
Via arXiv

HiTVideo: Hierarchical Tokenizers for Enhancing Text-to-Video Generation with Autoregressive Large Language Models

Mar 14, 2025
Via arXiv

Phi-4-Mini Technical Report: Compact yet Powerful Multimodal Language Models via Mixture-of-LoRAs

Mar 03, 2025
Via arXiv

Logic-RL: Unleashing LLM Reasoning with Rule-Based Reinforcement Learning

Feb 20, 2025
Via arXiv

FloVD: Optical Flow Meets Video Diffusion Model for Enhanced Camera-Controlled Video Synthesis

Feb 12, 2025
Via arXiv

MageBench: Bridging Large Multimodal Models to Agents

Dec 05, 2024
Via arXiv