Picture for Xin Cai

Xin Cai

AnyRecon: Arbitrary-View 3D Reconstruction with Video Diffusion Model

Add code
Apr 21, 2026
Viaarxiv icon

DA-VAE: Plug-in Latent Compression for Diffusion via Detail Alignment

Add code
Mar 23, 2026
Viaarxiv icon

MindDriver: Introducing Progressive Multimodal Reasoning for Autonomous Driving

Add code
Feb 25, 2026
Viaarxiv icon

FlashVSR: Towards Real-Time Diffusion-Based Streaming Video Super-Resolution

Add code
Oct 14, 2025
Figure 1 for FlashVSR: Towards Real-Time Diffusion-Based Streaming Video Super-Resolution
Figure 2 for FlashVSR: Towards Real-Time Diffusion-Based Streaming Video Super-Resolution
Figure 3 for FlashVSR: Towards Real-Time Diffusion-Based Streaming Video Super-Resolution
Figure 4 for FlashVSR: Towards Real-Time Diffusion-Based Streaming Video Super-Resolution
Viaarxiv icon

RubikSQL: Lifelong Learning Agentic Knowledge Base as an Industrial NL2SQL System

Add code
Aug 25, 2025
Figure 1 for RubikSQL: Lifelong Learning Agentic Knowledge Base as an Industrial NL2SQL System
Figure 2 for RubikSQL: Lifelong Learning Agentic Knowledge Base as an Industrial NL2SQL System
Figure 3 for RubikSQL: Lifelong Learning Agentic Knowledge Base as an Industrial NL2SQL System
Figure 4 for RubikSQL: Lifelong Learning Agentic Knowledge Base as an Industrial NL2SQL System
Viaarxiv icon

LoRA-Edit: Controllable First-Frame-Guided Video Editing via Mask-Aware LoRA Fine-Tuning

Add code
Jun 11, 2025
Viaarxiv icon

One Framework to Rule Them All: Unifying RL-Based and RL-Free Methods in RLHF

Add code
Mar 26, 2025
Figure 1 for One Framework to Rule Them All: Unifying RL-Based and RL-Free Methods in RLHF
Viaarxiv icon

UltraFusion: Ultra High Dynamic Imaging using Exposure Fusion

Add code
Jan 20, 2025
Figure 1 for UltraFusion: Ultra High Dynamic Imaging using Exposure Fusion
Figure 2 for UltraFusion: Ultra High Dynamic Imaging using Exposure Fusion
Figure 3 for UltraFusion: Ultra High Dynamic Imaging using Exposure Fusion
Figure 4 for UltraFusion: Ultra High Dynamic Imaging using Exposure Fusion
Viaarxiv icon

Teaching Large Language Models to Regress Accurate Image Quality Scores using Score Distribution

Add code
Jan 20, 2025
Figure 1 for Teaching Large Language Models to Regress Accurate Image Quality Scores using Score Distribution
Figure 2 for Teaching Large Language Models to Regress Accurate Image Quality Scores using Score Distribution
Figure 3 for Teaching Large Language Models to Regress Accurate Image Quality Scores using Score Distribution
Figure 4 for Teaching Large Language Models to Regress Accurate Image Quality Scores using Score Distribution
Viaarxiv icon

DetailGen3D: Generative 3D Geometry Enhancement via Data-Dependent Flow

Add code
Nov 25, 2024
Viaarxiv icon