Picture for Hai Ci

Hai Ci

LumosX: Relate Any Identities with Their Attributes for Personalized Video Generation

Add code
Mar 20, 2026
Viaarxiv icon

Less Data, Faster Convergence: Goal-Driven Data Optimization for Multimodal Instruction Tuning

Add code
Mar 12, 2026
Viaarxiv icon

World-VLA-Loop: Closed-Loop Learning of Video World Model and VLA Policy

Add code
Feb 06, 2026
Viaarxiv icon

H2R-Grounder: A Paired-Data-Free Paradigm for Translating Human Interaction Videos into Physically Grounded Robot Videos

Add code
Dec 10, 2025
Figure 1 for H2R-Grounder: A Paired-Data-Free Paradigm for Translating Human Interaction Videos into Physically Grounded Robot Videos
Figure 2 for H2R-Grounder: A Paired-Data-Free Paradigm for Translating Human Interaction Videos into Physically Grounded Robot Videos
Figure 3 for H2R-Grounder: A Paired-Data-Free Paradigm for Translating Human Interaction Videos into Physically Grounded Robot Videos
Figure 4 for H2R-Grounder: A Paired-Data-Free Paradigm for Translating Human Interaction Videos into Physically Grounded Robot Videos
Viaarxiv icon

OptMark: Robust Multi-bit Diffusion Watermarking via Inference Time Optimization

Add code
Aug 29, 2025
Figure 1 for OptMark: Robust Multi-bit Diffusion Watermarking via Inference Time Optimization
Figure 2 for OptMark: Robust Multi-bit Diffusion Watermarking via Inference Time Optimization
Figure 3 for OptMark: Robust Multi-bit Diffusion Watermarking via Inference Time Optimization
Figure 4 for OptMark: Robust Multi-bit Diffusion Watermarking via Inference Time Optimization
Viaarxiv icon

macOSWorld: A Multilingual Interactive Benchmark for GUI Agents

Add code
Jun 05, 2025
Figure 1 for macOSWorld: A Multilingual Interactive Benchmark for GUI Agents
Figure 2 for macOSWorld: A Multilingual Interactive Benchmark for GUI Agents
Figure 3 for macOSWorld: A Multilingual Interactive Benchmark for GUI Agents
Figure 4 for macOSWorld: A Multilingual Interactive Benchmark for GUI Agents
Viaarxiv icon

Cyc3D: Fine-grained Controllable 3D Generation via Cycle Consistency Regularization

Add code
Apr 21, 2025
Figure 1 for Cyc3D: Fine-grained Controllable 3D Generation via Cycle Consistency Regularization
Figure 2 for Cyc3D: Fine-grained Controllable 3D Generation via Cycle Consistency Regularization
Figure 3 for Cyc3D: Fine-grained Controllable 3D Generation via Cycle Consistency Regularization
Figure 4 for Cyc3D: Fine-grained Controllable 3D Generation via Cycle Consistency Regularization
Viaarxiv icon

Impossible Videos

Add code
Mar 18, 2025
Figure 1 for Impossible Videos
Figure 2 for Impossible Videos
Figure 3 for Impossible Videos
Figure 4 for Impossible Videos
Viaarxiv icon

In-Context Defense in Computer Agents: An Empirical Study

Add code
Mar 12, 2025
Viaarxiv icon

LongViTU: Instruction Tuning for Long-Form Video Understanding

Add code
Jan 09, 2025
Figure 1 for LongViTU: Instruction Tuning for Long-Form Video Understanding
Figure 2 for LongViTU: Instruction Tuning for Long-Form Video Understanding
Figure 3 for LongViTU: Instruction Tuning for Long-Form Video Understanding
Figure 4 for LongViTU: Instruction Tuning for Long-Form Video Understanding
Viaarxiv icon