Hai Ci

H2R-Grounder: A Paired-Data-Free Paradigm for Translating Human Interaction Videos into Physically Grounded Robot Videos

Dec 10, 2025

OptMark: Robust Multi-bit Diffusion Watermarking via Inference Time Optimization

Aug 29, 2025

macOSWorld: A Multilingual Interactive Benchmark for GUI Agents

Jun 05, 2025

Cyc3D: Fine-grained Controllable 3D Generation via Cycle Consistency Regularization

Apr 21, 2025

Impossible Videos

Mar 18, 2025

In-Context Defense in Computer Agents: An Empirical Study

Mar 12, 2025

LongViTU: Instruction Tuning for Long-Form Video Understanding

Jan 09, 2025

UnrealZoo: Enriching Photo-realistic Virtual Worlds for Embodied AI

Dec 30, 2024

IDProtector: An Adversarial Noise Encoder to Protect Against ID-Preserving Image Generation

Dec 16, 2024

Anti-Reference: Universal and Immediate Defense Against Reference-Based Generation

Dec 08, 2024