Picture for Deng Cai

Deng Cai

Anchor Forcing: Anchor Memory and Tri-Region RoPE for Interactive Streaming Video Diffusion

Add code
Mar 12, 2026
Viaarxiv icon

GeoSeg: Training-Free Reasoning-Driven Segmentation in Remote Sensing Imagery

Add code
Mar 04, 2026
Viaarxiv icon

RePO: Bridging On-Policy Learning and Off-Policy Knowledge through Rephrasing Policy Optimization

Add code
Feb 11, 2026
Viaarxiv icon

SNR-Edit: Structure-Aware Noise Rectification for Inversion-Free Flow-Based Editing

Add code
Jan 27, 2026
Viaarxiv icon

ThinkRL-Edit: Thinking in Reinforcement Learning for Reasoning-Centric Image Editing

Add code
Jan 06, 2026
Viaarxiv icon

TokenSqueeze: Performance-Preserving Compression for Reasoning LLMs

Add code
Nov 17, 2025
Viaarxiv icon

The End of Manual Decoding: Towards Truly End-to-End Language Models

Add code
Oct 30, 2025
Figure 1 for The End of Manual Decoding: Towards Truly End-to-End Language Models
Figure 2 for The End of Manual Decoding: Towards Truly End-to-End Language Models
Figure 3 for The End of Manual Decoding: Towards Truly End-to-End Language Models
Figure 4 for The End of Manual Decoding: Towards Truly End-to-End Language Models
Viaarxiv icon

Enhancing Spatial Reasoning through Visual and Textual Thinking

Add code
Jul 28, 2025
Viaarxiv icon

SeqPE: Transformer with Sequential Position Encoding

Add code
Jun 16, 2025
Viaarxiv icon

GeoCAD: Local Geometry-Controllable CAD Generation

Add code
Jun 12, 2025
Figure 1 for GeoCAD: Local Geometry-Controllable CAD Generation
Figure 2 for GeoCAD: Local Geometry-Controllable CAD Generation
Figure 3 for GeoCAD: Local Geometry-Controllable CAD Generation
Figure 4 for GeoCAD: Local Geometry-Controllable CAD Generation
Viaarxiv icon