Picture for Tianheng Cheng

Tianheng Cheng

Cross-Layer Attentive Feature Upsampling for Low-latency Semantic Segmentation

Add code
Jan 03, 2026
Viaarxiv icon

Seedance 1.5 pro: A Native Audio-Visual Joint Generation Foundation Model

Add code
Dec 23, 2025
Viaarxiv icon

LENS: Learning to Segment Anything with Unified Reinforced Reasoning

Add code
Aug 19, 2025
Viaarxiv icon

Seed1.5-VL Technical Report

Add code
May 11, 2025
Figure 1 for Seed1.5-VL Technical Report
Figure 2 for Seed1.5-VL Technical Report
Figure 3 for Seed1.5-VL Technical Report
Figure 4 for Seed1.5-VL Technical Report
Viaarxiv icon

GroundingSuite: Measuring Complex Multi-Granular Pixel Grounding

Add code
Mar 13, 2025
Viaarxiv icon

Multimodal Mamba: Decoder-only Multimodal State Space Model via Quadratic to Linear Distillation

Add code
Feb 18, 2025
Viaarxiv icon

GaussTR: Foundation Model-Aligned Gaussian Transformer for Self-Supervised 3D Spatial Understanding

Add code
Dec 17, 2024
Viaarxiv icon

Mask-Adapter: The Devil is in the Masks for Open-Vocabulary Segmentation

Add code
Dec 05, 2024
Viaarxiv icon

ControlAR: Controllable Image Generation with Autoregressive Models

Add code
Oct 03, 2024
Figure 1 for ControlAR: Controllable Image Generation with Autoregressive Models
Figure 2 for ControlAR: Controllable Image Generation with Autoregressive Models
Figure 3 for ControlAR: Controllable Image Generation with Autoregressive Models
Figure 4 for ControlAR: Controllable Image Generation with Autoregressive Models
Viaarxiv icon

Occupancy as Set of Points

Add code
Jul 04, 2024
Figure 1 for Occupancy as Set of Points
Figure 2 for Occupancy as Set of Points
Figure 3 for Occupancy as Set of Points
Figure 4 for Occupancy as Set of Points
Viaarxiv icon