Picture for Shaofeng Zhang

Shaofeng Zhang

Repulsor: Accelerating Generative Modeling with a Contrastive Memory Bank

Add code
Dec 13, 2025
Viaarxiv icon

Dual-Branch Center-Surrounding Contrast: Rethinking Contrastive Learning for 3D Point Clouds

Add code
Dec 09, 2025
Viaarxiv icon

Denoising Vision Transformer Autoencoder with Spectral Self-Regularization

Add code
Nov 16, 2025
Viaarxiv icon

VideoREPA: Learning Physics for Video Generation through Relational Alignment with Foundation Models

Add code
May 29, 2025
Viaarxiv icon

Hunyuan-TurboS: Advancing Large Language Models through Mamba-Transformer Synergy and Adaptive Chain-of-Thought

Add code
May 21, 2025
Viaarxiv icon

Point2RBox-v2: Rethinking Point-supervised Oriented Object Detection with Spatial Layout Among Instances

Add code
Feb 07, 2025
Figure 1 for Point2RBox-v2: Rethinking Point-supervised Oriented Object Detection with Spatial Layout Among Instances
Figure 2 for Point2RBox-v2: Rethinking Point-supervised Oriented Object Detection with Spatial Layout Among Instances
Figure 3 for Point2RBox-v2: Rethinking Point-supervised Oriented Object Detection with Spatial Layout Among Instances
Figure 4 for Point2RBox-v2: Rethinking Point-supervised Oriented Object Detection with Spatial Layout Among Instances
Viaarxiv icon

Motion Control for Enhanced Complex Action Video Generation

Add code
Nov 13, 2024
Figure 1 for Motion Control for Enhanced Complex Action Video Generation
Figure 2 for Motion Control for Enhanced Complex Action Video Generation
Figure 3 for Motion Control for Enhanced Complex Action Video Generation
Figure 4 for Motion Control for Enhanced Complex Action Video Generation
Viaarxiv icon

Exploiting Unlabeled Data with Multiple Expert Teachers for Open Vocabulary Aerial Object Detection and Its Orientation Adaptation

Add code
Nov 04, 2024
Figure 1 for Exploiting Unlabeled Data with Multiple Expert Teachers for Open Vocabulary Aerial Object Detection and Its Orientation Adaptation
Figure 2 for Exploiting Unlabeled Data with Multiple Expert Teachers for Open Vocabulary Aerial Object Detection and Its Orientation Adaptation
Figure 3 for Exploiting Unlabeled Data with Multiple Expert Teachers for Open Vocabulary Aerial Object Detection and Its Orientation Adaptation
Figure 4 for Exploiting Unlabeled Data with Multiple Expert Teachers for Open Vocabulary Aerial Object Detection and Its Orientation Adaptation
Viaarxiv icon

PCP-MAE: Learning to Predict Centers for Point Masked Autoencoders

Add code
Aug 16, 2024
Viaarxiv icon

ChronoMagic-Bench: A Benchmark for Metamorphic Evaluation of Text-to-Time-lapse Video Generation

Add code
Jun 26, 2024
Figure 1 for ChronoMagic-Bench: A Benchmark for Metamorphic Evaluation of Text-to-Time-lapse Video Generation
Figure 2 for ChronoMagic-Bench: A Benchmark for Metamorphic Evaluation of Text-to-Time-lapse Video Generation
Figure 3 for ChronoMagic-Bench: A Benchmark for Metamorphic Evaluation of Text-to-Time-lapse Video Generation
Figure 4 for ChronoMagic-Bench: A Benchmark for Metamorphic Evaluation of Text-to-Time-lapse Video Generation
Viaarxiv icon