Picture for Shengeng Tang

Shengeng Tang

OmniVL-Guard: Towards Unified Vision-Language Forgery Detection and Grounding via Balanced RL

Add code
Feb 12, 2026
Viaarxiv icon

Accelerating Controllable Generation via Hybrid-grained Cache

Add code
Nov 14, 2025
Viaarxiv icon

Open-World 3D Scene Graph Generation for Retrieval-Augmented Reasoning

Add code
Nov 08, 2025
Figure 1 for Open-World 3D Scene Graph Generation for Retrieval-Augmented Reasoning
Figure 2 for Open-World 3D Scene Graph Generation for Retrieval-Augmented Reasoning
Figure 3 for Open-World 3D Scene Graph Generation for Retrieval-Augmented Reasoning
Figure 4 for Open-World 3D Scene Graph Generation for Retrieval-Augmented Reasoning
Viaarxiv icon

SLRTP2025 Sign Language Production Challenge: Methodology, Results, and Future Work

Add code
Aug 09, 2025
Viaarxiv icon

SplitGaussian: Reconstructing Dynamic Scenes via Visual Geometry Decomposition

Add code
Aug 06, 2025
Figure 1 for SplitGaussian: Reconstructing Dynamic Scenes via Visual Geometry Decomposition
Figure 2 for SplitGaussian: Reconstructing Dynamic Scenes via Visual Geometry Decomposition
Figure 3 for SplitGaussian: Reconstructing Dynamic Scenes via Visual Geometry Decomposition
Figure 4 for SplitGaussian: Reconstructing Dynamic Scenes via Visual Geometry Decomposition
Viaarxiv icon

Motion is the Choreographer: Learning Latent Pose Dynamics for Seamless Sign Language Generation

Add code
Aug 06, 2025
Viaarxiv icon

StgcDiff: Spatial-Temporal Graph Condition Diffusion for Sign Language Transition Generation

Add code
Jun 16, 2025
Viaarxiv icon

Towards Fine-Grained Emotion Understanding via Skeleton-Based Micro-Gesture Recognition

Add code
Jun 15, 2025
Viaarxiv icon

SignAligner: Harmonizing Complementary Pose Modalities for Coherent Sign Language Generation

Add code
Jun 13, 2025
Viaarxiv icon

Wi-CBR: WiFi-based Cross-domain Behavior Recognition via Multimodal Collaborative Awareness

Add code
Jun 13, 2025
Viaarxiv icon