Picture for Jian Yang

Jian Yang

additional authors not shown

Exploiting Multimodal Spatial-temporal Patterns for Video Object Tracking

Add code
Dec 20, 2024
Figure 1 for Exploiting Multimodal Spatial-temporal Patterns for Video Object Tracking
Figure 2 for Exploiting Multimodal Spatial-temporal Patterns for Video Object Tracking
Figure 3 for Exploiting Multimodal Spatial-temporal Patterns for Video Object Tracking
Figure 4 for Exploiting Multimodal Spatial-temporal Patterns for Video Object Tracking
Viaarxiv icon

Qwen2.5 Technical Report

Add code
Dec 19, 2024
Figure 1 for Qwen2.5 Technical Report
Figure 2 for Qwen2.5 Technical Report
Figure 3 for Qwen2.5 Technical Report
Figure 4 for Qwen2.5 Technical Report
Viaarxiv icon

Pre-training a Density-Aware Pose Transformer for Robust LiDAR-based 3D Human Pose Estimation

Add code
Dec 18, 2024
Figure 1 for Pre-training a Density-Aware Pose Transformer for Robust LiDAR-based 3D Human Pose Estimation
Figure 2 for Pre-training a Density-Aware Pose Transformer for Robust LiDAR-based 3D Human Pose Estimation
Figure 3 for Pre-training a Density-Aware Pose Transformer for Robust LiDAR-based 3D Human Pose Estimation
Figure 4 for Pre-training a Density-Aware Pose Transformer for Robust LiDAR-based 3D Human Pose Estimation
Viaarxiv icon

ExecRepoBench: Multi-level Executable Code Completion Evaluation

Add code
Dec 16, 2024
Figure 1 for ExecRepoBench: Multi-level Executable Code Completion Evaluation
Figure 2 for ExecRepoBench: Multi-level Executable Code Completion Evaluation
Figure 3 for ExecRepoBench: Multi-level Executable Code Completion Evaluation
Figure 4 for ExecRepoBench: Multi-level Executable Code Completion Evaluation
Viaarxiv icon

StrandHead: Text to Strand-Disentangled 3D Head Avatars Using Hair Geometric Priors

Add code
Dec 16, 2024
Figure 1 for StrandHead: Text to Strand-Disentangled 3D Head Avatars Using Hair Geometric Priors
Figure 2 for StrandHead: Text to Strand-Disentangled 3D Head Avatars Using Hair Geometric Priors
Figure 3 for StrandHead: Text to Strand-Disentangled 3D Head Avatars Using Hair Geometric Priors
Figure 4 for StrandHead: Text to Strand-Disentangled 3D Head Avatars Using Hair Geometric Priors
Viaarxiv icon

Depth-Centric Dehazing and Depth-Estimation from Real-World Hazy Driving Video

Add code
Dec 16, 2024
Figure 1 for Depth-Centric Dehazing and Depth-Estimation from Real-World Hazy Driving Video
Figure 2 for Depth-Centric Dehazing and Depth-Estimation from Real-World Hazy Driving Video
Figure 3 for Depth-Centric Dehazing and Depth-Estimation from Real-World Hazy Driving Video
Figure 4 for Depth-Centric Dehazing and Depth-Estimation from Real-World Hazy Driving Video
Viaarxiv icon

InstanceCap: Improving Text-to-Video Generation via Instance-aware Structured Caption

Add code
Dec 12, 2024
Figure 1 for InstanceCap: Improving Text-to-Video Generation via Instance-aware Structured Caption
Figure 2 for InstanceCap: Improving Text-to-Video Generation via Instance-aware Structured Caption
Figure 3 for InstanceCap: Improving Text-to-Video Generation via Instance-aware Structured Caption
Figure 4 for InstanceCap: Improving Text-to-Video Generation via Instance-aware Structured Caption
Viaarxiv icon

Agent-based Video Trimming

Add code
Dec 12, 2024
Figure 1 for Agent-based Video Trimming
Figure 2 for Agent-based Video Trimming
Figure 3 for Agent-based Video Trimming
Figure 4 for Agent-based Video Trimming
Viaarxiv icon

ATPrompt: Textual Prompt Learning with Embedded Attributes

Add code
Dec 12, 2024
Figure 1 for ATPrompt: Textual Prompt Learning with Embedded Attributes
Figure 2 for ATPrompt: Textual Prompt Learning with Embedded Attributes
Figure 3 for ATPrompt: Textual Prompt Learning with Embedded Attributes
Figure 4 for ATPrompt: Textual Prompt Learning with Embedded Attributes
Viaarxiv icon

EmoDubber: Towards High Quality and Emotion Controllable Movie Dubbing

Add code
Dec 12, 2024
Figure 1 for EmoDubber: Towards High Quality and Emotion Controllable Movie Dubbing
Figure 2 for EmoDubber: Towards High Quality and Emotion Controllable Movie Dubbing
Figure 3 for EmoDubber: Towards High Quality and Emotion Controllable Movie Dubbing
Figure 4 for EmoDubber: Towards High Quality and Emotion Controllable Movie Dubbing
Viaarxiv icon