Picture for Xiaojun Chang

Xiaojun Chang

Generating Action-conditioned Prompts for Open-vocabulary Video Action Recognition

Add code
Dec 04, 2023
Viaarxiv icon

Disentangled Representation Learning with Transmitted Information Bottleneck

Add code
Nov 03, 2023
Figure 1 for Disentangled Representation Learning with Transmitted Information Bottleneck
Figure 2 for Disentangled Representation Learning with Transmitted Information Bottleneck
Figure 3 for Disentangled Representation Learning with Transmitted Information Bottleneck
Figure 4 for Disentangled Representation Learning with Transmitted Information Bottleneck
Viaarxiv icon

Mask Propagation for Efficient Video Semantic Segmentation

Add code
Oct 29, 2023
Figure 1 for Mask Propagation for Efficient Video Semantic Segmentation
Figure 2 for Mask Propagation for Efficient Video Semantic Segmentation
Figure 3 for Mask Propagation for Efficient Video Semantic Segmentation
Figure 4 for Mask Propagation for Efficient Video Semantic Segmentation
Viaarxiv icon

No Token Left Behind: Efficient Vision Transformer via Dynamic Token Idling

Add code
Oct 09, 2023
Figure 1 for No Token Left Behind: Efficient Vision Transformer via Dynamic Token Idling
Figure 2 for No Token Left Behind: Efficient Vision Transformer via Dynamic Token Idling
Figure 3 for No Token Left Behind: Efficient Vision Transformer via Dynamic Token Idling
Figure 4 for No Token Left Behind: Efficient Vision Transformer via Dynamic Token Idling
Viaarxiv icon

PSDiff: Diffusion Model for Person Search with Iterative and Collaborative Refinement

Add code
Sep 20, 2023
Figure 1 for PSDiff: Diffusion Model for Person Search with Iterative and Collaborative Refinement
Figure 2 for PSDiff: Diffusion Model for Person Search with Iterative and Collaborative Refinement
Figure 3 for PSDiff: Diffusion Model for Person Search with Iterative and Collaborative Refinement
Figure 4 for PSDiff: Diffusion Model for Person Search with Iterative and Collaborative Refinement
Viaarxiv icon

ProAgent: Building Proactive Cooperative AI with Large Language Models

Add code
Aug 28, 2023
Figure 1 for ProAgent: Building Proactive Cooperative AI with Large Language Models
Figure 2 for ProAgent: Building Proactive Cooperative AI with Large Language Models
Figure 3 for ProAgent: Building Proactive Cooperative AI with Large Language Models
Viaarxiv icon

SSMG: Spatial-Semantic Map Guided Diffusion Model for Free-form Layout-to-Image Generation

Add code
Aug 20, 2023
Figure 1 for SSMG: Spatial-Semantic Map Guided Diffusion Model for Free-form Layout-to-Image Generation
Figure 2 for SSMG: Spatial-Semantic Map Guided Diffusion Model for Free-form Layout-to-Image Generation
Figure 3 for SSMG: Spatial-Semantic Map Guided Diffusion Model for Free-form Layout-to-Image Generation
Figure 4 for SSMG: Spatial-Semantic Map Guided Diffusion Model for Free-form Layout-to-Image Generation
Viaarxiv icon

FULLER: Unified Multi-modality Multi-task 3D Perception via Multi-level Gradient Calibration

Add code
Jul 31, 2023
Figure 1 for FULLER: Unified Multi-modality Multi-task 3D Perception via Multi-level Gradient Calibration
Figure 2 for FULLER: Unified Multi-modality Multi-task 3D Perception via Multi-level Gradient Calibration
Figure 3 for FULLER: Unified Multi-modality Multi-task 3D Perception via Multi-level Gradient Calibration
Figure 4 for FULLER: Unified Multi-modality Multi-task 3D Perception via Multi-level Gradient Calibration
Viaarxiv icon

Two-stream Multi-level Dynamic Point Transformer for Two-person Interaction Recognition

Add code
Jul 22, 2023
Viaarxiv icon

Maximum Entropy Heterogeneous-Agent Mirror Learning

Add code
Jun 19, 2023
Viaarxiv icon