Picture for Yuren Cong

Yuren Cong

WorldAfford: Affordance Grounding based on Natural Language Instructions

May 21, 2024
Viaarxiv icon

Segment Any Object Model : Real-to-Simulation Fine-Tuning Strategy for Multi-Class Multi-Instance Segmentation

Mar 16, 2024
Figure 1 for Segment Any Object Model : Real-to-Simulation Fine-Tuning Strategy for Multi-Class Multi-Instance Segmentation
Figure 2 for Segment Any Object Model : Real-to-Simulation Fine-Tuning Strategy for Multi-Class Multi-Instance Segmentation
Figure 3 for Segment Any Object Model : Real-to-Simulation Fine-Tuning Strategy for Multi-Class Multi-Instance Segmentation
Figure 4 for Segment Any Object Model : Real-to-Simulation Fine-Tuning Strategy for Multi-Class Multi-Instance Segmentation
Viaarxiv icon

GenTron: Delving Deep into Diffusion Transformers for Image and Video Generation

Dec 07, 2023
Figure 1 for GenTron: Delving Deep into Diffusion Transformers for Image and Video Generation
Figure 2 for GenTron: Delving Deep into Diffusion Transformers for Image and Video Generation
Figure 3 for GenTron: Delving Deep into Diffusion Transformers for Image and Video Generation
Figure 4 for GenTron: Delving Deep into Diffusion Transformers for Image and Video Generation
Viaarxiv icon

FLATTEN: optical FLow-guided ATTENtion for consistent text-to-video editing

Add code
Oct 09, 2023
Figure 1 for FLATTEN: optical FLow-guided ATTENtion for consistent text-to-video editing
Figure 2 for FLATTEN: optical FLow-guided ATTENtion for consistent text-to-video editing
Figure 3 for FLATTEN: optical FLow-guided ATTENtion for consistent text-to-video editing
Figure 4 for FLATTEN: optical FLow-guided ATTENtion for consistent text-to-video editing
Viaarxiv icon

Learning Similarity between Scene Graphs and Images with Transformers

Add code
Apr 02, 2023
Figure 1 for Learning Similarity between Scene Graphs and Images with Transformers
Figure 2 for Learning Similarity between Scene Graphs and Images with Transformers
Figure 3 for Learning Similarity between Scene Graphs and Images with Transformers
Figure 4 for Learning Similarity between Scene Graphs and Images with Transformers
Viaarxiv icon

Attribute-Centric Compositional Text-to-Image Generation

Add code
Jan 04, 2023
Figure 1 for Attribute-Centric Compositional Text-to-Image Generation
Figure 2 for Attribute-Centric Compositional Text-to-Image Generation
Figure 3 for Attribute-Centric Compositional Text-to-Image Generation
Figure 4 for Attribute-Centric Compositional Text-to-Image Generation
Viaarxiv icon

SSGVS: Semantic Scene Graph-to-Video Synthesis

Nov 17, 2022
Figure 1 for SSGVS: Semantic Scene Graph-to-Video Synthesis
Figure 2 for SSGVS: Semantic Scene Graph-to-Video Synthesis
Figure 3 for SSGVS: Semantic Scene Graph-to-Video Synthesis
Figure 4 for SSGVS: Semantic Scene Graph-to-Video Synthesis
Viaarxiv icon

RelTR: Relation Transformer for Scene Graph Generation

Add code
Jan 27, 2022
Figure 1 for RelTR: Relation Transformer for Scene Graph Generation
Figure 2 for RelTR: Relation Transformer for Scene Graph Generation
Figure 3 for RelTR: Relation Transformer for Scene Graph Generation
Figure 4 for RelTR: Relation Transformer for Scene Graph Generation
Viaarxiv icon

Spatial-Temporal Transformer for Dynamic Scene Graph Generation

Add code
Aug 08, 2021
Figure 1 for Spatial-Temporal Transformer for Dynamic Scene Graph Generation
Figure 2 for Spatial-Temporal Transformer for Dynamic Scene Graph Generation
Figure 3 for Spatial-Temporal Transformer for Dynamic Scene Graph Generation
Figure 4 for Spatial-Temporal Transformer for Dynamic Scene Graph Generation
Viaarxiv icon