Peize Sun

IDA-VLM: Towards Movie Understanding via ID-Aware Large Vision-Language Model

Jul 10, 2024
Via arXiv

Autoregressive Model Beats Diffusion: Llama for Scalable Image Generation

Jun 10, 2024

RoboCodeX: Multimodal Code Generation for Robotic Behavior Synthesis

Feb 25, 2024

Enhancing Your Trained DETRs with Box Refinement

Jul 21, 2023

Semantic-SAM: Segment and Recognize Anything at Any Granularity

Jul 10, 2023

GPT4RoI: Instruction Tuning Large Language Model on Region-of-Interest

Jul 07, 2023

Going Denser with Open-Vocabulary Part Segmentation

May 18, 2023

ByteTrackV2: 2D and 3D Multi-Object Tracking by Associating Every Detection Box

Mar 27, 2023

Learning Object-Language Alignments for Open-Vocabulary Object Detection

Nov 27, 2022

DiffusionDet: Diffusion Model for Object Detection

Nov 17, 2022