Feng Zheng

All Robots in One: A New Standard and Unified Dataset for Versatile, General-Purpose Embodied Agents

Aug 20, 2024

Reflective Instruction Tuning: Mitigating Hallucinations in Large Vision-Language Models

Jul 16, 2024

Mutual Learning for Acoustic Matching and Dereverberation via Visual Scene-driven Diffusion

Jul 15, 2024

PVUW 2024 Challenge on Complex Video Understanding: Methods and Results

Jun 24, 2024

1st Place Solution for MeViS Track in CVPR 2024 PVUW Workshop: Motion Expression guided Video Segmentation

Jun 11, 2024

LLplace: The 3D Indoor Scene Layout Generation and Editing via Large Language Model

Jun 06, 2024

On the Noise Robustness of In-Context Learning for Text Generation

May 27, 2024

Two in One Go: Single-stage Emotion Recognition with Decoupled Subject-context Transformer

Apr 29, 2024

UniAV: Unified Audio-Visual Perception for Multi-Task Video Localization

Apr 04, 2024

Negative Label Guided OOD Detection with Pretrained Vision-Language Models

Mar 29, 2024