Picture for Feng Zheng

Feng Zheng

A0: An Affordance-Aware Hierarchical Model for General Robotic Manipulation

Add code
Apr 21, 2025
Viaarxiv icon

RefComp: A Reference-guided Unified Framework for Unpaired Point Cloud Completion

Add code
Apr 18, 2025
Viaarxiv icon

Few-Shot Referring Video Single- and Multi-Object Segmentation via Cross-Modal Affinity with Instance Sequence Matching

Add code
Apr 18, 2025
Viaarxiv icon

GIFT: Generated Indoor video frames for Texture-less point tracking

Add code
Mar 17, 2025
Viaarxiv icon

RoboReflect: Robotic Reflective Reasoning for Grasping Ambiguous-Condition Objects

Add code
Jan 16, 2025
Figure 1 for RoboReflect: Robotic Reflective Reasoning for Grasping Ambiguous-Condition Objects
Figure 2 for RoboReflect: Robotic Reflective Reasoning for Grasping Ambiguous-Condition Objects
Figure 3 for RoboReflect: Robotic Reflective Reasoning for Grasping Ambiguous-Condition Objects
Figure 4 for RoboReflect: Robotic Reflective Reasoning for Grasping Ambiguous-Condition Objects
Viaarxiv icon

Enhancing Uncertainty Modeling with Semantic Graph for Hallucination Detection

Add code
Jan 02, 2025
Figure 1 for Enhancing Uncertainty Modeling with Semantic Graph for Hallucination Detection
Figure 2 for Enhancing Uncertainty Modeling with Semantic Graph for Hallucination Detection
Figure 3 for Enhancing Uncertainty Modeling with Semantic Graph for Hallucination Detection
Figure 4 for Enhancing Uncertainty Modeling with Semantic Graph for Hallucination Detection
Viaarxiv icon

Image-based Multimodal Models as Intruders: Transferable Multimodal Attacks on Video-based MLLMs

Add code
Jan 02, 2025
Figure 1 for Image-based Multimodal Models as Intruders: Transferable Multimodal Attacks on Video-based MLLMs
Figure 2 for Image-based Multimodal Models as Intruders: Transferable Multimodal Attacks on Video-based MLLMs
Figure 3 for Image-based Multimodal Models as Intruders: Transferable Multimodal Attacks on Video-based MLLMs
Figure 4 for Image-based Multimodal Models as Intruders: Transferable Multimodal Attacks on Video-based MLLMs
Viaarxiv icon

SoftPatch+: Fully Unsupervised Anomaly Classification and Segmentation

Add code
Dec 30, 2024
Figure 1 for SoftPatch+: Fully Unsupervised Anomaly Classification and Segmentation
Figure 2 for SoftPatch+: Fully Unsupervised Anomaly Classification and Segmentation
Figure 3 for SoftPatch+: Fully Unsupervised Anomaly Classification and Segmentation
Figure 4 for SoftPatch+: Fully Unsupervised Anomaly Classification and Segmentation
Viaarxiv icon

A Self-guided Multimodal Approach to Enhancing Graph Representation Learning for Alzheimer's Diseases

Add code
Dec 09, 2024
Figure 1 for A Self-guided Multimodal Approach to Enhancing Graph Representation Learning for Alzheimer's Diseases
Figure 2 for A Self-guided Multimodal Approach to Enhancing Graph Representation Learning for Alzheimer's Diseases
Figure 3 for A Self-guided Multimodal Approach to Enhancing Graph Representation Learning for Alzheimer's Diseases
Figure 4 for A Self-guided Multimodal Approach to Enhancing Graph Representation Learning for Alzheimer's Diseases
Viaarxiv icon

InfiniteWorld: A Unified Scalable Simulation Framework for General Visual-Language Robot Interaction

Add code
Dec 08, 2024
Viaarxiv icon