Picture for Yan Peng

Yan Peng

Towards Instance Segmentation with Polygon Detection Transformers

Add code
Mar 10, 2026
Viaarxiv icon

FVG-PT: Adaptive Foreground View-Guided Prompt Tuning for Vision-Language Models

Add code
Mar 09, 2026
Viaarxiv icon

DPC: Dual-Prompt Collaboration for Tuning Vision-Language Models

Add code
Mar 17, 2025
Figure 1 for DPC: Dual-Prompt Collaboration for Tuning Vision-Language Models
Figure 2 for DPC: Dual-Prompt Collaboration for Tuning Vision-Language Models
Figure 3 for DPC: Dual-Prompt Collaboration for Tuning Vision-Language Models
Figure 4 for DPC: Dual-Prompt Collaboration for Tuning Vision-Language Models
Viaarxiv icon

PointVLA: Injecting the 3D World into Vision-Language-Action Models

Add code
Mar 10, 2025
Figure 1 for PointVLA: Injecting the 3D World into Vision-Language-Action Models
Figure 2 for PointVLA: Injecting the 3D World into Vision-Language-Action Models
Figure 3 for PointVLA: Injecting the 3D World into Vision-Language-Action Models
Figure 4 for PointVLA: Injecting the 3D World into Vision-Language-Action Models
Viaarxiv icon

Unity RL Playground: A Versatile Reinforcement Learning Framework for Mobile Robots

Add code
Mar 07, 2025
Figure 1 for Unity RL Playground: A Versatile Reinforcement Learning Framework for Mobile Robots
Figure 2 for Unity RL Playground: A Versatile Reinforcement Learning Framework for Mobile Robots
Figure 3 for Unity RL Playground: A Versatile Reinforcement Learning Framework for Mobile Robots
Figure 4 for Unity RL Playground: A Versatile Reinforcement Learning Framework for Mobile Robots
Viaarxiv icon

VIKSER: Visual Knowledge-Driven Self-Reinforcing Reasoning Framework

Add code
Feb 02, 2025
Figure 1 for VIKSER: Visual Knowledge-Driven Self-Reinforcing Reasoning Framework
Figure 2 for VIKSER: Visual Knowledge-Driven Self-Reinforcing Reasoning Framework
Figure 3 for VIKSER: Visual Knowledge-Driven Self-Reinforcing Reasoning Framework
Figure 4 for VIKSER: Visual Knowledge-Driven Self-Reinforcing Reasoning Framework
Viaarxiv icon

PhiP-G: Physics-Guided Text-to-3D Compositional Scene Generation

Add code
Feb 02, 2025
Figure 1 for PhiP-G: Physics-Guided Text-to-3D Compositional Scene Generation
Figure 2 for PhiP-G: Physics-Guided Text-to-3D Compositional Scene Generation
Figure 3 for PhiP-G: Physics-Guided Text-to-3D Compositional Scene Generation
Figure 4 for PhiP-G: Physics-Guided Text-to-3D Compositional Scene Generation
Viaarxiv icon

Hybrid Physics-ML Modeling for Marine Vehicle Maneuvering Motions in the Presence of Environmental Disturbances

Add code
Nov 21, 2024
Figure 1 for Hybrid Physics-ML Modeling for Marine Vehicle Maneuvering Motions in the Presence of Environmental Disturbances
Figure 2 for Hybrid Physics-ML Modeling for Marine Vehicle Maneuvering Motions in the Presence of Environmental Disturbances
Figure 3 for Hybrid Physics-ML Modeling for Marine Vehicle Maneuvering Motions in the Presence of Environmental Disturbances
Figure 4 for Hybrid Physics-ML Modeling for Marine Vehicle Maneuvering Motions in the Presence of Environmental Disturbances
Viaarxiv icon

Reducing Hallucinations: Enhancing VQA for Flood Disaster Damage Assessment with Visual Contexts

Add code
Dec 21, 2023
Viaarxiv icon

Unleashing the Potential of Large Language Model: Zero-shot VQA for Flood Disaster Scenario

Add code
Dec 04, 2023
Viaarxiv icon