Long Zeng

GTR-Bench: Evaluating Geo-Temporal Reasoning in Vision-Language Models

Oct 09, 2025
Via arXiv

MUSE: MCTS-Driven Red Teaming Framework for Enhanced Multi-Turn Dialogue Safety in Large Language Models

Sep 18, 2025

Perception Before Reasoning: Two-Stage Reinforcement Learning for Visual Reasoning in Vision-Language Models

Sep 16, 2025

$S^3$LAM: Surfel Splatting SLAM for Geometrically Accurate Tracking and Mapping

Jul 28, 2025

Embodied Intelligent Industrial Robotics: Concepts and Techniques

May 15, 2025

Demonstrating DVS: Dynamic Virtual-Real Simulation Platform for Mobile Robotic Tasks

Apr 26, 2025

Hierarchical Vector Quantized Graph Autoencoder with Annealing-Based Code Selection

Apr 17, 2025

DSM: Building A Diverse Semantic Map for 3D Visual Grounding

Apr 11, 2025

SPF-Portrait: Towards Pure Portrait Customization with Semantic Pollution-Free Fine-tuning

Apr 01, 2025

HumanAesExpert: Advancing a Multi-Modality Foundation Model for Human Image Aesthetic Assessment

Mar 31, 2025