Lin Ma

Learning Instruction-Guided Manipulation Affordance via Large Models for Embodied Robotic Tasks

Aug 20, 2024

3D Weakly Supervised Semantic Segmentation with 2D Vision-Language Guidance

Jul 13, 2024

Enhancing Robustness of Vision-Language Models through Orthogonality Learning and Cross-Regularization

Jul 11, 2024

OV-DINO: Unified Open-Vocabulary Detection with Language-Aware Selective Fusion

Jul 10, 2024

Experimental Demonstration of 16D Voronoi Constellation with Two-Level Coding over 50km Four-Core Fiber

Jul 09, 2024

RoboCAS: A Benchmark for Robotic Manipulation in Complex Object Arrangement Scenarios

Jul 09, 2024

Corki: Enabling Real-time Embodied AI Robots via Algorithm-Architecture Co-Design

Jul 05, 2024

MindBench: A Comprehensive Benchmark for Mind Map Structure Recognition and Analysis

Jul 03, 2024

RoboUniView: Visual-Language Model with Unified View Representation for Robotic Manipulation

Jun 27, 2024

Splatter a Video: Video Gaussian Representation for Versatile Processing

Jun 19, 2024