Picture for Xinyao Zhang

Xinyao Zhang

READ More than What You See: Reinforcement Learning for Accurate and Coherent Audio Description Generations

Add code
Jun 22, 2026
Viaarxiv icon

Machine Learning Modeling for Real-Time Melt Pool Monitoring in Laser Powder Bed Fusion Additive Manufacturing: A Hybrid Approach

Add code
Jun 22, 2026
Viaarxiv icon

Predictive Repair Management Using a Multi-Head Attention Transformer and Online Learning

Add code
Jun 19, 2026
Viaarxiv icon

Employing General-Purpose and Biomedical Large Language Models with Advanced Prompt Engineering for Pharmacoepidemiologic Study Design

Add code
Apr 20, 2026
Viaarxiv icon

Precise Robot Command Understanding Using Grammar-Constrained Large Language Models

Add code
Apr 05, 2026
Viaarxiv icon

Evaluating Large and Lightweight Vision Models for Irregular Component Segmentation in E-Waste Disassembly

Add code
Mar 28, 2026
Viaarxiv icon

SAMA: Factorized Semantic Anchoring and Motion Alignment for Instruction-Guided Video Editing

Add code
Mar 19, 2026
Viaarxiv icon

MapNav: A Novel Memory Representation via Annotated Semantic Maps for VLM-based Vision-and-Language Navigation

Add code
Feb 19, 2025
Figure 1 for MapNav: A Novel Memory Representation via Annotated Semantic Maps for VLM-based Vision-and-Language Navigation
Figure 2 for MapNav: A Novel Memory Representation via Annotated Semantic Maps for VLM-based Vision-and-Language Navigation
Figure 3 for MapNav: A Novel Memory Representation via Annotated Semantic Maps for VLM-based Vision-and-Language Navigation
Figure 4 for MapNav: A Novel Memory Representation via Annotated Semantic Maps for VLM-based Vision-and-Language Navigation
Viaarxiv icon

Multi-Floor Zero-Shot Object Navigation Policy

Add code
Sep 17, 2024
Figure 1 for Multi-Floor Zero-Shot Object Navigation Policy
Figure 2 for Multi-Floor Zero-Shot Object Navigation Policy
Figure 3 for Multi-Floor Zero-Shot Object Navigation Policy
Figure 4 for Multi-Floor Zero-Shot Object Navigation Policy
Viaarxiv icon

RoboGPT: an intelligent agent of making embodied long-term decisions for daily instruction tasks

Add code
Nov 27, 2023
Viaarxiv icon