Picture for Yong Zhao

Yong Zhao

Fred

Bootstrapping Imitation Learning for Long-horizon Manipulation via Hierarchical Data Collection Space

Add code
May 23, 2025
Viaarxiv icon

Towards Autonomous UAV Visual Object Search in City Space: Benchmark and Agentic Methodology

Add code
May 14, 2025
Viaarxiv icon

DLW-CI: A Dynamic Likelihood-Weighted Cooperative Infotaxis Approach for Multi-Source Search in Urban Environments Using Consumer Drone Networks

Add code
Apr 19, 2025
Viaarxiv icon

GeoNav: Empowering MLLMs with Explicit Geospatial Reasoning Abilities for Language-Goal Aerial Navigation

Add code
Apr 13, 2025
Viaarxiv icon

10K is Enough: An Ultra-Lightweight Binarized Network for Infrared Small-Target Detection

Add code
Mar 04, 2025
Viaarxiv icon

CityEQA: A Hierarchical LLM Agent on Embodied Question Answering Benchmark in City Space

Add code
Feb 20, 2025
Viaarxiv icon

Streaming Speaker Change Detection and Gender Classification for Transducer-Based Multi-Talker Speech Translation

Add code
Feb 04, 2025
Figure 1 for Streaming Speaker Change Detection and Gender Classification for Transducer-Based Multi-Talker Speech Translation
Figure 2 for Streaming Speaker Change Detection and Gender Classification for Transducer-Based Multi-Talker Speech Translation
Figure 3 for Streaming Speaker Change Detection and Gender Classification for Transducer-Based Multi-Talker Speech Translation
Figure 4 for Streaming Speaker Change Detection and Gender Classification for Transducer-Based Multi-Talker Speech Translation
Viaarxiv icon

RE-POSE: Synergizing Reinforcement Learning-Based Partitioning and Offloading for Edge Object Detection

Add code
Jan 16, 2025
Figure 1 for RE-POSE: Synergizing Reinforcement Learning-Based Partitioning and Offloading for Edge Object Detection
Figure 2 for RE-POSE: Synergizing Reinforcement Learning-Based Partitioning and Offloading for Edge Object Detection
Figure 3 for RE-POSE: Synergizing Reinforcement Learning-Based Partitioning and Offloading for Edge Object Detection
Figure 4 for RE-POSE: Synergizing Reinforcement Learning-Based Partitioning and Offloading for Edge Object Detection
Viaarxiv icon

Aligning Large Language Models for Faithful Integrity Against Opposing Argument

Add code
Jan 02, 2025
Figure 1 for Aligning Large Language Models for Faithful Integrity Against Opposing Argument
Figure 2 for Aligning Large Language Models for Faithful Integrity Against Opposing Argument
Figure 3 for Aligning Large Language Models for Faithful Integrity Against Opposing Argument
Figure 4 for Aligning Large Language Models for Faithful Integrity Against Opposing Argument
Viaarxiv icon

Hadamard Attention Recurrent Transformer: A Strong Baseline for Stereo Matching Transformer

Add code
Jan 02, 2025
Viaarxiv icon