Picture for Feng Yang

Feng Yang

Joint Source-Channel-Generation Coding: From Distortion-oriented Reconstruction to Semantic-consistent Generation

Add code
Jan 19, 2026
Viaarxiv icon

Listen, Look, Drive: Coupling Audio Instructions for User-aware VLA-based Autonomous Driving

Add code
Jan 17, 2026
Viaarxiv icon

LLMTrack: Semantic Multi-Object Tracking with Multi-modal Large Language Models

Add code
Jan 10, 2026
Viaarxiv icon

WaTeRFlow: Watermark Temporal Robustness via Flow Consistency

Add code
Dec 22, 2025
Viaarxiv icon

StereoMV2D: A Sparse Temporal Stereo-Enhanced Framework for Robust Multi-View 3D Object Detection

Add code
Dec 19, 2025
Figure 1 for StereoMV2D: A Sparse Temporal Stereo-Enhanced Framework for Robust Multi-View 3D Object Detection
Figure 2 for StereoMV2D: A Sparse Temporal Stereo-Enhanced Framework for Robust Multi-View 3D Object Detection
Figure 3 for StereoMV2D: A Sparse Temporal Stereo-Enhanced Framework for Robust Multi-View 3D Object Detection
Figure 4 for StereoMV2D: A Sparse Temporal Stereo-Enhanced Framework for Robust Multi-View 3D Object Detection
Viaarxiv icon

DeCoP: Enhancing Self-Supervised Time Series Representation with Dependency Controlled Pre-training

Add code
Sep 18, 2025
Viaarxiv icon

ICDM: Interference Cancellation Diffusion Models for Wireless Semantic Communications

Add code
May 26, 2025
Viaarxiv icon

Bridging Geometry-Coherent Text-to-3D Generation with Multi-View Diffusion Priors and Gaussian Splatting

Add code
May 07, 2025
Figure 1 for Bridging Geometry-Coherent Text-to-3D Generation with Multi-View Diffusion Priors and Gaussian Splatting
Figure 2 for Bridging Geometry-Coherent Text-to-3D Generation with Multi-View Diffusion Priors and Gaussian Splatting
Figure 3 for Bridging Geometry-Coherent Text-to-3D Generation with Multi-View Diffusion Priors and Gaussian Splatting
Figure 4 for Bridging Geometry-Coherent Text-to-3D Generation with Multi-View Diffusion Priors and Gaussian Splatting
Viaarxiv icon

Dual-Branch Residual Network for Cross-Domain Few-Shot Hyperspectral Image Classification with Refined Prototype

Add code
Apr 27, 2025
Viaarxiv icon

OpenDriveVLA: Towards End-to-end Autonomous Driving with Large Vision Language Action Model

Add code
Mar 30, 2025
Figure 1 for OpenDriveVLA: Towards End-to-end Autonomous Driving with Large Vision Language Action Model
Figure 2 for OpenDriveVLA: Towards End-to-end Autonomous Driving with Large Vision Language Action Model
Figure 3 for OpenDriveVLA: Towards End-to-end Autonomous Driving with Large Vision Language Action Model
Figure 4 for OpenDriveVLA: Towards End-to-end Autonomous Driving with Large Vision Language Action Model
Viaarxiv icon