Picture for Yifan Yang

Yifan Yang

The Fourth Monocular Depth Estimation Challenge

Add code
Apr 24, 2025
Viaarxiv icon

EmoVoice: LLM-based Emotional Text-To-Speech Model with Freestyle Text Prompting

Add code
Apr 22, 2025
Viaarxiv icon

Pseudo-Autoregressive Neural Codec Language Models for Efficient Zero-Shot Text-to-Speech Synthesis

Add code
Apr 14, 2025
Viaarxiv icon

Hyperlocal disaster damage assessment using bi-temporal street-view imagery and pre-trained vision models

Add code
Apr 12, 2025
Viaarxiv icon

Adaptive Bounded Exploration and Intermediate Actions for Data Debiasing

Add code
Apr 10, 2025
Viaarxiv icon

Multi-Mission Tool Bench: Assessing the Robustness of LLM based Agents through Related and Dynamic Missions

Add code
Apr 03, 2025
Viaarxiv icon

HiTVideo: Hierarchical Tokenizers for Enhancing Text-to-Video Generation with Autoregressive Large Language Models

Add code
Mar 14, 2025
Viaarxiv icon

Simulating Automotive Radar with Lidar and Camera Inputs

Add code
Mar 11, 2025
Viaarxiv icon

StreamMind: Unlocking Full Frame Rate Streaming Video Dialogue through Event-Gated Cognition

Add code
Mar 08, 2025
Viaarxiv icon

Large-Scale AI in Telecom: Charting the Roadmap for Innovation, Scalability, and Enhanced Digital Experiences

Add code
Mar 06, 2025
Viaarxiv icon