Yiyang Zhang

Ultrasound Vision-Language Alignment via Contrastive Learning

May 04, 2026
Via arXiv

Stealthy and Adjustable Text-Guided Backdoor Attacks on Multimodal Pretrained Models

Apr 07, 2026
Via arXiv

HabitatAgent: An End-to-End Multi-Agent System for Housing Consultation

Apr 01, 2026
Via arXiv

MOSS-TTSD: Text to Spoken Dialogue Generation

Mar 20, 2026
Via arXiv

MOSS-TTS Technical Report

Mar 18, 2026
Via arXiv

Stable Spike: Dual Consistency Optimization via Bitwise AND Operations for Spiking Neural Networks

Mar 12, 2026
Via arXiv

MOVA: Towards Scalable and Synchronized Video-Audio Generation

Feb 09, 2026
Via arXiv

MOSS Transcribe Diarize: Accurate Transcription with Speaker Diarization

Jan 08, 2026
Via arXiv

MOSS-Speech: Towards True Speech-to-Speech Models Without Text Guidance

Oct 02, 2025
Via arXiv

SitLLM: Large Language Models for Sitting Posture Health Understanding via Pressure Sensor Data

Sep 16, 2025
Via arXiv