Picture for Mengjie Zhao

Mengjie Zhao

Speech-Worthy Alignment for Japanese SpeechLLMs via Direct Preference Optimization

Add code
Mar 13, 2026
Viaarxiv icon

Streaming Translation and Transcription Through Speech-to-Text Causal Alignment

Add code
Mar 12, 2026
Viaarxiv icon

Deep Whole-body Parkour

Add code
Jan 12, 2026
Viaarxiv icon

Hiking in the Wild: A Scalable Perceptive Parkour Framework for Humanoids

Add code
Jan 12, 2026
Viaarxiv icon

High-Rank Structured Modulation for Parameter-Efficient Fine-Tuning

Add code
Jan 12, 2026
Viaarxiv icon

Time-Vertex Machine Learning for Optimal Sensor Placement in Temporal Graph Signals: Applications in Structural Health Monitoring

Add code
Dec 22, 2025
Viaarxiv icon

VinaBench: Benchmark for Faithful and Consistent Visual Narratives

Add code
Mar 26, 2025
Figure 1 for VinaBench: Benchmark for Faithful and Consistent Visual Narratives
Figure 2 for VinaBench: Benchmark for Faithful and Consistent Visual Narratives
Figure 3 for VinaBench: Benchmark for Faithful and Consistent Visual Narratives
Figure 4 for VinaBench: Benchmark for Faithful and Consistent Visual Narratives
Viaarxiv icon

Cross-Modal Learning for Music-to-Music-Video Description Generation

Add code
Mar 14, 2025
Viaarxiv icon

DeepResonance: Enhancing Multimodal Music Understanding via Music-centric Multi-way Instruction Tuning

Add code
Feb 18, 2025
Viaarxiv icon

OpenMU: Your Swiss Army Knife for Music Understanding

Add code
Oct 21, 2024
Figure 1 for OpenMU: Your Swiss Army Knife for Music Understanding
Figure 2 for OpenMU: Your Swiss Army Knife for Music Understanding
Figure 3 for OpenMU: Your Swiss Army Knife for Music Understanding
Figure 4 for OpenMU: Your Swiss Army Knife for Music Understanding
Viaarxiv icon