Zhewei Huang

Step-Audio-AQAA: a Fully End-to-End Expressive Large Audio Language Model

Jun 10, 2025
Via arXiv

ViStoryBench: Comprehensive Benchmark Suite for Story Visualization

May 30, 2025
Via arXiv

Advancing Video Self-Supervised Learning via Image Foundation Models

May 25, 2025
Via arXiv

DialogueReason: Rule-Based RL Sparks Dialogue Reasoning in LLMs

May 11, 2025
Via arXiv

Step-Audio: Unified Understanding and Generation in Intelligent Speech Interaction

Feb 18, 2025
Via arXiv

Advancing Auto-Regressive Continuation for Video Frames

Dec 04, 2024
Via arXiv

Recent Advances in Attack and Defense Approaches of Large Language Models

Sep 05, 2024
Via arXiv

A Survey on Video Prediction: From Deterministic to Generative Approaches

Jan 31, 2024
Via arXiv

Scale-Adaptive Feature Aggregation for Efficient Space-Time Video Super-Resolution

Oct 26, 2023
Via arXiv

A Dynamic Multi-Scale Voxel Flow Network for Video Prediction

Mar 24, 2023
Via arXiv