Picture for Ke Hu

Ke Hu

GenPO: Generative Diffusion Models Meet On-Policy Reinforcement Learning

Add code
May 24, 2025
Viaarxiv icon

Word Level Timestamp Generation for Automatic Speech Recognition and Translation

Add code
May 21, 2025
Viaarxiv icon

Efficient and Direct Duplex Modeling for Speech-to-Speech Language Model

Add code
May 21, 2025
Viaarxiv icon

Training and Inference Efficiency of Encoder-Decoder Speech Models

Add code
Mar 07, 2025
Viaarxiv icon

NeKo: Toward Post Recognition Generative Correction Large Language Models with Task-Oriented Experts

Add code
Nov 08, 2024
Viaarxiv icon

VoiceTextBlender: Augmenting Large Language Models with Speech Capabilities via Single-Stage Joint Speech-Text Supervised Fine-Tuning

Add code
Oct 23, 2024
Viaarxiv icon

Chain-of-Thought Prompting for Speech Translation

Add code
Sep 17, 2024
Figure 1 for Chain-of-Thought Prompting for Speech Translation
Figure 2 for Chain-of-Thought Prompting for Speech Translation
Figure 3 for Chain-of-Thought Prompting for Speech Translation
Figure 4 for Chain-of-Thought Prompting for Speech Translation
Viaarxiv icon

Robust Principal Component Analysis via Discriminant Sample Weight Learning

Add code
Aug 22, 2024
Viaarxiv icon

Diffusion-based Reinforcement Learning via Q-weighted Variational Policy Optimization

Add code
May 25, 2024
Figure 1 for Diffusion-based Reinforcement Learning via Q-weighted Variational Policy Optimization
Figure 2 for Diffusion-based Reinforcement Learning via Q-weighted Variational Policy Optimization
Figure 3 for Diffusion-based Reinforcement Learning via Q-weighted Variational Policy Optimization
Figure 4 for Diffusion-based Reinforcement Learning via Q-weighted Variational Policy Optimization
Viaarxiv icon

Enhancing Visual Continual Learning with Language-Guided Supervision

Add code
Mar 24, 2024
Figure 1 for Enhancing Visual Continual Learning with Language-Guided Supervision
Figure 2 for Enhancing Visual Continual Learning with Language-Guided Supervision
Figure 3 for Enhancing Visual Continual Learning with Language-Guided Supervision
Figure 4 for Enhancing Visual Continual Learning with Language-Guided Supervision
Viaarxiv icon