Wei-Cheng Tseng

Auden-Voice: General-Purpose Voice Encoder for Speech and Language Understanding

Nov 19, 2025

Scalable Policy Evaluation with Video World Models

Nov 17, 2025

Probing the Robustness Properties of Neural Speech Codecs

May 30, 2025

Cosmos World Foundation Model Platform for Physical AI

Jan 07, 2025

Dynamic-SUPERB Phase-2: A Collaboratively Expanding Benchmark for Measuring the Capabilities of Spoken Language Models with 180 Tasks

Nov 08, 2024

Gaussian Splatting Visual MPC for Granular Media Manipulation

Oct 13, 2024

SpeechPrompt: Prompting Speech Language Models for Speech Processing Tasks

Aug 23, 2024

A Large-Scale Evaluation of Speech Foundation Models

Apr 15, 2024

VMCML: Video and Music Matching via Cross-Modality Lifting

Mar 22, 2023

SpeechPrompt v2: Prompt Tuning for Speech Classification Tasks

Mar 01, 2023