Picture for Jiachen Lian

Jiachen Lian

LCS-CTC: Leveraging Soft Alignments to Enhance Phonetic Transcription Robustness

Add code
Aug 05, 2025
Viaarxiv icon

Analysis and Evaluation of Synthetic Data Generation in Speech Dysfluency Detection

Add code
May 28, 2025
Viaarxiv icon

Dysfluent WFST: A Framework for Zero-Shot Speech Dysfluency Transcription and Detection

Add code
May 22, 2025
Viaarxiv icon

Full-Duplex-Bench: A Benchmark to Evaluate Full-duplex Spoken Dialogue Models on Turn-taking Capabilities

Add code
Mar 06, 2025
Viaarxiv icon

SSDM 2.0: Time-Accurate Speech Rich Transcription with Non-Fluencies

Add code
Nov 29, 2024
Figure 1 for SSDM 2.0: Time-Accurate Speech Rich Transcription with Non-Fluencies
Figure 2 for SSDM 2.0: Time-Accurate Speech Rich Transcription with Non-Fluencies
Figure 3 for SSDM 2.0: Time-Accurate Speech Rich Transcription with Non-Fluencies
Figure 4 for SSDM 2.0: Time-Accurate Speech Rich Transcription with Non-Fluencies
Viaarxiv icon

Stutter-Solver: End-to-end Multi-lingual Dysfluency Detection

Add code
Sep 15, 2024
Viaarxiv icon

YOLO-Stutter: End-to-end Region-Wise Speech Dysfluency Detection

Add code
Sep 09, 2024
Figure 1 for YOLO-Stutter: End-to-end Region-Wise Speech Dysfluency Detection
Figure 2 for YOLO-Stutter: End-to-end Region-Wise Speech Dysfluency Detection
Figure 3 for YOLO-Stutter: End-to-end Region-Wise Speech Dysfluency Detection
Figure 4 for YOLO-Stutter: End-to-end Region-Wise Speech Dysfluency Detection
Viaarxiv icon

SSDM: Scalable Speech Dysfluency Modeling

Add code
Aug 29, 2024
Figure 1 for SSDM: Scalable Speech Dysfluency Modeling
Figure 2 for SSDM: Scalable Speech Dysfluency Modeling
Figure 3 for SSDM: Scalable Speech Dysfluency Modeling
Figure 4 for SSDM: Scalable Speech Dysfluency Modeling
Viaarxiv icon

VoxGenesis: Unsupervised Discovery of Latent Speaker Manifold for Speech Synthesis

Add code
Mar 01, 2024
Figure 1 for VoxGenesis: Unsupervised Discovery of Latent Speaker Manifold for Speech Synthesis
Figure 2 for VoxGenesis: Unsupervised Discovery of Latent Speaker Manifold for Speech Synthesis
Figure 3 for VoxGenesis: Unsupervised Discovery of Latent Speaker Manifold for Speech Synthesis
Figure 4 for VoxGenesis: Unsupervised Discovery of Latent Speaker Manifold for Speech Synthesis
Viaarxiv icon

Towards Hierarchical Spoken Language Dysfluency Modeling

Add code
Jan 21, 2024
Figure 1 for Towards Hierarchical Spoken Language Dysfluency Modeling
Figure 2 for Towards Hierarchical Spoken Language Dysfluency Modeling
Figure 3 for Towards Hierarchical Spoken Language Dysfluency Modeling
Figure 4 for Towards Hierarchical Spoken Language Dysfluency Modeling
Viaarxiv icon