Picture for Richard Cartwright

Richard Cartwright

Text-To-Speech with Chain-of-Details: modeling temporal dynamics in speech generation

Add code
Apr 21, 2026
Viaarxiv icon

V2A-Mapper: A Lightweight Solution for Vision-to-Audio Generation by Connecting Foundation Models

Add code
Aug 21, 2023
Viaarxiv icon

Low latency transformers for speech processing

Add code
Feb 27, 2023
Figure 1 for Low latency transformers for speech processing
Figure 2 for Low latency transformers for speech processing
Figure 3 for Low latency transformers for speech processing
Figure 4 for Low latency transformers for speech processing
Viaarxiv icon