Picture for Quan Wang

Quan Wang

Arden

ExpertPrompting: Instructing Large Language Models to be Distinguished Experts

Add code
May 24, 2023
Viaarxiv icon

Not All Image Regions Matter: Masked Vector Quantization for Autoregressive Image Generation

Add code
May 23, 2023
Viaarxiv icon

Inductive Relation Prediction from Relational Paths and Context with Hierarchical Transformers

Add code
Apr 19, 2023
Viaarxiv icon

$k$NN Prompting: Beyond-Context Learning with Calibration-Free Nearest Neighbor Inference

Add code
Mar 24, 2023
Viaarxiv icon

CGOF++: Controllable 3D Face Synthesis with Conditional Generative Occupancy Fields

Add code
Nov 23, 2022
Figure 1 for CGOF++: Controllable 3D Face Synthesis with Conditional Generative Occupancy Fields
Figure 2 for CGOF++: Controllable 3D Face Synthesis with Conditional Generative Occupancy Fields
Figure 3 for CGOF++: Controllable 3D Face Synthesis with Conditional Generative Occupancy Fields
Figure 4 for CGOF++: Controllable 3D Face Synthesis with Conditional Generative Occupancy Fields
Viaarxiv icon

Exploring Sequence-to-Sequence Transformer-Transducer Models for Keyword Spotting

Add code
Nov 11, 2022
Figure 1 for Exploring Sequence-to-Sequence Transformer-Transducer Models for Keyword Spotting
Figure 2 for Exploring Sequence-to-Sequence Transformer-Transducer Models for Keyword Spotting
Figure 3 for Exploring Sequence-to-Sequence Transformer-Transducer Models for Keyword Spotting
Figure 4 for Exploring Sequence-to-Sequence Transformer-Transducer Models for Keyword Spotting
Viaarxiv icon

Augmenting Transformer-Transducer Based Speaker Change Detection With Token-Level Training Loss

Add code
Nov 11, 2022
Figure 1 for Augmenting Transformer-Transducer Based Speaker Change Detection With Token-Level Training Loss
Figure 2 for Augmenting Transformer-Transducer Based Speaker Change Detection With Token-Level Training Loss
Figure 3 for Augmenting Transformer-Transducer Based Speaker Change Detection With Token-Level Training Loss
Figure 4 for Augmenting Transformer-Transducer Based Speaker Change Detection With Token-Level Training Loss
Viaarxiv icon

Highly Efficient Real-Time Streaming and Fully On-Device Speaker Diarization with Multi-Stage Clustering

Add code
Oct 25, 2022
Figure 1 for Highly Efficient Real-Time Streaming and Fully On-Device Speaker Diarization with Multi-Stage Clustering
Figure 2 for Highly Efficient Real-Time Streaming and Fully On-Device Speaker Diarization with Multi-Stage Clustering
Figure 3 for Highly Efficient Real-Time Streaming and Fully On-Device Speaker Diarization with Multi-Stage Clustering
Figure 4 for Highly Efficient Real-Time Streaming and Fully On-Device Speaker Diarization with Multi-Stage Clustering
Viaarxiv icon

A Universally-Deployable ASR Frontend for Joint Acoustic Echo Cancellation, Speech Enhancement, and Voice Separation

Add code
Sep 14, 2022
Figure 1 for A Universally-Deployable ASR Frontend for Joint Acoustic Echo Cancellation, Speech Enhancement, and Voice Separation
Figure 2 for A Universally-Deployable ASR Frontend for Joint Acoustic Echo Cancellation, Speech Enhancement, and Voice Separation
Figure 3 for A Universally-Deployable ASR Frontend for Joint Acoustic Echo Cancellation, Speech Enhancement, and Voice Separation
Figure 4 for A Universally-Deployable ASR Frontend for Joint Acoustic Echo Cancellation, Speech Enhancement, and Voice Separation
Viaarxiv icon

Compact and Robust Deep Learning Architecture for Fluorescence Lifetime Imaging and FPGA Implementation

Add code
Sep 09, 2022
Figure 1 for Compact and Robust Deep Learning Architecture for Fluorescence Lifetime Imaging and FPGA Implementation
Figure 2 for Compact and Robust Deep Learning Architecture for Fluorescence Lifetime Imaging and FPGA Implementation
Figure 3 for Compact and Robust Deep Learning Architecture for Fluorescence Lifetime Imaging and FPGA Implementation
Figure 4 for Compact and Robust Deep Learning Architecture for Fluorescence Lifetime Imaging and FPGA Implementation
Viaarxiv icon