Shih-Lun Wu

Music ControlNet: Multiple Time-varying Controls for Music Generation

Nov 13, 2023

Improving Audio Captioning Models with Fine-grained Audio Features, Text Embedding Supervision, and LLM Mix-up Augmentation

Sep 29, 2023

Listener Model for the PhotoBook Referential Game with CLIPScores as Implicit Reference Chain

Jun 16, 2023

Tensor decomposition for minimization of E2E SLU model toward on-device processing

Jun 02, 2023

The Pipeline System of ASR and NLU with MLM-based Data Augmentation toward STOP Low-resource Challenge

May 11, 2023

A Study on the Integration of Pipeline and E2E SLU systems for Spoken Semantic Parsing toward STOP Quality Challenge

May 06, 2023

Compose & Embellish: Well-Structured Piano Performance Generation via A Two-Stage Approach

Sep 17, 2022

Theme Transformer: Symbolic Music Generation with Theme-Conditioned Transformer

Nov 07, 2021

Relative Positional Encoding for Transformers with Linear Complexity

Jun 10, 2021

MuseMorphose: Full-Song and Fine-Grained Music Style Transfer with Just One Transformer VAE

May 10, 2021