Picture for Jian Guan

Jian Guan

Mixture-of-Modules: Reinventing Transformers as Dynamic Assemblies of Modules

Add code
Jul 09, 2024
Viaarxiv icon

A Reference-free Metric for Language-Queried Audio Source Separation using Contrastive Language-Audio Pretraining

Add code
Jul 06, 2024
Viaarxiv icon

From the Least to the Most: Building a Plug-and-Play Visual Reasoner via Data Synthesis

Add code
Jun 28, 2024
Figure 1 for From the Least to the Most: Building a Plug-and-Play Visual Reasoner via Data Synthesis
Figure 2 for From the Least to the Most: Building a Plug-and-Play Visual Reasoner via Data Synthesis
Figure 3 for From the Least to the Most: Building a Plug-and-Play Visual Reasoner via Data Synthesis
Figure 4 for From the Least to the Most: Building a Plug-and-Play Visual Reasoner via Data Synthesis
Viaarxiv icon

FastDrag: Manipulate Anything in One Step

Add code
May 24, 2024
Figure 1 for FastDrag: Manipulate Anything in One Step
Figure 2 for FastDrag: Manipulate Anything in One Step
Figure 3 for FastDrag: Manipulate Anything in One Step
Figure 4 for FastDrag: Manipulate Anything in One Step
Viaarxiv icon

SISP: A Benchmark Dataset for Fine-grained Ship Instance Segmentation in Panchromatic Satellite Images

Add code
Feb 06, 2024
Viaarxiv icon

AMOR: A Recipe for Building Adaptable Modular Knowledge Agents Through Process Feedback

Add code
Feb 02, 2024
Viaarxiv icon

Language Models Hallucinate, but May Excel at Fact Verification

Add code
Oct 23, 2023
Viaarxiv icon

First-Shot Unsupervised Anomalous Sound Detection With Unknown Anomalies Estimated by Metadata-Assisted Audio Generation

Add code
Oct 22, 2023
Viaarxiv icon

Transformer-based Autoencoder with ID Constraint for Unsupervised Anomalous Sound Detection

Add code
Oct 13, 2023
Viaarxiv icon

Synth-AC: Enhancing Audio Captioning with Synthetic Supervision

Add code
Sep 18, 2023
Figure 1 for Synth-AC: Enhancing Audio Captioning with Synthetic Supervision
Figure 2 for Synth-AC: Enhancing Audio Captioning with Synthetic Supervision
Figure 3 for Synth-AC: Enhancing Audio Captioning with Synthetic Supervision
Figure 4 for Synth-AC: Enhancing Audio Captioning with Synthetic Supervision
Viaarxiv icon