Jinxiang Liu

DENOISER: Rethinking the Robustness for Open-Vocabulary Action Recognition

Apr 23, 2024
Via arXiv

Audio-Visual Segmentation via Unlabeled Frame Exploitation

Mar 17, 2024
Via arXiv

Audio-aware Query-enhanced Transformer for Audio-Visual Segmentation

Jul 25, 2023
Via arXiv

Annotation-free Audio-Visual Segmentation

May 19, 2023
Via arXiv

DiffusionSeg: Adapting Diffusion Towards Unsupervised Object Discovery

Mar 17, 2023
Via arXiv

Constraint and Union for Partially-Supervised Temporal Sentence Grounding

Feb 20, 2023
Via arXiv

Distilling Vision-Language Pre-training to Collaborate with Weakly-Supervised Temporal Action Localization

Dec 19, 2022
Via arXiv

Exploiting Transformation Invariance and Equivariance for Self-supervised Sound Localisation

Jun 26, 2022
Via arXiv

A 3D Mesh-based Lifting-and-Projection Network for Human Pose Transfer

Sep 24, 2021
Via arXiv