Lianyu Hu

TennisExpert: Towards Expert-Level Analytical Sports Video Understanding

Mar 17, 2026
Via arXiv

HiMemVLN: Enhancing Reliability of Open-Source Zero-Shot Vision-and-Language Navigation with Hierarchical Memory System

Mar 16, 2026
Via arXiv

SSL-SSAW: Self-Supervised Learning with Sigmoid Self-Attention Weighting for Question-Based Sign Language Translation

Sep 17, 2025
Via arXiv

Adversarial Fair Multi-View Clustering

Aug 06, 2025
Via arXiv

Interpretable Clustering Ensemble

Jun 06, 2025
Via arXiv

iLLaVA: An Image is Worth Fewer Than 1/3 Input Tokens in Large Multimodal Models

Dec 09, 2024
Via arXiv

Conjunction Subspaces Test for Conformal and Selective Classification

Oct 16, 2024
Via arXiv

Deep Correlated Prompting for Visual Recognition with Missing Modalities

Oct 10, 2024
Via arXiv

Pose-Guided Fine-Grained Sign Language Video Generation

Sep 25, 2024
Via arXiv

Interpretable Clustering: A Survey

Sep 01, 2024
Via arXiv