Picture for Junxiao Xue

Junxiao Xue

AD-AVSR: Asymmetric Dual-stream Enhancement for Robust Audio-Visual Speech Recognition

Add code
Aug 11, 2025
Viaarxiv icon

A Trustworthy Method for Multimodal Emotion Recognition

Add code
Aug 11, 2025
Viaarxiv icon

eMotions: A Large-Scale Dataset and Audio-Visual Fusion Network for Emotion Analysis in Short-form Videos

Add code
Aug 09, 2025
Viaarxiv icon

HOLA: Enhancing Audio-visual Deepfake Detection via Hierarchical Contextual Aggregations and Efficient Pre-training

Add code
Jul 30, 2025
Viaarxiv icon

ViC-Bench: Benchmarking Visual-Interleaved Chain-of-Thought Capability in MLLMs with Free-Style Intermediate State Representations

Add code
May 20, 2025
Viaarxiv icon

Enhanced Multimodal RAG-LLM for Accurate Visual Question Answering

Add code
Dec 30, 2024
Figure 1 for Enhanced Multimodal RAG-LLM for Accurate Visual Question Answering
Figure 2 for Enhanced Multimodal RAG-LLM for Accurate Visual Question Answering
Figure 3 for Enhanced Multimodal RAG-LLM for Accurate Visual Question Answering
Figure 4 for Enhanced Multimodal RAG-LLM for Accurate Visual Question Answering
Viaarxiv icon

3A-YOLO: New Real-Time Object Detectors with Triple Discriminative Awareness and Coordinated Representations

Add code
Dec 10, 2024
Viaarxiv icon

Pilot-guided Multimodal Semantic Communication for Audio-Visual Event Localization

Add code
Dec 09, 2024
Viaarxiv icon

Edge-Cloud Collaborative Satellite Image Analysis for Efficient Man-Made Structure Recognition

Add code
Oct 08, 2024
Figure 1 for Edge-Cloud Collaborative Satellite Image Analysis for Efficient Man-Made Structure Recognition
Figure 2 for Edge-Cloud Collaborative Satellite Image Analysis for Efficient Man-Made Structure Recognition
Figure 3 for Edge-Cloud Collaborative Satellite Image Analysis for Efficient Man-Made Structure Recognition
Figure 4 for Edge-Cloud Collaborative Satellite Image Analysis for Efficient Man-Made Structure Recognition
Viaarxiv icon

Hierarchical Action Recognition: A Contrastive Video-Language Approach with Hierarchical Interactions

Add code
May 28, 2024
Figure 1 for Hierarchical Action Recognition: A Contrastive Video-Language Approach with Hierarchical Interactions
Figure 2 for Hierarchical Action Recognition: A Contrastive Video-Language Approach with Hierarchical Interactions
Figure 3 for Hierarchical Action Recognition: A Contrastive Video-Language Approach with Hierarchical Interactions
Figure 4 for Hierarchical Action Recognition: A Contrastive Video-Language Approach with Hierarchical Interactions
Viaarxiv icon