Picture for Xiao Yang

Xiao Yang

BPQP: A Differentiable Convex Optimization Framework for Efficient End-to-End Learning

Add code
Nov 28, 2024
Viaarxiv icon

IKUN: Initialization to Keep snn training and generalization great with sUrrogate-stable variaNce

Add code
Nov 27, 2024
Viaarxiv icon

Video-RAG: Visually-aligned Retrieval-Augmented Long Video Comprehension

Add code
Nov 20, 2024
Figure 1 for Video-RAG: Visually-aligned Retrieval-Augmented Long Video Comprehension
Figure 2 for Video-RAG: Visually-aligned Retrieval-Augmented Long Video Comprehension
Figure 3 for Video-RAG: Visually-aligned Retrieval-Augmented Long Video Comprehension
Figure 4 for Video-RAG: Visually-aligned Retrieval-Augmented Long Video Comprehension
Viaarxiv icon

ARN-LSTM: A Multi-Stream Attention-Based Model for Action Recognition with Temporal Dynamics

Add code
Nov 04, 2024
Figure 1 for ARN-LSTM: A Multi-Stream Attention-Based Model for Action Recognition with Temporal Dynamics
Figure 2 for ARN-LSTM: A Multi-Stream Attention-Based Model for Action Recognition with Temporal Dynamics
Figure 3 for ARN-LSTM: A Multi-Stream Attention-Based Model for Action Recognition with Temporal Dynamics
Figure 4 for ARN-LSTM: A Multi-Stream Attention-Based Model for Action Recognition with Temporal Dynamics
Viaarxiv icon

Efficient Mixture-of-Expert for Video-based Driver State and Physiological Multi-task Estimation in Conditional Autonomous Driving

Add code
Oct 28, 2024
Viaarxiv icon

From Pixels to Tokens: Revisiting Object Hallucinations in Large Vision-Language Models

Add code
Oct 09, 2024
Figure 1 for From Pixels to Tokens: Revisiting Object Hallucinations in Large Vision-Language Models
Figure 2 for From Pixels to Tokens: Revisiting Object Hallucinations in Large Vision-Language Models
Figure 3 for From Pixels to Tokens: Revisiting Object Hallucinations in Large Vision-Language Models
Figure 4 for From Pixels to Tokens: Revisiting Object Hallucinations in Large Vision-Language Models
Viaarxiv icon

Defense-as-a-Service: Black-box Shielding against Backdoored Graph Models

Add code
Oct 07, 2024
Viaarxiv icon

Collaborative Evolving Strategy for Automatic Data-Centric Development

Add code
Jul 26, 2024
Figure 1 for Collaborative Evolving Strategy for Automatic Data-Centric Development
Figure 2 for Collaborative Evolving Strategy for Automatic Data-Centric Development
Figure 3 for Collaborative Evolving Strategy for Automatic Data-Centric Development
Figure 4 for Collaborative Evolving Strategy for Automatic Data-Centric Development
Viaarxiv icon

Are Large Language Models a Good Replacement of Taxonomies?

Add code
Jun 17, 2024
Viaarxiv icon

Graph Neural Backdoor: Fundamentals, Methodologies, Applications, and Future Directions

Add code
Jun 15, 2024
Viaarxiv icon