Picture for Pan Zhou

Pan Zhou

The Hubei Engineering Research Center on Big Data Security, School of Cyber Science and Engineering, Huazhong University of Science and Technology

A Survey of Attacks on Large Vision-Language Models: Resources, Advances, and Future Trends

Add code
Jul 10, 2024
Figure 1 for A Survey of Attacks on Large Vision-Language Models: Resources, Advances, and Future Trends
Figure 2 for A Survey of Attacks on Large Vision-Language Models: Resources, Advances, and Future Trends
Figure 3 for A Survey of Attacks on Large Vision-Language Models: Resources, Advances, and Future Trends
Figure 4 for A Survey of Attacks on Large Vision-Language Models: Resources, Advances, and Future Trends
Viaarxiv icon

LoCo: Low-Bit Communication Adaptor for Large-scale Model Training

Add code
Jul 05, 2024
Viaarxiv icon

Self-Cognition in Large Language Models: An Exploratory Study

Add code
Jul 01, 2024
Figure 1 for Self-Cognition in Large Language Models: An Exploratory Study
Figure 2 for Self-Cognition in Large Language Models: An Exploratory Study
Figure 3 for Self-Cognition in Large Language Models: An Exploratory Study
Figure 4 for Self-Cognition in Large Language Models: An Exploratory Study
Viaarxiv icon

A Hopfieldian View-based Interpretation for Chain-of-Thought Reasoning

Add code
Jun 18, 2024
Figure 1 for A Hopfieldian View-based Interpretation for Chain-of-Thought Reasoning
Figure 2 for A Hopfieldian View-based Interpretation for Chain-of-Thought Reasoning
Figure 3 for A Hopfieldian View-based Interpretation for Chain-of-Thought Reasoning
Figure 4 for A Hopfieldian View-based Interpretation for Chain-of-Thought Reasoning
Viaarxiv icon

GUI-WORLD: A Dataset for GUI-oriented Multimodal LLM-based Agents

Add code
Jun 16, 2024
Figure 1 for GUI-WORLD: A Dataset for GUI-oriented Multimodal LLM-based Agents
Figure 2 for GUI-WORLD: A Dataset for GUI-oriented Multimodal LLM-based Agents
Figure 3 for GUI-WORLD: A Dataset for GUI-oriented Multimodal LLM-based Agents
Figure 4 for GUI-WORLD: A Dataset for GUI-oriented Multimodal LLM-based Agents
Viaarxiv icon

Living in the Moment: Can Large Language Models Grasp Co-Temporal Reasoning?

Add code
Jun 13, 2024
Viaarxiv icon

MVGamba: Unify 3D Content Generation as State Space Sequence Modeling

Add code
Jun 10, 2024
Figure 1 for MVGamba: Unify 3D Content Generation as State Space Sequence Modeling
Figure 2 for MVGamba: Unify 3D Content Generation as State Space Sequence Modeling
Figure 3 for MVGamba: Unify 3D Content Generation as State Space Sequence Modeling
Figure 4 for MVGamba: Unify 3D Content Generation as State Space Sequence Modeling
Viaarxiv icon

4-bit Shampoo for Memory-Efficient Network Training

Add code
May 28, 2024
Figure 1 for 4-bit Shampoo for Memory-Efficient Network Training
Figure 2 for 4-bit Shampoo for Memory-Efficient Network Training
Figure 3 for 4-bit Shampoo for Memory-Efficient Network Training
Figure 4 for 4-bit Shampoo for Memory-Efficient Network Training
Viaarxiv icon

LOVA3: Learning to Visual Question Answering, Asking and Assessment

Add code
May 23, 2024
Figure 1 for LOVA3: Learning to Visual Question Answering, Asking and Assessment
Figure 2 for LOVA3: Learning to Visual Question Answering, Asking and Assessment
Figure 3 for LOVA3: Learning to Visual Question Answering, Asking and Assessment
Figure 4 for LOVA3: Learning to Visual Question Answering, Asking and Assessment
Viaarxiv icon

Physical Backdoor: Towards Temperature-based Backdoor Attacks in the Physical World

Add code
Apr 30, 2024
Figure 1 for Physical Backdoor: Towards Temperature-based Backdoor Attacks in the Physical World
Figure 2 for Physical Backdoor: Towards Temperature-based Backdoor Attacks in the Physical World
Figure 3 for Physical Backdoor: Towards Temperature-based Backdoor Attacks in the Physical World
Figure 4 for Physical Backdoor: Towards Temperature-based Backdoor Attacks in the Physical World
Viaarxiv icon