Picture for Xirui Li

Xirui Li

Hunyuan-TurboS: Advancing Large Language Models through Mamba-Transformer Synergy and Adaptive Chain-of-Thought

Add code
May 21, 2025
Viaarxiv icon

R1-Zero's "Aha Moment" in Visual Reasoning on a 2B Non-SFT Model

Add code
Mar 07, 2025
Viaarxiv icon

Teaching Video Diffusion Model with Latent Physical Phenomenon Knowledge

Add code
Nov 18, 2024
Viaarxiv icon

Hunyuan-Large: An Open-Source MoE Model with 52 Billion Activated Parameters by Tencent

Add code
Nov 05, 2024
Figure 1 for Hunyuan-Large: An Open-Source MoE Model with 52 Billion Activated Parameters by Tencent
Figure 2 for Hunyuan-Large: An Open-Source MoE Model with 52 Billion Activated Parameters by Tencent
Figure 3 for Hunyuan-Large: An Open-Source MoE Model with 52 Billion Activated Parameters by Tencent
Figure 4 for Hunyuan-Large: An Open-Source MoE Model with 52 Billion Activated Parameters by Tencent
Viaarxiv icon

A Simple Approach to Unifying Diffusion-based Conditional Generation

Add code
Oct 15, 2024
Figure 1 for A Simple Approach to Unifying Diffusion-based Conditional Generation
Figure 2 for A Simple Approach to Unifying Diffusion-based Conditional Generation
Figure 3 for A Simple Approach to Unifying Diffusion-based Conditional Generation
Figure 4 for A Simple Approach to Unifying Diffusion-based Conditional Generation
Viaarxiv icon

MOSSBench: Is Your Multimodal Language Model Oversensitive to Safe Queries?

Add code
Jun 22, 2024
Figure 1 for MOSSBench: Is Your Multimodal Language Model Oversensitive to Safe Queries?
Figure 2 for MOSSBench: Is Your Multimodal Language Model Oversensitive to Safe Queries?
Figure 3 for MOSSBench: Is Your Multimodal Language Model Oversensitive to Safe Queries?
Figure 4 for MOSSBench: Is Your Multimodal Language Model Oversensitive to Safe Queries?
Viaarxiv icon

DrAttack: Prompt Decomposition and Reconstruction Makes Powerful LLM Jailbreakers

Add code
Mar 01, 2024
Figure 1 for DrAttack: Prompt Decomposition and Reconstruction Makes Powerful LLM Jailbreakers
Figure 2 for DrAttack: Prompt Decomposition and Reconstruction Makes Powerful LLM Jailbreakers
Figure 3 for DrAttack: Prompt Decomposition and Reconstruction Makes Powerful LLM Jailbreakers
Figure 4 for DrAttack: Prompt Decomposition and Reconstruction Makes Powerful LLM Jailbreakers
Viaarxiv icon

VidToMe: Video Token Merging for Zero-Shot Video Editing

Add code
Dec 19, 2023
Viaarxiv icon

Frame Fusion with Vehicle Motion Prediction for 3D Object Detection

Add code
Jun 19, 2023
Figure 1 for Frame Fusion with Vehicle Motion Prediction for 3D Object Detection
Figure 2 for Frame Fusion with Vehicle Motion Prediction for 3D Object Detection
Figure 3 for Frame Fusion with Vehicle Motion Prediction for 3D Object Detection
Figure 4 for Frame Fusion with Vehicle Motion Prediction for 3D Object Detection
Viaarxiv icon

Gait Identification under Surveillance Environment based on Human Skeleton

Add code
Nov 24, 2021
Figure 1 for Gait Identification under Surveillance Environment based on Human Skeleton
Figure 2 for Gait Identification under Surveillance Environment based on Human Skeleton
Figure 3 for Gait Identification under Surveillance Environment based on Human Skeleton
Figure 4 for Gait Identification under Surveillance Environment based on Human Skeleton
Viaarxiv icon