Picture for Xiaohan Li

Xiaohan Li

University of Science and Technology of China

MIDAS: Multimodal Interactive Digital-humAn Synthesis via Real-time Autoregressive Video Generation

Add code
Aug 28, 2025
Viaarxiv icon

Stereo 3D Gaussian Splatting SLAM for Outdoor Urban Scenes

Add code
Jul 31, 2025
Viaarxiv icon

Ov3R: Open-Vocabulary Semantic 3D Reconstruction from RGB Videos

Add code
Jul 29, 2025
Viaarxiv icon

Natural Language Generation in Healthcare: A Review of Methods and Applications

Add code
May 07, 2025
Viaarxiv icon

Bayesian Reasoning Enabled by Spin-Orbit Torque Magnetic Tunnel Junctions

Add code
Apr 11, 2025
Viaarxiv icon

DR-Encoder: Encode Low-rank Gradients with Random Prior for Large Language Models Differentially Privately

Add code
Dec 22, 2024
Figure 1 for DR-Encoder: Encode Low-rank Gradients with Random Prior for Large Language Models Differentially Privately
Figure 2 for DR-Encoder: Encode Low-rank Gradients with Random Prior for Large Language Models Differentially Privately
Figure 3 for DR-Encoder: Encode Low-rank Gradients with Random Prior for Large Language Models Differentially Privately
Figure 4 for DR-Encoder: Encode Low-rank Gradients with Random Prior for Large Language Models Differentially Privately
Viaarxiv icon

Improving Sequential Recommender Systems with Online and In-store User Behavior

Add code
Dec 03, 2024
Figure 1 for Improving Sequential Recommender Systems with Online and In-store User Behavior
Figure 2 for Improving Sequential Recommender Systems with Online and In-store User Behavior
Figure 3 for Improving Sequential Recommender Systems with Online and In-store User Behavior
Figure 4 for Improving Sequential Recommender Systems with Online and In-store User Behavior
Viaarxiv icon

RiTeK: A Dataset for Large Language Models Complex Reasoning over Textual Knowledge Graphs

Add code
Oct 17, 2024
Figure 1 for RiTeK: A Dataset for Large Language Models Complex Reasoning over Textual Knowledge Graphs
Figure 2 for RiTeK: A Dataset for Large Language Models Complex Reasoning over Textual Knowledge Graphs
Figure 3 for RiTeK: A Dataset for Large Language Models Complex Reasoning over Textual Knowledge Graphs
Figure 4 for RiTeK: A Dataset for Large Language Models Complex Reasoning over Textual Knowledge Graphs
Viaarxiv icon

Triple Modality Fusion: Aligning Visual, Textual, and Graph Data with Large Language Models for Multi-Behavior Recommendations

Add code
Oct 16, 2024
Figure 1 for Triple Modality Fusion: Aligning Visual, Textual, and Graph Data with Large Language Models for Multi-Behavior Recommendations
Figure 2 for Triple Modality Fusion: Aligning Visual, Textual, and Graph Data with Large Language Models for Multi-Behavior Recommendations
Figure 3 for Triple Modality Fusion: Aligning Visual, Textual, and Graph Data with Large Language Models for Multi-Behavior Recommendations
Figure 4 for Triple Modality Fusion: Aligning Visual, Textual, and Graph Data with Large Language Models for Multi-Behavior Recommendations
Viaarxiv icon

Iter-AHMCL: Alleviate Hallucination for Large Language Model via Iterative Model-level Contrastive Learning

Add code
Oct 16, 2024
Viaarxiv icon